onnxruntime/docs
Viswanath Boga ad9d2e2e89
Prefix match in first iteration of beam search OP (#10231)
* Add BeamSearch op schema

* Add ONNX conversion for beams search

* remove attention_mask and change input order

* add option to run baseline

* add check data type NULL

* applies VerifyNodeAndOpMatch to subgraph

* update input_ids shape

* Add node name for Cast node

* expose API for topk

* parse parameters

* Add beam search scorer

* output results

* fix typo

* use c++ template and format python

* fix build pipeline errors

* symbolic shape infer of input onnx

* output scores

* add kernel def hash

* Handle vocab_mask; move CheckSubgraph

* undo insert_cast_transformer.cc and fusion_utils.py

* fix typo

* fix merge

* update doc

* add repetition penalty

* refactoring: add GptSubgraph class

* move BeamSearchState from .h to .cc file

* adjust logits processor order

* add batch generation example

* fix repetition penalty for dup words in sequence

* Add test

* Add no repeat ngram processor

* refactoring: move logits processor to classes

* fix build warning

* show latency

* use allocator in beam state

* use allocator in sequences

* fix build error

* move next_positions to beam state

* Changes for prefix matching

* removing debugs

* removing more debugs

* clean up

* clean up

* cpu doc updated

* Updated docs

* updated prefix_vocab_mask dimension in convert script

* changes to support bxs prefix_vocab_mask in beamsearchop kernel

* doc update

* OperatorKernels.md updated

* matching docs from artifacts

* minor change in logits processor

* Addressing comments

* Updated the prefix vocab mask usage properly

Co-authored-by: Tianlei Wu <tlwu@microsoft.com>
2022-02-03 00:14:39 +05:30
..
c_cxx Document C/C++ API documentation version info conventions. (#10396) 2022-01-27 10:20:13 -08:00
execution_providers/images Remove docs that have been migrated to https://onnxruntime.ai/docs (#6225) 2021-02-05 18:09:27 -08:00
images API Documentation (#8948) 2021-09-09 22:04:51 -07:00
python Standalone TVM Executor Provider (#10019) 2021-12-15 16:59:20 -08:00
ABI_Dev_Notes.md Fix some typos. (#3582) 2020-04-18 14:18:05 -07:00
Android_testing.md Removed BUILD.md from master as source now lives in gh-pages (#6709) 2021-02-19 11:34:21 -08:00
C_API_Guidelines.md Add C API Guidelines document (#5686) 2020-11-04 18:50:31 -08:00
cmake_guideline.md Add a doc for cmake (#1524) 2019-08-06 07:51:53 -07:00
Coding_Conventions_and_Standards.md Add abseil cgmanifest declaration. Update coding standards. (#10374) 2022-01-27 08:32:05 -08:00
ContribOperators.md Prefix match in first iteration of beam search OP (#10231) 2022-02-03 00:14:39 +05:30
FAQ.md Add FAQ page (#3324) 2020-05-06 15:43:32 -07:00
How_To_Update_ONNX_Dev_Notes.md Remove onnxruntime/core/protobuf (#8617) 2021-08-10 09:36:27 -07:00
Model_Test.md Renaming MKL-DNN as DNNL (#2515) 2019-12-03 07:34:23 -08:00
NotesOnThreading.md Support multi-loop parallel sections, use multi-loop sections in GRU (#5602) 2020-11-10 12:24:57 +00:00
ONNX_Runtime_Server_Usage.md Update docs/ONNX_Runtime_Server_Usage.md (#7818) 2021-05-26 16:17:20 -07:00
onnxruntime_dependencies.dot Update dependencies graph 2020-04-17 07:38:45 -07:00
onnxruntime_dependencies.png Update dependencies graph 2020-04-17 07:38:45 -07:00
onnxruntime_extensions.md Enable linking in exception throwing support library when build onnxruntime wasm. (#8973) 2021-09-10 22:09:16 +08:00
OperatorKernels.md Prefix match in first iteration of beam search OP (#10231) 2022-02-03 00:14:39 +05:30
ORTMobilePackageOperatorTypeSupport.md Update doc for operators/opsets supported by mobile package (#9899) 2021-12-02 13:51:22 +10:00
PR_Guidelines.md Add guidelines for writing a good PR. (#3830) 2020-05-05 16:28:21 -07:00
Privacy.md [C# and Python APIs] Expose knobs to enable/disable platform telemetry collection (#5481) 2020-10-21 10:32:13 -07:00
Python_Dev_Notes.md Changes related to the release binaries requiring Visual C++ 2019 runtime (#3871) 2020-05-12 17:07:06 -07:00
Reduced_Operator_Kernel_build.md Move reduced ops files into build directory (#10030) 2021-12-28 19:04:20 -08:00
ReleaseManagement.md Updated TPN for OpenMPI and cleanup (#3932) 2020-05-14 11:42:44 -07:00
Roadmap.md Doc updates for 1.5 (#5302) 2020-09-30 09:53:33 -07:00
Server.md Update documentation for contributing a PR and add deprecation notices for PyOp and ORT server. (#6172) 2020-12-18 02:00:42 -08:00
STVM_EP.md STVM, NUPHAR, remove tvm from submodules list, checks pointers are not null. (#10211) 2022-01-27 20:31:13 +01:00
Versioning.md Bump master version to 1.11 (#9957) 2021-12-14 23:32:06 -08:00
WinML_principles.md Winml_principles_change (#5727) 2020-11-12 10:39:24 -08:00