onnxruntime/docs
Bowen Bao e983f37121
Bifurcation detector for aggressive decoding (#9432)
```
Component for aggressive decoding. Find the bifurcation index of the predicted tokens by comparing them
with the source tokens, starting from the previous suffix match index.
Concat predicted tokens, starting from bifurcation index, to the back
of current tokens. This forms the output tokens.
Detect the suffix match index in the source tokens by comparing the source tokens with the output tokens.
Detection is based on finding occurrences of the last n-gram of the output tokens
in the source tokens.
A match is considered found if the source tokens contain exactly one matching n-gram.
Return the index of the start of the n-gram in the source tokens.
No match is found if the source tokens contain multiple or zero matching n-grams. Return -1.
```
2021-10-19 19:53:56 -07:00
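The two detection steps described in the commit message above can be sketched in plain Python. This is a minimal illustration written from the commit text alone, not the operator's kernel: the function names `find_bifurcation_index` and `find_suffix_match` and the `ngram_size` parameter are hypothetical, and the concatenation step is left out because the message does not pin down the exact slice of predicted tokens that gets appended.

```python
from typing import Sequence


def find_bifurcation_index(
    src_tokens: Sequence[int],
    pred_tokens: Sequence[int],
    prev_suffix_match_idx: int,
) -> int:
    """Return the first offset at which pred_tokens stops agreeing with
    src_tokens[prev_suffix_match_idx:].

    Hypothetical helper: name and signature are assumptions for illustration.
    """
    i = 0
    while (
        i < len(pred_tokens)
        and prev_suffix_match_idx + i < len(src_tokens)
        and pred_tokens[i] == src_tokens[prev_suffix_match_idx + i]
    ):
        i += 1
    return i


def find_suffix_match(
    src_tokens: Sequence[int],
    out_tokens: Sequence[int],
    ngram_size: int,
) -> int:
    """Look for the last n-gram of out_tokens inside src_tokens.

    Returns the start index of the n-gram in src_tokens when it occurs
    exactly once, and -1 when it occurs zero or multiple times.
    ngram_size is an assumed fixed parameter for this sketch.
    """
    if ngram_size <= 0 or len(out_tokens) < ngram_size:
        return -1
    ngram = list(out_tokens[-ngram_size:])
    matches = [
        start
        for start in range(len(src_tokens) - ngram_size + 1)
        if list(src_tokens[start:start + ngram_size]) == ngram
    ]
    return matches[0] if len(matches) == 1 else -1


if __name__ == "__main__":
    src = [1, 2, 3, 4, 5, 2, 3]
    # Predictions agree with src[2:] for two tokens, then diverge.
    print(find_bifurcation_index(src, [3, 4, 9], 2))
    # The bigram (4, 5) occurs once in src; (2, 3) occurs twice.
    print(find_suffix_match(src, [7, 4, 5], 2))
    print(find_suffix_match(src, [7, 2, 3], 2))
```

Requiring a unique n-gram match keeps the decoder from resuming at an ambiguous position in the source: when the same n-gram appears more than once, there is no single place to continue from, so the sketch returns -1 as the commit message describes.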
| Name | Last commit | Date |
|---|---|---|
| c_cxx | Fix S360 issue by using "use strict" for javascript code. (#9128) | 2021-09-20 20:32:44 -07:00 |
| execution_providers/images | Remove docs that have been migrated to https://onnxruntime.ai/docs (#6225) | 2021-02-05 18:09:27 -08:00 |
| images | API Documentation (#8948) | 2021-09-09 22:04:51 -07:00 |
| python | Remove tensorflow, tf2onnx from the list of dependencies for the documentation (#9221) | 2021-10-14 18:07:35 +02:00 |
| ABI_Dev_Notes.md | | |
| Android_testing.md | Removed BUILD.md from master as source now lives in gh-pages (#6709) | 2021-02-19 11:34:21 -08:00 |
| C_API_Guidelines.md | | |
| cmake_guideline.md | | |
| Coding_Conventions_and_Standards.md | Change onnxruntime::make_unique to std::make_unique (#7502) | 2021-04-29 17:04:53 -07:00 |
| ContribOperators.md | Bifurcation detector for aggressive decoding (#9432) | 2021-10-19 19:53:56 -07:00 |
| FAQ.md | | |
| How_To_Update_ONNX_Dev_Notes.md | Remove onnxruntime/core/protobuf (#8617) | 2021-08-10 09:36:27 -07:00 |
| Model_Test.md | | |
| NotesOnThreading.md | | |
| ONNX_Runtime_Server_Usage.md | Update docs/ONNX_Runtime_Server_Usage.md (#7818) | 2021-05-26 16:17:20 -07:00 |
| onnxruntime_dependencies.dot | | |
| onnxruntime_dependencies.png | | |
| onnxruntime_extensions.md | Enable linking in exception throwing support library when build onnxruntime wasm. (#8973) | 2021-09-10 22:09:16 +08:00 |
| OperatorKernels.md | Bifurcation detector for aggressive decoding (#9432) | 2021-10-19 19:53:56 -07:00 |
| ORTMobilePackageOperatorTypeSupport.md | Add supported operators/types documentation for the ORT Mobile package (#7807) | 2021-05-26 15:57:40 +10:00 |
| PR_Guidelines.md | | |
| Privacy.md | | |
| Python_Dev_Notes.md | | |
| Reduced_Operator_Kernel_build.md | Support required types when excluding typed registrations (#6871) | 2021-03-08 08:22:07 -08:00 |
| ReleaseManagement.md | | |
| Roadmap.md | | |
| Server.md | Update documentation for contributing a PR and add deprecation notices for PyOp and ORT server. (#6172) | 2020-12-18 02:00:42 -08:00 |
| Versioning.md | Bumping up to 1.10 (#9006) | 2021-09-22 16:34:28 -07:00 |
| WinML_principles.md | | |