onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-10 17:37:14 +00:00

pengwa 16f5909f2d Introduce shrunken gather operator (#15396)	2023-04-07 15:12:58 +08:00

### Introduce shrunken gather operator

The existing Gather operator schema does not guarantee that the output element count is smaller than the input element count. The output element count can be greater than, equal to, or less than the input element count. In cases where we know for certain that the output element count MUST be <= the input element count, we will upstream those Gather operators to reduce compute FLOPs.

This PR therefore introduces ShrunkenGather, which explicitly guarantees that the output count will be smaller than the input count. The operator adds an additional restriction on its inputs, but still reuses the existing Gather implementation, plus an input check at runtime.

This is a requirement for a subsequent optimization (Draft PR: https://github.com/microsoft/onnxruntime/pull/15401) that we will do for label sparsity and embedding sparsity.
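The idea in the commit message, reusing Gather and adding a runtime input check, can be illustrated with a minimal pure-Python sketch. This is not the onnxruntime implementation: the function names `gather_axis0` and `shrunken_gather_axis0` are hypothetical, the sketch only handles 1-D indices on axis 0, and the exact form of the check (index count <= size of the gathered axis) is an assumption based on the "<=" condition stated above.

```python
def gather_axis0(data, indices):
    """Plain Gather along axis 0: select rows of `data` by `indices`.

    The output may have more, the same, or fewer rows than `data`,
    because indices can repeat.
    """
    n = len(data)
    out = []
    for i in indices:
        if not -n <= i < n:
            raise IndexError(f"index {i} out of range for axis of size {n}")
        out.append(data[i])  # negative indices count from the end
    return out


def shrunken_gather_axis0(data, indices):
    """Gather plus an extra input check (hypothetical form of the restriction).

    Assumption: requiring no more indices than rows on the gathered axis
    is enough to ensure the output element count is <= the input's.
    """
    if len(indices) > len(data):
        raise ValueError(
            f"expected at most {len(data)} indices, got {len(indices)}"
        )
    return gather_axis0(data, indices)  # reuse the plain Gather logic


rows = [[1, 2], [3, 4], [5, 6]]
print(shrunken_gather_axis0(rows, [2, 0]))  # two rows selected out of three
print(len(gather_axis0(rows, [0, 0, 1, 1])))  # plain Gather can grow the output
```

With the check in place, a graph optimizer can rely on the output never being larger than the input, which is what makes moving such an operator earlier in the graph a safe way to reduce work for later nodes.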
contrib_ops	Introduce shrunken gather operator (#15396)	2023-04-07 15:12:58 +08:00
core	Introduce shrunken gather operator (#15396)	2023-04-07 15:12:58 +08:00
python	Reuse QDQConv for ConvTranspose to generate the QDQ model (#15385)	2023-04-06 15:07:44 -07:00
test	Introduce shrunken gather operator (#15396)	2023-04-07 15:12:58 +08:00
tool/etw
wasm	[js/web] support flag 'optimizedModelFilePath' in session options (#14355)	2023-02-24 15:50:15 -08:00
__init__.py	Adopt linrtunner as the linting tool - take 2 (#15085)	2023-03-24 15:29:03 -07:00
ReformatSource.ps1
ReformatSourcePython.bat	Add python docstring linting in vscode settings (#11316)	2022-04-23 06:23:04 -07:00
VSCodeCoverage.runsettings