onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-05-26 22:35:43 +00:00

History

Xiaoyu c8676ffbff Add ModelProto support for quantize api (#20018 ) ### Description Add ModelProto support for `quantize` api ### Motivation and Context Currently, the `quantize` API only accepts a model path as the input model. However, for large models, saving and loading from disk can be time-consuming. By adding `ModelProto` as an input option to the `quantize` API, significant time can be saved.		2024-03-27 10:40:08 -07:00
..
contrib_ops	Packed QKV and Rotary Embedding Support for sm<80 GQA (#20012 )	2024-03-23 14:30:35 -07:00
core	[TensorRT EP] Fix concurrency issue for TRT custom op list (#20093 )	2024-03-26 21:20:14 -07:00
python	Add ModelProto support for quantize api (#20018 )	2024-03-27 10:40:08 -07:00
test	Pin Onnx Version (#20073 )	2024-03-26 17:59:46 +08:00
tool/etw
wasm	[js/web] rewrite backend resolve to allow multiple EPs (#19735 )	2024-03-15 11:47:45 -07:00
__init__.py	[ORT 1.17.0 release] Bump up version to 1.18.0 (#19170 )	2024-01-17 11:18:32 -08:00
ReformatSource.ps1
ReformatSourcePython.bat
VSCodeCoverage.runsettings