onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-05-18 21:21:17 +00:00

History

Hector Li 4324d2173b [QNN EP] Enable Qnn context cache to save model initialization time (#15815 ) ### Description Enable Qnn Context cache feature to save model initialization time Provider options: qnn_context_cache_enable\|1 to enable the cache feature qnn_context_cache_path to set the cache path. It is set to model_file.onnx.bin by default. ### Motivation and Context Model initialization time takes long because the cost of conversion from Onnx model to Qnn model. Qnn have feature to serialize the Qnn context to file, then next time user can load it from the cache context and execute the graph to save the cost. --------- Co-authored-by: Adrian Lizarraga <adlizarraga@microsoft.com>		2023-05-19 10:52:17 -07:00
..
common	Run clang-format in CI (#15524 )	2023-04-18 09:26:58 -07:00
eager	Run clang-format in CI (#15524 )	2023-04-18 09:26:58 -07:00
framework	Remove onnxruntime_PYBIND_EXPORT_OPSCHEMA definition from onnxruntime (#15776 )	2023-05-03 13:08:35 -07:00
graph	Support WebNN EP (#15698 )	2023-05-08 21:25:10 -07:00
optimizer	fix compilation error in no absl build (#15769 )	2023-05-02 08:20:49 -07:00
platform	Implement mutex-free spin lock for task queue (#14834 )	2023-05-19 10:12:10 -07:00
providers	fix: setting builder optimization level to TRT 8.6 default (#15897 )	2023-05-12 13:29:30 -07:00
session	[QNN EP] Enable Qnn context cache to save model initialization time (#15815 )	2023-05-19 10:52:17 -07:00