onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-09 17:28:58 +00:00

History

Hector Li 4324d2173b [QNN EP] Enable Qnn context cache to save model initialization time (#15815 ) ### Description Enable Qnn Context cache feature to save model initialization time Provider options: qnn_context_cache_enable\|1 to enable the cache feature qnn_context_cache_path to set the cache path. It is set to model_file.onnx.bin by default. ### Motivation and Context Model initialization time takes long because the cost of conversion from Onnx model to Qnn model. Qnn have feature to serialize the Qnn context to file, then next time user can load it from the cache context and execute the graph to save the cost. --------- Co-authored-by: Adrian Lizarraga <adlizarraga@microsoft.com>		2023-05-19 10:52:17 -07:00
..
android_custom_build	Update cmake version in a few places (#15775 )	2023-05-02 22:56:28 -07:00
ci_build	[QNN EP] Enable Qnn context cache to save model initialization time (#15815 )	2023-05-19 10:52:17 -07:00
doc	Python documentation for onnxruntime-training (#15765 )	2023-05-02 16:58:16 -07:00
nuget	[DML EP] Update DirectML version to 1.12.0 (#16011 )	2023-05-18 19:37:12 -07:00
perf_view	fix json format (#11046 )	2022-03-30 16:15:33 -07:00
python	Update VERSION_NUMBER (#15773 )	2023-05-03 15:07:34 -07:00