onnxruntime/tools
Hector Li 4324d2173b
[QNN EP] Enable Qnn context cache to save model initialization time (#15815)
### Description
Enable Qnn Context cache feature to save model initialization time
Provider options:
qnn_context_cache_enable|1 to enable the cache feature
qnn_context_cache_path to set the cache path. It is set to model_file.onnx.bin by default.

### Motivation and Context
Model initialization time takes long because the cost of conversion from Onnx model to Qnn model. Qnn have feature to serialize the Qnn context to file, then next time user can load it from the cache context and execute the graph to save the cost.

---------

Co-authored-by: Adrian Lizarraga <adlizarraga@microsoft.com>
2023-05-19 10:52:17 -07:00
..
android_custom_build Update cmake version in a few places (#15775) 2023-05-02 22:56:28 -07:00
ci_build [QNN EP] Enable Qnn context cache to save model initialization time (#15815) 2023-05-19 10:52:17 -07:00
doc Python documentation for onnxruntime-training (#15765) 2023-05-02 16:58:16 -07:00
nuget [DML EP] Update DirectML version to 1.12.0 (#16011) 2023-05-18 19:37:12 -07:00
perf_view
python Update VERSION_NUMBER (#15773) 2023-05-03 15:07:34 -07:00