onnxruntime/include/onnxruntime/core
Hector Li 4324d2173b
[QNN EP] Enable Qnn context cache to save model initialization time (#15815)
### Description
Enable Qnn Context cache feature to save model initialization time
Provider options:
qnn_context_cache_enable|1 to enable the cache feature
qnn_context_cache_path to set the cache path. It is set to model_file.onnx.bin by default.

### Motivation and Context
Model initialization time takes long because the cost of conversion from Onnx model to Qnn model. Qnn have feature to serialize the Qnn context to file, then next time user can load it from the cache context and execute the graph to save the cost.

---------

Co-authored-by: Adrian Lizarraga <adlizarraga@microsoft.com>
2023-05-19 10:52:17 -07:00
..
common Run clang-format in CI (#15524) 2023-04-18 09:26:58 -07:00
eager Run clang-format in CI (#15524) 2023-04-18 09:26:58 -07:00
framework Remove onnxruntime_PYBIND_EXPORT_OPSCHEMA definition from onnxruntime (#15776) 2023-05-03 13:08:35 -07:00
graph Support WebNN EP (#15698) 2023-05-08 21:25:10 -07:00
optimizer fix compilation error in no absl build (#15769) 2023-05-02 08:20:49 -07:00
platform Implement mutex-free spin lock for task queue (#14834) 2023-05-19 10:12:10 -07:00
providers fix: setting builder optimization level to TRT 8.6 default (#15897) 2023-05-12 13:29:30 -07:00
session [QNN EP] Enable Qnn context cache to save model initialization time (#15815) 2023-05-19 10:52:17 -07:00