onnxruntime/onnxruntime
Hector Li 385fab5bae
[QNN EP] Qnn cache improvement (#17757)
### Description
Improve the QNN context binary cache feature to reduce memory overhead
and initialization time.
Instead of dumping a QNN context binary file with metadata as a header, we
dump an ONNX-format file with the metadata stored inside an ONNX node.

### Motivation and Context
Reduce the memory overhead and initialization time of the QNN context binary cache.
2023-10-06 15:56:33 -07:00
contrib_ops Remove MPI dependency (#17624) 2023-10-06 15:33:18 +08:00
core [QNN EP] Qnn cache improvement (#17757) 2023-10-06 15:56:33 -07:00
python [TensorRT EP] Refactor OrtTensorRTProviderOptions initialization and make it easy to add new field (#17617) 2023-10-06 14:12:20 -07:00
test [QNN EP] Qnn cache improvement (#17757) 2023-10-06 15:56:33 -07:00
tool/etw Run clang-format in CI (#15524) 2023-04-18 09:26:58 -07:00
wasm [js/webgpu] support IO binding (#17480) 2023-09-29 11:24:42 -07:00
__init__.py Python API to check whether collective ops are available or not (#17730) 2023-09-29 14:11:05 -07:00
ReformatSource.ps1 Run clang-format in CI (#15524) 2023-04-18 09:26:58 -07:00
ReformatSourcePython.bat
VSCodeCoverage.runsettings