onnxruntime/tools
Hector Li 385fab5bae
[QNN EP] Qnn cache improvement (#17757)
### Description
Improve the QNN context binary cache feature to reduce memory
overhead and initialization time.
Instead of dumping a QNN context binary file with metadata as a header, we
dump an ONNX-format file with the metadata stored inside an ONNX node.

### Motivation and Context
Reduce the memory overhead and initialization time of the QNN context binary cache.
2023-10-06 15:56:33 -07:00
android_custom_build [Linter] Bump ruff and remove pylint (#17797) 2023-10-05 21:07:33 -07:00
ci_build [QNN EP] Qnn cache improvement (#17757) 2023-10-06 15:56:33 -07:00
doc Disable PERF* rules in ruff to allow better readability (#16834) 2023-07-25 15:38:22 -07:00
nuget [TensorRT EP] Refactor OrtTensorRTProviderOptions initialization and make it easy to add new field (#17617) 2023-10-06 14:12:20 -07:00
perf_view fixed #16873 (#16932) 2023-09-26 09:57:01 -07:00
python [Linter] Bump ruff and remove pylint (#17797) 2023-10-05 21:07:33 -07:00
scripts Remove dnf update from docker build scripts (#17551) 2023-09-21 07:33:29 -07:00