onnxruntime/include/onnxruntime/core
Hector Li 190588bb64
Enable QNN weight sharing (#21077)
### Description
Enable QNN weight sharing across graphs in single context
Create tool to generate QNN context cache model with weight sharing enabled.
2024-09-04 11:20:33 -07:00
..
common revert forceinline for MakeString (#21943) 2024-09-02 19:01:08 -07:00
eager Fix typos - 1st Wave (#21278) 2024-07-11 13:35:08 +08:00
framework Introduce custom external data loader (#21634) 2024-08-27 12:18:52 -07:00
graph Memory Optimization for Compilation in OVEP (#21872) 2024-09-03 13:52:31 -07:00
optimizer Utilize ext data location to reduce qd matmul memory usage (#21451) 2024-07-30 15:22:46 -07:00
platform Update ruff and clang-format versions (#21479) 2024-07-24 11:50:11 -07:00
providers [TensorRT EP] No workspace size limit to TRT memory pool (#21643) 2024-08-09 17:30:51 -07:00
session Enable QNN weight sharing (#21077) 2024-09-04 11:20:33 -07:00