onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-05 04:17:53 +00:00

Hector Li 190588bb64 Enable QNN weight sharing (#21077). Description: Enable QNN weight sharing across graphs in a single context. Create a tool to generate a QNN context cache model with weight sharing enabled.	2024-09-04 11:20:33 -07:00
common	revert forceinline for MakeString (#21943)	2024-09-02 19:01:08 -07:00
eager	Fix typos - 1st Wave (#21278)	2024-07-11 13:35:08 +08:00
framework	Introduce custom external data loader (#21634)	2024-08-27 12:18:52 -07:00
graph	Memory Optimization for Compilation in OVEP (#21872)	2024-09-03 13:52:31 -07:00
optimizer	Utilize ext data location to reduce qd matmul memory usage (#21451)	2024-07-30 15:22:46 -07:00
platform	Update ruff and clang-format versions (#21479)	2024-07-24 11:50:11 -07:00
providers	[TensorRT EP] No workspace size limit to TRT memory pool (#21643)	2024-08-09 17:30:51 -07:00
session	Enable QNN weight sharing (#21077)	2024-09-04 11:20:33 -07:00