onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-11 17:48:34 +00:00

Maximilian Müller ad4db12699 TensorRT EP - timing cache (#14767 )	2023-03-10 09:02:27 -08:00

### Description

This enables a user to use a TensorRT timing cache, based on #10297, to accelerate build times on a device with the same compute capability. The cache works across models because it simply stores kernel runtimes for specific configurations. The files are usually very small (only a few MB), which makes them easy to ship with an application to accelerate build times on the user's end.

### Motivation and Context

TRT build times can be a roadblock, especially for workstation use cases. With a few models from the ONNX model zoo, I evaluated the speedup when a timing cache is present.

`./build/onnxruntime_perf_test -e tensorrt -I -t 5 -i "trt_timing_cache_enable|true" <onnx_path>`

| Model | No cache | With cache |
| ------------- | ------------- | ------------- |
| efficientnet-lite4-11 | 34.6 s | 7.7 s |
| yolov4 | 108.62 s | 9.4 s |

To capture this I had to modify onnxruntime_perf_test. The time is sometimes not captured within "Session creation time cost:", which is why I introduced "First inference time cost:".

---------

Co-authored-by: Chi Lo <Chi.Lo@microsoft.com>
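The build times reported in the commit message can be turned into speedup factors with a few lines of arithmetic. The sketch below is illustrative only: the names `build_times`, `speedup`, and `speedups` are hypothetical and not part of ONNX Runtime, and the numbers are copied from the table in the commit message rather than measured again.

```python
# Build times in seconds, copied from the table in the commit message:
# (time without timing cache, time with timing cache).
build_times = {
    "efficientnet-lite4-11": (34.6, 7.7),
    "yolov4": (108.62, 9.4),
}


def speedup(no_cache_s, with_cache_s):
    """Return how many times faster the build is when a timing cache is present."""
    return no_cache_s / with_cache_s


# Hypothetical helper result: model name -> speedup factor.
speedups = {model: speedup(*times) for model, times in build_times.items()}

for model, factor in speedups.items():
    print(f"{model}: {factor:.1f}x faster with a timing cache")
```

For the two reported models this works out to roughly 4.5x and 11.6x. These figures come from a single set of measurements, so they indicate the order of magnitude and are not a guarantee for other models or devices.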
..
eigen@d10b27fe37	Downgrade Eigen (#8817 )	2021-08-23 18:06:23 -07:00
emsdk@0ab19024f0	[wasm] upgrade emsdk from 3.1.19 to 3.1.32 (#14818 )	2023-02-28 11:06:09 -08:00
libprotobuf-mutator@7a2ed51a6b
onnx@9b7bca2a72	to work with onnx 1.13 rc, implement ver 18 reduce and optional ops, … (#13765 )	2023-01-09 10:26:16 -08:00
onnxruntime-extensions@81e7799c69	pin ort-ext to 81e7799c69044c745239202085eb0a98f102937b (#14044 )	2023-01-10 10:10:17 -08:00
protobuf@a20c65f2cd	upgrade protobuf to 3.20.2 and onnx to 1.13 (#14279 )	2023-01-31 12:55:09 -08:00
abseil-cpp.cmake	Let Cmake decide where to place abseil (#14057 )	2022-12-23 12:08:13 -08:00
abseil-cpp.natvis	Update absl to the latest release (#13990 )	2022-12-19 14:25:13 -08:00
composable_kernel.cmake	[ROCm] Add GemmFastGelu CK implementation (#13759 )	2023-01-05 17:53:30 +08:00
cutlass.cmake	Upgrade CUTLASS to v2.11 and add sequence length threshold for cutlass FMHA (#14401 )	2023-01-25 09:43:48 -08:00
dml.cmake	[DML EP] Upgrade DML to 1.10.1 (#14433 )	2023-01-25 21:07:10 -08:00
dnnl.cmake	[oneDNN] Update to oneDNN v3.0 (#14267 )	2023-02-17 09:56:29 -08:00
eigen.cmake	Improve dependency management (#13523 )	2022-12-01 09:51:59 -08:00
extensions.cmake	Migrating ORT Extensions from Git submodule to cmake FetchContent (#14298 )	2023-02-22 19:42:36 -08:00
find_snpe.cmake	Improve dependency management (#13523 )	2022-12-01 09:51:59 -08:00
FindNumPy.cmake
helper_functions.cmake	Enable cache for msbuild (#14085 )	2023-01-06 11:19:57 +08:00
ipp-crypto.cmake	[TVM EP] Hot fix of build on Windows of TVM EP with ipp-crypto (#12381 )	2022-07-31 14:36:54 +02:00
mimalloc.cmake	Improve dependency management (#13523 )	2022-12-01 09:51:59 -08:00
onnx_minimal.cmake	Improve dependency management (#13523 )	2022-12-01 09:51:59 -08:00
onnx_protobuf.natvis	Natvis adjustments to make debugging bearable (#13237 )	2022-10-10 10:06:55 -07:00
onnxruntime_external_deps.cmake	TensorRT EP - timing cache (#14767 )	2023-03-10 09:02:27 -08:00
protobuf_function.cmake	Improve dependency management (#13523 )	2022-12-01 09:51:59 -08:00
pybind11.cmake	Improve dependency management (#13523 )	2022-12-01 09:51:59 -08:00
pyxir.cmake	Check for Python_EXECUTABLE in pyxir.cmake to fix Vitis AI EP build (#8631 )	2021-08-24 08:39:50 -07:00
triton.cmake	CloudEP (#13855 )	2023-01-03 10:03:15 -08:00
tvm.cmake	[TVM EP] Support zero copying TVM EP output tensor to ONNX Runtime output tensor (#12593 )	2023-02-08 10:02:20 -08:00
wil.cmake	Improve dependency management (#13523 )	2022-12-01 09:51:59 -08:00
xnnpack.cmake	Improve dependency management (#13523 )	2022-12-01 09:51:59 -08:00