onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-02 03:55:34 +00:00

History

Enrico Galli 1e5bda88f0 [WebNN EP] Cache MLTensors between runs (#22278 ) ### Description This change enables caching `MLTensor`s between inferences runs. This is done by keeping a reference to `MLTensor`s alive after they have been released. `MLTensor`s are only destroyed once the sessions goes out of scope. ### Motivation and Context Creating and destroying `MTensor`s on every run has a non-trivial performance penalty. This performance penalty materializes when using `ort.Tensors`[location=cpu] for inputs/outputs or when using the CPU EP as a fallback EP for unsupported operators. The former could be mitigated by developer using `ort.Tensors`[location=ml-tensor]. The latter cannot be mitigated by developers.		2024-10-18 08:07:00 -07:00
..
onnxjs	[js] change default formatter for JavaScript/TypeScript from clang-format to Prettier (#21728 )	2024-08-14 16:51:22 -07:00
wasm	[WebNN EP] Cache MLTensors between runs (#22278 )	2024-10-18 08:07:00 -07:00
backend-onnxjs.ts	[js] change default formatter for JavaScript/TypeScript from clang-format to Prettier (#21728 )	2024-08-14 16:51:22 -07:00
backend-wasm.ts	[js/web] remove training release (#22103 )	2024-09-16 10:56:22 -07:00
build-def.d.ts	[js/web] allow build target for non dynamic import (#20898 )	2024-06-03 12:33:37 -07:00
index.ts	[js/web] remove training release (#22103 )	2024-09-16 10:56:22 -07:00
version.ts	bumps up version in main from 1.20 -> 1.21 (#22482 )	2024-10-17 12:32:35 -07:00