onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-06-25 02:50:42 +00:00

Jiajia Qin 7e0dd9d433 [js/webgpu] Optimize Expand (#22752)	Use components = 4 if possible. llama3.2-1B improves from 18 tokens/s to 20 tokens/s on my iGPUs.	2024-11-12 12:37:19 -08:00
webgpu	[js/webgpu] Optimize Expand (#22752)	2024-11-12 12:37:19 -08:00
webnn	[WebNN EP] Fix issues with MLTensor caching (#22701)	2024-11-06 09:17:11 -08:00
backend-webgpu.ts	[JS/WebGPU] Creating devices with subgroup features enabled if possible (#21833)	2024-11-07 02:13:40 -08:00
backend-webnn.ts	[WebNN] Fixed WebNN Module undefined issue (#22795)	2024-11-11 21:31:24 -08:00
init.ts	[JS/WebGPU] Creating devices with subgroup features enabled if possible (#21833)	2024-11-07 02:13:40 -08:00
log.ts	[js] change default formatter for JavaScript/TypeScript from clang-format to Prettier (#21728)	2024-08-14 16:51:22 -07:00
tensor-view.ts	[js/webgpu] support float16 for Clip (#21584)	2024-08-28 13:19:20 -07:00
util.ts	[JS/WebGPU] Support WASM64 (#21836)	2024-10-24 20:21:51 -07:00