onnxruntime/js/web/lib/wasm/jsep
Jiajia Qin 7be006c466
[js/webgpu] Optimize convtranspose (#23302)
### Description
<!-- Describe your changes. -->
BUG #23273

With this change, I see the convTranspose time in that bug becomes ~7s
from ~90s on my Meteor Lake.

This PR does below things:
1. Use stride to update the increasement in the loop.
In the bug, the stride is 1024, which can greatly reduce the loop times.
2. Support components for A to reduce the memory access times.
3. When output channels is 1, the b components can be same with A to
further reduce the memory access times.
2025-01-09 11:24:42 -08:00
..
webgpu [js/webgpu] Optimize convtranspose (#23302) 2025-01-09 11:24:42 -08:00
webnn [WebNN] Fixes MLTensor caching across different contexts (#23100) 2024-12-17 12:51:16 -08:00
backend-webgpu.ts [JS/WebGPU] Creating devices with subgroup features enabled if possible (#21833) 2024-11-07 02:13:40 -08:00
backend-webnn.ts [WebNN] Fixed WebNN Module undefined issue (#22795) 2024-11-11 21:31:24 -08:00
init.ts [JS/WebGPU] Creating devices with subgroup features enabled if possible (#21833) 2024-11-07 02:13:40 -08:00
log.ts [js] change default formatter for JavaScript/TypeScript from clang-format to Prettier (#21728) 2024-08-14 16:51:22 -07:00
tensor-view.ts [js/webgpu] support float16 for Clip (#21584) 2024-08-28 13:19:20 -07:00
util.ts [JS/WebGPU] Support WASM64 (#21836) 2024-10-24 20:21:51 -07:00