onnxruntime/js/web/lib/wasm/jsep/webgpu/ops/3rd-party
Jiajia Qin 7be006c466
[js/webgpu] Optimize convtranspose (#23302)
### Description
<!-- Describe your changes. -->
BUG #23273

With this change, I see the convTranspose time in that bug becomes ~7s
from ~90s on my Meteor Lake.

This PR does below things:
1. Use stride to update the increasement in the loop.
In the bug, the stride is 1024, which can greatly reduce the loop times.
2. Support components for A to reduce the memory access times.
3. When output channels is 1, the b components can be same with A to
further reduce the memory access times.
2025-01-09 11:24:42 -08:00
..
activation_util.ts [js/webgpu] Fix conv2d with activation (#18388) 2023-11-10 12:54:35 -08:00
conv2d_mm_webgpu.ts [js/webgpu] fix Conv2DMatMul shader's out-of-bound read (#23085) 2024-12-12 11:33:53 -08:00
conv3d_naive_webgpu.ts [js] change default formatter for JavaScript/TypeScript from clang-format to Prettier (#21728) 2024-08-14 16:51:22 -07:00
conv_backprop_mm_webgpu.ts [js] change default formatter for JavaScript/TypeScript from clang-format to Prettier (#21728) 2024-08-14 16:51:22 -07:00
conv_backprop_webgpu.ts [js/webgpu] Optimize convtranspose (#23302) 2025-01-09 11:24:42 -08:00
conv_util.ts [js] change default formatter for JavaScript/TypeScript from clang-format to Prettier (#21728) 2024-08-14 16:51:22 -07:00
matmul_packed_webgpu.ts WebGPU JSEP: Make shader code not depend on input broadcasting patterns (#22536) 2024-11-08 11:00:51 -08:00