onnxruntime/js/web/test
Jiajia Qin 25f427466e
[js/webgpu] Optimize ConvTranspose (Continue) (#23429)
BUG #23273

This PR does below optimizations:
1. When output channels is one, 1) calculate the offset before the
inchannel loop to reduce indices to offsets calculation, 2) split the
`inputChannelsPerGroup` into `inputChannelsPerGroupInt` and
`inputChannelsRemainder` parts so that we can always access 4 data for
`inputChannelsPerGroupInt`.
2. Use precise initial value to reduce useless loop iterations. Thanks
@jiangzhaoming 's suggestion's on this.

With this PR, ConvTranspose becomes 3.7s from 8.4s on Intel Meteor Lake.
On NV RTX 2000 Ada, it becomes 1.6s from 2.7s.
2025-01-22 08:59:17 -08:00
..
data/ops [js/webgpu] Optimize ConvTranspose (Continue) (#23429) 2025-01-22 08:59:17 -08:00
e2e Bump vite from 6.0.7 to 6.0.11 in /js/web/test/e2e/exports/testcases/vite-default (#23446) 2025-01-21 17:18:39 -08:00
unittests [js] change default formatter for JavaScript/TypeScript from clang-format to Prettier (#21728) 2024-08-14 16:51:22 -07:00
op-test-schema.json [js/web] Add support for int4/uint4 tensor (#21720) 2024-08-15 21:32:10 -07:00
suite-test-list.jsonc [js/webgpu] Add GatherND (#22847) 2024-12-04 09:57:32 -08:00
test-main.ts [js/webgpu] Manage model download with a specific unittest option (#22214) 2024-09-30 18:27:43 -07:00
test-runner.ts [js/web] Update API for ort.env.webgpu (#23026) 2024-12-11 10:24:14 -08:00
test-shared.ts [js] change default formatter for JavaScript/TypeScript from clang-format to Prettier (#21728) 2024-08-14 16:51:22 -07:00
test-types.ts [js/webgpu] Manage model download with a specific unittest option (#22214) 2024-09-30 18:27:43 -07:00