onnxruntime/js/web/lib/wasm
Jiajia Qin 25f427466e
[js/webgpu] Optimize ConvTranspose (Continue) (#23429)
BUG #23273

This PR does below optimizations:
1. When output channels is one, 1) calculate the offset before the
inchannel loop to reduce indices to offsets calculation, 2) split the
`inputChannelsPerGroup` into `inputChannelsPerGroupInt` and
`inputChannelsRemainder` parts so that we can always access 4 data for
`inputChannelsPerGroupInt`.
2. Use precise initial value to reduce useless loop iterations. Thanks
@jiangzhaoming 's suggestion's on this.

With this PR, ConvTranspose becomes 3.7s from 8.4s on Intel Meteor Lake.
On NV RTX 2000 Ada, it becomes 1.6s from 2.7s.
2025-01-22 08:59:17 -08:00
..
jsep [js/webgpu] Optimize ConvTranspose (Continue) (#23429) 2025-01-22 08:59:17 -08:00
proxy-worker
proxy-messages.ts
proxy-wrapper.ts [js/web] fix package export for bundlers (#23257) 2025-01-09 11:01:00 -08:00
run-options.ts
session-handler-inference.ts
session-options.ts [JS/WebGPU] Support WASM64 (#21836) 2024-10-24 20:21:51 -07:00
wasm-common.ts [WebNN] Support int4 and uint4 data types (#22575) 2024-10-25 17:44:46 -07:00
wasm-core-impl.ts [js/common] use TS type inference to eliminate unknown (#23012) 2024-12-04 19:01:26 -08:00
wasm-factory.ts [js/web] fix package export for bundlers (#23257) 2025-01-09 11:01:00 -08:00
wasm-types.ts [WebNN] Fixed WebNN Module undefined issue (#22795) 2024-11-11 21:31:24 -08:00
wasm-utils-env.ts
wasm-utils-import.ts [js/web] fix package export for bundlers (#23257) 2025-01-09 11:01:00 -08:00
wasm-utils-load-file.ts
wasm-utils.ts [JS/WebGPU] Support WASM64 (#21836) 2024-10-24 20:21:51 -07:00