onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-05 04:17:53 +00:00


Jiajia Qin 25f427466e [js/webgpu] Optimize ConvTranspose (Continue) (#23429)		2025-01-22 08:59:17 -08:00

BUG #23273

This PR makes the following optimizations:
1. When the number of output channels is one: 1) calculate the offset before the input-channel loop to reduce indices-to-offsets calculations; 2) split `inputChannelsPerGroup` into `inputChannelsPerGroupInt` and `inputChannelsRemainder` parts so that the `inputChannelsPerGroupInt` part can always be accessed 4 values at a time.
2. Use a precise initial value to reduce useless loop iterations. Thanks to @jiangzhaoming for the suggestion on this.

With this PR, ConvTranspose goes from 8.4s to 3.7s on Intel Meteor Lake, and from 2.7s to 1.6s on NV RTX 2000 Ada.
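
The channel split described in the commit message can be illustrated with a small CPU-side sketch. This is not the shader code from the PR: the helper names `splitChannels` and `dotChannels` are hypothetical, and it assumes that `inputChannelsPerGroupInt` is the largest multiple of 4 not exceeding `inputChannelsPerGroup`, with `inputChannelsRemainder` covering the leftover channels.

```typescript
// Hypothetical helper (not from the PR): split a channel count into a part
// that is a multiple of 4 and a scalar remainder. The multiple-of-4 rule is
// an assumption based on the commit message's "always access 4 data".
function splitChannels(inputChannelsPerGroup: number): {
  inputChannelsPerGroupInt: number;
  inputChannelsRemainder: number;
} {
  const inputChannelsRemainder = inputChannelsPerGroup % 4;
  const inputChannelsPerGroupInt = inputChannelsPerGroup - inputChannelsRemainder;
  return { inputChannelsPerGroupInt, inputChannelsRemainder };
}

// Hypothetical illustration: accumulate x[c] * w[c] over all channels,
// handling the multiple-of-4 part four values at a time and the
// remainder one value at a time.
function dotChannels(x: number[], w: number[]): number {
  const { inputChannelsPerGroupInt, inputChannelsRemainder } = splitChannels(x.length);
  let sum = 0;
  // Main loop: every iteration touches exactly 4 channels, so no bounds
  // check is needed inside the group of 4.
  for (let c = 0; c < inputChannelsPerGroupInt; c += 4) {
    sum += x[c] * w[c] + x[c + 1] * w[c + 1] + x[c + 2] * w[c + 2] + x[c + 3] * w[c + 3];
  }
  // Tail loop: at most 3 leftover channels.
  const end = inputChannelsPerGroupInt + inputChannelsRemainder;
  for (let c = inputChannelsPerGroupInt; c < end; c++) {
    sum += x[c] * w[c];
  }
  return sum;
}
```

In a shader, the main loop would typically map to 4-wide vector loads, which is presumably why the split helps; the sketch only shows the loop structure.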
data/ops	[js/webgpu] Optimize ConvTranspose (Continue) (#23429)	2025-01-22 08:59:17 -08:00
e2e	Bump vite from 6.0.7 to 6.0.11 in /js/web/test/e2e/exports/testcases/vite-default (#23446)	2025-01-21 17:18:39 -08:00
unittests	[js] change default formatter for JavaScript/TypeScript from clang-format to Prettier (#21728)	2024-08-14 16:51:22 -07:00
op-test-schema.json	[js/web] Add support for int4/uint4 tensor (#21720)	2024-08-15 21:32:10 -07:00
suite-test-list.jsonc	[js/webgpu] Add GatherND (#22847)	2024-12-04 09:57:32 -08:00
test-main.ts	[js/webgpu] Manage model download with a specific unittest option (#22214)	2024-09-30 18:27:43 -07:00
test-runner.ts	[js/web] Update API for `ort.env.webgpu` (#23026)	2024-12-11 10:24:14 -08:00
test-shared.ts	[js] change default formatter for JavaScript/TypeScript from clang-format to Prettier (#21728)	2024-08-14 16:51:22 -07:00
test-types.ts	[js/webgpu] Manage model download with a specific unittest option (#22214)	2024-09-30 18:27:43 -07:00