onnxruntime/js/web/lib/wasm
Jiajia Qin b30e721dc8
[js/webgpu] Provide a naive vectorized matmul algorithm (#18758)
### Description
This PR provided a vectorized matmul algorithm. In most situations, we
still go to the workgroup memory optimized matmul. But for some
situations, like N and K are very small, using workgroup optimized
matmul can't fully utilize the underlying hardware due to the 32x32 tile
size. So for very small N/K, we switch to the naive vectorized matmul
algorithm to improve the hardware execution unit usage.

With this PR, matmul with input0: [1, 36864, 3], input1: [1, 3, 3],
input2: [3] becomes less than 1 ms from 4.34 ms on Intel Gen9 GPUs.
2023-12-13 09:03:23 -08:00
..
binding [js/web/training] Add CreateTrainingSession (#17891) 2023-10-26 09:22:10 -07:00
jsep [js/webgpu] Provide a naive vectorized matmul algorithm (#18758) 2023-12-13 09:03:23 -08:00
proxy-worker [js/web] set noUnusedParameters to true and fix a few bugs (#18404) 2023-11-15 09:16:29 -08:00
proxy-messages.ts [js/web/training] Add CreateTrainingSession (#17891) 2023-10-26 09:22:10 -07:00
proxy-wrapper.ts [js/web/training] Add CreateTrainingSession (#17891) 2023-10-26 09:22:10 -07:00
run-options.ts [js/web] enable ONNX Runtime Web error messages in JS (#16335) 2023-06-15 09:45:41 -07:00
session-handler-inference.ts [js/web/training] runTrainStep implementation (#18006) 2023-11-02 08:32:50 -07:00
session-handler-training.ts [js/web/training] lazyResetGrad implementation (#18711) 2023-12-11 17:36:54 -08:00
session-options.ts [WebNN EP] Support numThreads option for WebNN CPU device (#18054) 2023-11-12 16:45:10 -08:00
wasm-common.ts [js/webgpu] support IO binding (#17480) 2023-09-29 11:24:42 -07:00
wasm-core-impl.ts [js/web/training] runTrainStep implementation (#18006) 2023-11-02 08:32:50 -07:00
wasm-factory.ts Add "glue" between training WASM artifacts and training web (#17474) 2023-10-12 11:16:56 -07:00
wasm-training-core-impl.ts [js/web/training] lazyResetGrad implementation (#18711) 2023-12-11 17:36:54 -08:00
wasm-utils.ts [js/web] enable ONNX Runtime Web error messages in JS (#16335) 2023-06-15 09:45:41 -07:00