onnxruntime/js/web/lib/wasm
Xu Xing 8c59cd4fce
[js/webgpu] Support GroupQueryAttention (#20237)
TODOs:
1. Handle H * params.kvNumHeads greater than work group size limit.
2. Support BNSH kv cache.
2024-05-13 09:43:37 -07:00
..
binding [js/web] rewrite backend resolve to allow multiple EPs (#19735) 2024-03-15 11:47:45 -07:00
jsep [js/webgpu] Support GroupQueryAttention (#20237) 2024-05-13 09:43:37 -07:00
proxy-worker [js/web] transfer input buffer back to caller thread (#19677) 2024-03-01 14:50:06 -08:00
proxy-messages.ts [js/web] revise backend registration (#18715) 2023-12-20 14:45:55 -08:00
proxy-wrapper.ts [js/web] rewrite backend resolve to allow multiple EPs (#19735) 2024-03-15 11:47:45 -07:00
run-options.ts [js/web] enable ONNX Runtime Web error messages in JS (#16335) 2023-06-15 09:45:41 -07:00
session-handler-inference.ts [js] enable external data loading for ort-web (#19087) 2024-01-12 19:24:24 -08:00
session-handler-training.ts [js/web] revise backend registration (#18715) 2023-12-20 14:45:55 -08:00
session-options.ts [js/webgpu] Support capture and replay for jsep (#18989) 2024-01-30 18:28:03 -08:00
wasm-common.ts [js] changes to allow Float16Array if any polyfill is available (#19305) 2024-02-21 00:31:06 -08:00
wasm-core-impl.ts [js/webgpu] allow setting env.webgpu.adapter (#19940) 2024-03-19 12:55:00 -07:00
wasm-factory.ts [js/web] show warning when numThreads is set but threads is not supported (#19179) 2024-01-17 15:04:22 -08:00
wasm-training-core-impl.ts [js/web] revise backend registration (#18715) 2023-12-20 14:45:55 -08:00
wasm-utils-load-file.ts [js/web] allow proxy to load model with 1GB <= size < 2GB (#19178) 2024-01-17 15:03:43 -08:00
wasm-utils.ts [js/web] enable ONNX Runtime Web error messages in JS (#16335) 2023-06-15 09:45:41 -07:00