onnxruntime/js/web/lib
Xu Xing 8c59cd4fce
[js/webgpu] Support GroupQueryAttention (#20237)
TODOs:
1. Handle H * params.kvNumHeads greater than work group size limit.
2. Support BNSH kv cache.
2024-05-13 09:43:37 -07:00
..
onnxjs [js] Make error friendly when isOrtFormat is undefined (#19958) 2024-03-27 02:07:00 -07:00
wasm [js/webgpu] Support GroupQueryAttention (#20237) 2024-05-13 09:43:37 -07:00
backend-onnxjs.ts
backend-wasm-inference.ts
backend-wasm-training.ts
backend-wasm.ts
build-def.d.ts
index.ts
version.ts Bump up version in main from 1.18.0 to 1.19.0 (#20489) 2024-04-29 20:21:41 -07:00