onnxruntime/js/web/docs
Xu Xing 8c59cd4fce
[js/webgpu] Support GroupQueryAttention (#20237)
TODOs:
1. Handle H * params.kvNumHeads greater than work group size limit.
2. Support BNSH kv cache.
2024-05-13 09:43:37 -07:00
..
webgl-operators.md Integration with ONNX 1.16.0 (#19745) 2024-04-12 09:46:49 -07:00
webgpu-operators.md [js/webgpu] Support GroupQueryAttention (#20237) 2024-05-13 09:43:37 -07:00
webnn-operators.md [WebNN EP] Add operators support table (#20253) 2024-04-17 21:19:46 -07:00