onnxruntime/js/web/test/data
Xu Xing 8c59cd4fce
[js/webgpu] Support GroupQueryAttention (#20237)
TODOs:
1. Handle the case where H * params.kvNumHeads exceeds the workgroup size limit.
2. Support the BNSH KV cache layout.
2024-05-13 09:43:37 -07:00
ops [js/webgpu] Support GroupQueryAttention (#20237) 2024-05-13 09:43:37 -07:00