mirror of
https://github.com/saymrwulf/onnxruntime.git
synced 2026-05-24 22:17:32 +00:00
TODOs: 1. Handle H * params.kvNumHeads greater than work group size limit. 2. Support BNSH kv cache. |
||
|---|---|---|
| .. | ||
| webgpu | ||
| backend-webgpu.ts | ||
| init.ts | ||
| log.ts | ||
| tensor-view.ts | ||
| util.ts | ||