mirror of
https://github.com/saymrwulf/onnxruntime.git
synced 2026-06-25 02:50:42 +00:00
Use components = 4 if possible. llama3.2-1B becomes 20 tokens/s from 18 tokens/s on my iGPUs. |
||
|---|---|---|
| .. | ||
| webgpu | ||
| webnn | ||
| backend-webgpu.ts | ||
| backend-webnn.ts | ||
| init.ts | ||
| log.ts | ||
| tensor-view.ts | ||
| util.ts | ||