onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-16 18:31:27 +00:00

Jiajia Qin 7e0dd9d433 [js/webgpu] Optimize Expand (#22752). Use components = 4 if possible. llama3.2-1B becomes 20 tokens/s from 18 tokens/s on my iGPUs.	2024-11-12 12:37:19 -08:00
..
ops	[js/webgpu] Optimize Expand (#22752)	2024-11-12 12:37:19 -08:00