mirror of
https://github.com/saymrwulf/onnxruntime.git
synced 2026-05-20 21:40:57 +00:00
### Description This PR provides a vectorized algorithm for NHWC GroupedConv to improve performance. The aggregate time of GroupedConv in mobilenetv2-12 becomes ~1ms from ~4ms on Intel Alder Lake machine. About 20% improvement for the whole model. |
||
|---|---|---|
| .. | ||
| data/ops | ||
| e2e | ||
| unittests | ||
| op-test-schema.json | ||
| suite-test-list.jsonc | ||
| test-main.ts | ||
| test-runner.ts | ||
| test-shared.ts | ||
| test-types.ts | ||