onnxruntime/tools
Chen Fu 1c84621020
Adding ARM64 depthwise convolution kernel for symmetric quantization (#9655)

Motivation and Context
Two improvements over the current kernel code:

1. Uses signed int8 instructions, so operands no longer need to be widened from 8 bits to 16 bits before multiplication.
2. Unrolls the loop and applies manual software pipelining.
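
The two points above can be illustrated with a minimal sketch. This is not the ARM64 kernel from the commit. It is a plain Python model of the arithmetic and loop shape, and the function names `depthwise_acc_reference` and `depthwise_acc_pipelined` are hypothetical. It assumes symmetric quantization means a zero point of 0, so signed int8 inputs and filters are multiplied directly and summed into a wide accumulator with no offset subtraction.

```python
def depthwise_acc_reference(inputs, filters):
    """Per-channel sum over kernel taps of input * filter.

    inputs, filters: lists of K rows (one per kernel tap), each row
    holding C signed int8 values. Zero point is assumed to be 0.
    """
    channels = len(filters[0]) if filters else 0
    acc = [0] * channels
    for row_in, row_f in zip(inputs, filters):
        for c in range(channels):
            acc[c] += row_in[c] * row_f[c]
    return acc


def depthwise_acc_pipelined(inputs, filters):
    """Same result, with the channel loop unrolled by 4 and the next
    tap's rows fetched before the current tap is consumed."""
    taps = len(filters)
    channels = len(filters[0]) if filters else 0
    acc = [0] * channels
    if taps == 0:
        return acc

    # Prologue: "load" the first tap before entering the loop.
    next_in, next_f = inputs[0], filters[0]
    for k in range(taps):
        cur_in, cur_f = next_in, next_f
        # Issue the load for tap k + 1 ahead of the multiplies for tap k.
        if k + 1 < taps:
            next_in, next_f = inputs[k + 1], filters[k + 1]

        c = 0
        # Main body: four channels per iteration.
        while c + 4 <= channels:
            acc[c] += cur_in[c] * cur_f[c]
            acc[c + 1] += cur_in[c + 1] * cur_f[c + 1]
            acc[c + 2] += cur_in[c + 2] * cur_f[c + 2]
            acc[c + 3] += cur_in[c + 3] * cur_f[c + 3]
            c += 4
        # Tail: remaining channels when C is not a multiple of 4.
        while c < channels:
            acc[c] += cur_in[c] * cur_f[c]
            c += 1
    return acc
```

In a real kernel the overlap of loads and multiplies matters because the instructions execute concurrently. In Python the reordering only shows the structure, not a speedup.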

Co-authored-by: Chen Fu <fuchen@microsoft.com>
2021-11-15 12:18:43 -08:00
..
ci_build Adding ARM64 depthwise convolution kernel for symmetric quantization (#9655) 2021-11-15 12:18:43 -08:00
doc Add graphviz into Dockerfile images for Python API documentation (#7819) 2021-06-02 16:12:54 -07:00
nuget Enable building winml with --build_nuget (#9632) 2021-11-04 00:42:51 -07:00
perf_util Update mysql-connector-java (#5802) 2020-11-16 14:09:14 -08:00
python Remove experimental from ORT format namespace (#9729) 2021-11-11 19:46:30 -08:00