onnxruntime/tools/ci_build/github
Chen Fu 1c84621020
Adding ARM64 depthwise convolution kernel for symmetric quantization (#9655)

Motivation and Context
Two improvements over the current kernel code:

1. Uses signed int8 instructions, so operands no longer need to be extended from 8 bits to 16 bits before multiplication.
2. Unrolls the loop with manual software pipelining.

Co-authored-by: Chen Fu <fuchen@microsoft.com>
2021-11-15 12:18:43 -08:00
..
android Disable partial runtime optimization implementation by default (#9748) 2021-11-12 17:37:29 -08:00
apple Disable partial runtime optimization implementation by default (#9748) 2021-11-12 17:37:29 -08:00
azure-pipelines Update training pipelines to pytorch 1.10 (#9709) 2021-11-15 11:21:55 -08:00
js update ONNX Runtime Web CI to use same script for package versioning (#9698) 2021-11-10 12:52:34 -08:00
linux Adding ARM64 depthwise convolution kernel for symmetric quantization (#9655) 2021-11-15 12:18:43 -08:00
pai work around ucx in rocm ci Dockerfile (#9360) 2021-10-14 09:49:31 -07:00
python_checks Add Python checks pipeline (#7032) 2021-08-09 10:37:05 -07:00
windows Abjindal/eager windows ci pipeline (#9587) 2021-11-05 09:09:09 -07:00
Doxyfile_csharp.cfg Add Xamarin support (#9436) 2021-10-27 20:07:07 +10:00