onnxruntime/onnxruntime/python/tools/quantization
Yufeng Li 8e54b76e2d
QDQ implementation (#7033)
* Add QDQ basic implementation
2021-03-25 09:17:23 -07:00
CalTableFlatBuffers update quantize to support basic optimization and e2e example for image classification (#6313) 2021-01-14 09:27:10 -08:00
E2E_example_model Quantization calibration refactor (#6893) 2021-03-19 01:09:11 -07:00
notebooks Update ORT to 1.5.1 in Bert Quantization Notebook (#5396) 2020-10-08 09:55:01 -07:00
operators fix a bug in global average pool and add unit test (#6913) 2021-03-22 20:01:27 -07:00
__init__.py Entropy method for calibration-based quantization (#6619) 2021-02-18 05:50:59 -08:00
calibrate.py Quantization calibration refactor (#6893) 2021-03-19 01:09:11 -07:00
onnx_model.py Fix a bug in quant GEMM and add an unit test (#7111) 2021-03-23 16:39:35 -07:00
onnx_quantizer.py QDQ implementation (#7033) 2021-03-25 09:17:23 -07:00
qdq_quantizer.py Improvements to quantizer: Removed unused qType field, add reshape op (#6179) 2021-02-26 10:21:37 -08:00
quant_utils.py Quantization calibration refactor (#6893) 2021-03-19 01:09:11 -07:00
quantize.py Entropy method for calibration-based quantization (#6619) 2021-02-18 05:50:59 -08:00
README.md Reroute quantization tool readme to /docs page (#6854) 2021-03-02 13:49:42 -08:00
registry.py Improvements to quantizer: Removed unused qType field, add reshape op (#6179) 2021-02-26 10:21:37 -08:00

Quantization Tool

This tool quantizes selected ONNX models. Whether a given model can be quantized depends on the operators it contains. For usage details, see https://www.onnxruntime.ai/docs/how-to/quantization.html.
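
As a rough illustration of how the tool might be invoked from Python, the sketch below wraps `quantize_dynamic` from `onnxruntime.quantization`. The wrapper `quantize_model`, the helper `default_output_path`, and the `.quant.onnx` naming scheme are hypothetical and not part of the tool. The choice of `QuantType.QUInt8` for weights is likewise an assumption. The linked documentation remains the authoritative reference for options and supported operators.

```python
from pathlib import Path


def default_output_path(model_path):
    """Hypothetical helper: map 'model.onnx' to 'model.quant.onnx' in the same directory."""
    path = Path(model_path)
    return path.with_name(path.stem + ".quant" + path.suffix)


def quantize_model(model_path, output_path=None):
    """Hypothetical wrapper: dynamically quantize an ONNX model's weights to 8 bits."""
    # Imported lazily so this module still loads where onnxruntime is not installed.
    from onnxruntime.quantization import QuantType, quantize_dynamic

    output_path = Path(output_path) if output_path else default_output_path(model_path)
    # Assumption: unsigned 8-bit weights; other settings are left at the library defaults.
    quantize_dynamic(str(model_path), str(output_path), weight_type=QuantType.QUInt8)
    return output_path
```

Calling `quantize_model("model.onnx")` would, under these assumptions, write the quantized model next to the original file. Dynamic quantization is used here only because it needs no calibration data; models that rely on operators the tool does not support may be left partly or wholly unquantized.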