onnxruntime/onnxruntime/python/tools/quantization
2021-03-02 13:49:42 -08:00
..
CalTableFlatBuffers update quantize to support basic optimization and e2e example for image classification (#6313) 2021-01-14 09:27:10 -08:00
E2E_example_model fix issues caused by quantize/calibrate changes (#6802) 2021-02-25 05:41:21 -08:00
notebooks Update ORT to 1.5.1 in Bert Quantization Notebook (#5396) 2020-10-08 09:55:01 -07:00
operators Improvements to quantizer: Removed unused qType field, add reshape op (#6179) 2021-02-26 10:21:37 -08:00
__init__.py Entropy method for calibration-based quantization (#6619) 2021-02-18 05:50:59 -08:00
calibrate.py Make keepdims to its default value when adding ReduceMin/ReduceMax for quantization calibration (#6788) 2021-02-25 09:47:59 -08:00
onnx_model.py Avoid removing constant weight that is graph output (#6735) 2021-02-17 19:55:19 -08:00
onnx_quantizer.py Improvements to quantizer: Removed unused qType field, add reshape op (#6179) 2021-02-26 10:21:37 -08:00
qdq_quantizer.py Improvements to quantizer: Removed unused qType field, add reshape op (#6179) 2021-02-26 10:21:37 -08:00
quant_utils.py Improvements to quantizer: Removed unused qType field, add reshape op (#6179) 2021-02-26 10:21:37 -08:00
quantize.py Entropy method for calibration-based quantization (#6619) 2021-02-18 05:50:59 -08:00
README.md Reroute quantization tool readme to /docs page (#6854) 2021-03-02 13:49:42 -08:00
registry.py Improvements to quantizer: Removed unused qType field, add reshape op (#6179) 2021-02-26 10:21:37 -08:00

Quantization Tool

This tool can be used to quantize select ONNX models. Support is based on operators in the model. Please refer to https://www.onnxruntime.ai/docs/how-to/quantization.html for usage details.