mirror of
https://github.com/saymrwulf/onnxruntime.git
synced 2026-06-03 23:49:44 +00:00
### Description One quantization case was not covered by the current list of unit tests. This PR adds a unit test to cover that case with the fix. It fixes the issue #17619. ### Motivation and Context |
||
|---|---|---|
| .. | ||
| CalTableFlatBuffers | ||
| operators | ||
| __init__.py | ||
| calibrate.py | ||
| matmul_weight4_quantizer.py | ||
| onnx_model.py | ||
| onnx_quantizer.py | ||
| preprocess.py | ||
| qdq_loss_debug.py | ||
| qdq_quantizer.py | ||
| quant_utils.py | ||
| quantize.py | ||
| README.md | ||
| registry.py | ||
| shape_inference.py | ||
Quantization Tool
This tool can be used to quantize select ONNX models. Support is based on operators in the model. Please refer to https://onnxruntime.ai/docs/performance/quantization.html for usage details and https://github.com/microsoft/onnxruntime-inference-examples/tree/main/quantization for examples.