onnxruntime/onnxruntime
Xavier Dupré 7efb0dbe12
add option DefaultTensorType to specify the default tensor type to quantize (#19455)
### Description
The quantization tool currently relies on shape inference to provide the
type of every intermediate tensor, which tells it which type it must
dequantize into (float32 or float16). This information is unavailable
when shape inference fails, which happens whenever the model includes an
operator from a custom domain such as com.microsoft.

This PR introduces an extra option `DefaultTensorType` as a fallback
for when the quantizer cannot find the type it needs.
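
As a rough illustration of how the option might be supplied, the sketch below builds an `extra_options` dictionary and passes it to `quantize_dynamic`. The helper names `build_extra_options` and `quantize_with_fallback` are hypothetical and not part of this PR. The integer codes mirror `onnx.TensorProto.FLOAT` and `onnx.TensorProto.FLOAT16`, and limiting the choice to those two types is an assumption of this sketch, based on the types named in the description.

```python
# ONNX element-type codes, mirroring the onnx.TensorProto enum values.
FLOAT = 1      # onnx.TensorProto.FLOAT
FLOAT16 = 10   # onnx.TensorProto.FLOAT16


def build_extra_options(default_tensor_type=FLOAT):
    """Return an extra_options dict carrying the DefaultTensorType fallback.

    Hypothetical helper. Accepting only FLOAT and FLOAT16 is an assumption
    of this sketch, taken from the two types the description mentions.
    """
    if default_tensor_type not in (FLOAT, FLOAT16):
        raise ValueError("expected FLOAT (1) or FLOAT16 (10)")
    return {"DefaultTensorType": default_tensor_type}


def quantize_with_fallback(model_in, model_out, default_tensor_type=FLOAT):
    """Hypothetical wrapper: dynamic quantization with the fallback type set."""
    # Imported lazily so build_extra_options works without onnxruntime installed.
    from onnxruntime.quantization import quantize_dynamic

    quantize_dynamic(
        model_in,
        model_out,
        extra_options=build_extra_options(default_tensor_type),
    )
```

With a float32 model that contains custom-domain operators, a call such as `quantize_with_fallback("model.onnx", "model.quant.onnx")` would give the quantizer float32 as the type to assume wherever shape inference returned nothing. The file names here are placeholders.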

### Motivation and Context
This fixes issue #19409.
2024-02-20 08:22:44 -08:00
contrib_ops [JS/WebGPU] Add MatMulNBits (#19446) 2024-02-17 09:19:17 -08:00
core Whisper Timestamps and Temperature (#19509) 2024-02-16 15:21:43 -08:00
python add option DefaultTensorType to specify the default tensor type to quantize (#19455) 2024-02-20 08:22:44 -08:00
test add option DefaultTensorType to specify the default tensor type to quantize (#19455) 2024-02-20 08:22:44 -08:00
tool/etw
wasm [js/webgpu] Support capture and replay for jsep (#18989) 2024-01-30 18:28:03 -08:00
__init__.py [ORT 1.17.0 release] Bump up version to 1.18.0 (#19170) 2024-01-17 11:18:32 -08:00
ReformatSource.ps1
ReformatSourcePython.bat
VSCodeCoverage.runsettings