onnxruntime/onnxruntime/python
Xavier Dupré 7efb0dbe12
add option DefaultTensorType to specify the default tensor type to quantize (#19455)
### Description
The current quantization tool relies on shape inference to provide the
type of every intermediate tensor, then the tool knows which type it
must dequantize into (float32, float16). However, this information is
not available if shape inference fails. That happens every time the
model include an operator from a custom domain such as com.microsoft.

This PR introduces an extra option `DefaultTensorType` as a fall back
when the quantizer cannot find the type it needs.

### Motivation and Context
This fixes issue #19409.
2024-02-20 08:22:44 -08:00
..
backend Bump linter versions (#18341) 2023-11-08 13:04:40 -08:00
datasets
providers/tvm
tools add option DefaultTensorType to specify the default tensor type to quantize (#19455) 2024-02-20 08:22:44 -08:00
torch_cpp_extensions [ORTModule] ATen Efficient Attention and Triton Flash Attention (#17959) 2023-10-27 10:29:27 +08:00
training
__init__.py
_ld_preload.py
_pybind_state.py.in Fix invalid escape sequence (#17145) 2023-08-15 10:29:54 -07:00
exported_symbols.lst
numpy_helper.h
onnxruntime_collect_build_info.py
onnxruntime_inference_collection.py Bump ruff linter to 0.2.1 (#19471) 2024-02-08 16:08:27 -08:00
onnxruntime_pybind.h
onnxruntime_pybind_exceptions.cc
onnxruntime_pybind_exceptions.h
onnxruntime_pybind_iobinding.cc Remove internal enforce for IO binding inputs (#18266) 2023-11-03 16:12:32 -07:00
onnxruntime_pybind_mlvalue.cc io_binding to handle optional input of sequence type_proto (#19273) 2024-01-30 10:25:14 -08:00
onnxruntime_pybind_mlvalue.h [DML EP] Complete python IO binding implementation (#17344) 2023-09-13 07:26:35 -07:00
onnxruntime_pybind_module.cc Fix warning C4003 in ORT python binding code (#18612) 2023-11-30 08:07:47 -08:00
onnxruntime_pybind_ortvalue.cc [DML EP] Complete python IO binding implementation (#17344) 2023-09-13 07:26:35 -07:00
onnxruntime_pybind_quant.cc Block-wise 4b quantization matmul operator change (#18172) 2023-11-03 15:29:43 -07:00
onnxruntime_pybind_schema.cc [DML EP] Add dynamic graph compilation (#17876) 2023-10-25 19:56:16 -07:00
onnxruntime_pybind_sparse_tensor.cc
onnxruntime_pybind_state.cc [VitisAI] Refactor the VAIEP to use MSFT's standalone API (#19058) 2024-01-31 21:08:26 -08:00
onnxruntime_pybind_state.h
onnxruntime_pybind_state_common.cc
onnxruntime_pybind_state_common.h [VitisAI] 1. api compatbile 2. dynamic load onnx (#18470) 2023-12-14 14:43:41 -08:00
onnxruntime_validation.py
pybind.def
version_script.lds
version_script_expose_onnx_protobuf.lds