onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-05-29 23:06:41 +00:00

History

Xavier Dupré 7efb0dbe12 add option DefaultTensorType to specify the default tensor type to quantize (#19455 ) ### Description The current quantization tool relies on shape inference to provide the type of every intermediate tensor, then the tool knows which type it must dequantize into (float32, float16). However, this information is not available if shape inference fails. That happens every time the model include an operator from a custom domain such as com.microsoft. This PR introduces an extra option `DefaultTensorType` as a fall back when the quantizer cannot find the type it needs. ### Motivation and Context This fixes issue #19409.		2024-02-20 08:22:44 -08:00
..
backend	Bump linter versions (#18341 )	2023-11-08 13:04:40 -08:00
datasets
providers/tvm
tools	add option DefaultTensorType to specify the default tensor type to quantize (#19455 )	2024-02-20 08:22:44 -08:00
torch_cpp_extensions	[ORTModule] ATen Efficient Attention and Triton Flash Attention (#17959 )	2023-10-27 10:29:27 +08:00
training
__init__.py
_ld_preload.py
_pybind_state.py.in	Fix invalid escape sequence (#17145 )	2023-08-15 10:29:54 -07:00
exported_symbols.lst
numpy_helper.h
onnxruntime_collect_build_info.py
onnxruntime_inference_collection.py	Bump ruff linter to 0.2.1 (#19471 )	2024-02-08 16:08:27 -08:00
onnxruntime_pybind.h
onnxruntime_pybind_exceptions.cc
onnxruntime_pybind_exceptions.h
onnxruntime_pybind_iobinding.cc	Remove internal enforce for IO binding inputs (#18266 )	2023-11-03 16:12:32 -07:00
onnxruntime_pybind_mlvalue.cc	io_binding to handle optional input of sequence type_proto (#19273 )	2024-01-30 10:25:14 -08:00
onnxruntime_pybind_mlvalue.h	[DML EP] Complete python IO binding implementation (#17344 )	2023-09-13 07:26:35 -07:00
onnxruntime_pybind_module.cc	Fix warning C4003 in ORT python binding code (#18612 )	2023-11-30 08:07:47 -08:00
onnxruntime_pybind_ortvalue.cc	[DML EP] Complete python IO binding implementation (#17344 )	2023-09-13 07:26:35 -07:00
onnxruntime_pybind_quant.cc	Block-wise 4b quantization matmul operator change (#18172 )	2023-11-03 15:29:43 -07:00
onnxruntime_pybind_schema.cc	[DML EP] Add dynamic graph compilation (#17876 )	2023-10-25 19:56:16 -07:00
onnxruntime_pybind_sparse_tensor.cc
onnxruntime_pybind_state.cc	[VitisAI] Refactor the VAIEP to use MSFT's standalone API (#19058 )	2024-01-31 21:08:26 -08:00
onnxruntime_pybind_state.h
onnxruntime_pybind_state_common.cc
onnxruntime_pybind_state_common.h	[VitisAI] 1. api compatbile 2. dynamic load onnx (#18470 )	2023-12-14 14:43:41 -08:00
onnxruntime_validation.py
pybind.def
version_script.lds
version_script_expose_onnx_protobuf.lds