onnxruntime/onnxruntime
Xavier Dupré d0316ee768
Updating QDQ to support Float8E4M3FN (#16550)
### Description
Naive update quantization tools to support Float8E4M3FN for Gemm.
2023-08-08 12:18:48 +02:00
..
contrib_ops 4b quantization for weights of LLMs (#16833) 2023-08-07 12:23:55 -07:00
core Updating QDQ to support Float8E4M3FN (#16550) 2023-08-08 12:18:48 +02:00
python Updating QDQ to support Float8E4M3FN (#16550) 2023-08-08 12:18:48 +02:00
test Updating QDQ to support Float8E4M3FN (#16550) 2023-08-08 12:18:48 +02:00
tool/etw
wasm [js/web] enable ONNX Runtime Web error messages in JS (#16335) 2023-06-15 09:45:41 -07:00
__init__.py ExecutionProvider API refactor - move allocator from EP level to SessionState level and indexed by OrtDevice (#15833) 2023-06-19 17:44:45 -07:00
ReformatSource.ps1
ReformatSourcePython.bat
VSCodeCoverage.runsettings