onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-21 19:18:55 +00:00

History

Xavier Dupré e726151b5c Introduce float 8 types (#14731 ) ### Description The PR implements FloatE4M3FN, FloatE5M2, FloatE4MEFNUZ, FloatE5M2FNUZ as described in PR https://github.com/onnx/onnx/pull/4805. It uses CUDA API to cast float/half to float8 if CUDA>=11.8, a custom implementation if CUDA<11.8. * It implements, Cast, QuantizeLinear, DequantizeLinear for all types on CPU, only for types FloatE4M3FN, FloatE5M2 on CUDA. * It extends the supported types for control flow operator, Shape, Reshape, Identity, If, Loop, Scan, Reshape * It implements Equal(19). * Cast, QuantizeLinear, DequantizeLinear operators now support a parameter `saturate` only valid for float 8 types. It is true by default. In that case, any value out of range is converted into the maximum float 8 value. If false, it is infinite. * QuantizeLinear, DequantizeLinear now supports multiple scales on CUDA (and ROCm by extension), scale = 1D tensor with one scale per channel ### Motivation and Context Supports latest onnx version. Fixes [AB#15395](https://aiinfra.visualstudio.com/6a833879-cd9b-44a4-a9de-adc2d818f13c/_workitems/edit/15395) --------- Co-authored-by: Xavier Dupre <xadupre@microsoft.com@orttrainingdev8.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net> Co-authored-by: Randy Shuai <rashuai@microsoft.com> Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com> Co-authored-by: Scott McKay <Scott.McKay@microsoft.com>		2023-05-30 13:25:58 -07:00
..
nodejs/templates	Add explicit 'set +x' before printing a vso[] command to avoid output getting parsed again with a trailing quote. (#15986 )	2023-05-17 19:30:28 -07:00
nuget/templates	More fixes on nuget pipeline (#16091 )	2023-05-27 08:59:12 +08:00
templates	More fixes on nuget pipeline (#16091 )	2023-05-27 08:59:12 +08:00
android-arm64-v8a-QNN-crosscompile-ci-pipeline.yml	[QNN EP] Update default QNN SDK to version 2.10.0 (#15818 )	2023-05-05 13:01:21 -07:00
android-x86_64-crosscompile-ci-pipeline.yml	Refactor web-ci pipeline and delete eager mode CI pipeline (#15416 )	2023-04-10 10:41:04 -07:00
binary-size-checks-pipeline.yml	Update binary size checks pipeline to use stages for separate checks. (#15408 )	2023-04-07 09:55:40 -07:00
build-perf-test-binaries-pipeline.yml	Disable XNNPack EP's tests in Windows CI pipeline (#15406 )	2023-04-13 12:19:32 -07:00
c-api-noopenmp-packaging-pipelines.yml	More fixes on nuget pipeline (#16091 )	2023-05-27 08:59:12 +08:00
clean-build-docker-image-cache-pipeline.yml
linux-ci-pipeline.yml	Various fixes to the CSharp setup (#15782 )	2023-05-05 14:27:30 +10:00
linux-cpu-aten-pipeline.yml	Rename onnxruntime-Linux-CPU-2019 machine pool (#15691 )	2023-04-27 12:46:18 -07:00
linux-cpu-eager-pipeline.yml	Rename onnxruntime-Linux-CPU-2019 machine pool (#15691 )	2023-04-27 12:46:18 -07:00
linux-cpu-minimal-build-ci-pipeline.yml	Introduce float 8 types (#14731 )	2023-05-30 13:25:58 -07:00
linux-dnnl-ci-pipeline.yml	Update python 3.11 and remove 3.7 for Linux (#15214 )	2023-03-27 14:46:30 -07:00
linux-gpu-ci-pipeline.yml	[Fix] No test result report while not using ctest (#15976 )	2023-05-17 08:31:16 -07:00
linux-gpu-tensorrt-ci-pipeline.yml	[TensorRT EP] avoid excessive library load/unload overhead when running unit tests. (#15639 )	2023-04-24 14:43:13 -07:00
linux-gpu-tensorrt-daily-perf-pipeline.yml	Fix EP Perf pipeline (#15507 )	2023-04-27 10:09:14 -07:00
linux-migraphx-ci-pipeline.yml	[ROCm] update ROCm/MIGraphX CI to ROCm5.5 (#15905 )	2023-05-15 10:28:15 +08:00
linux-multi-gpu-ci-pipeline.yml	Update Cuda to 11.8 in 2 Linux GPU workflows. (#15925 )	2023-05-14 12:51:25 +08:00
linux-multi-gpu-tensorrt-ci-pipeline.yml
linux-openvino-ci-pipeline.yml
linux-openvino-nightly-pipeline.yml
linux-qnn-ci-pipeline.yml	[QNN EP] Enable Qnn context cache to save model initialization time (#15815 )	2023-05-19 10:52:17 -07:00
mac-ci-pipeline.yml	Cjian/multi stage packaging pipeline (#14993 )	2023-03-24 23:39:15 -07:00
mac-coreml-ci-pipeline.yml	Refactor all Mac build steps (#15440 )	2023-04-11 12:12:46 +08:00
mac-ios-ci-pipeline.yml	Refactor all Mac build steps (#15440 )	2023-04-11 12:12:46 +08:00
mac-ios-packaging-pipeline.yml	Add explicit 'set +x' before printing a vso[] command to avoid output getting parsed again with a trailing quote. (#15986 )	2023-05-17 19:30:28 -07:00
mac-objc-static-analysis-ci-pipeline.yml	Add iOS Swift Package Manager support (#15297 )	2023-04-20 16:18:35 +10:00
mac-react-native-ci-pipeline.yml	Rename onnxruntime-Linux-CPU-2019 machine pool (#15691 )	2023-04-27 12:46:18 -07:00
npm-packaging-pipeline.yml	Disable XNNPack EP's tests in Windows CI pipeline (#15406 )	2023-04-13 12:19:32 -07:00
orttraining-linux-ci-pipeline.yml	clear cache stat. after building (#15439 )	2023-04-10 13:56:55 +08:00
orttraining-linux-external-custom-ops.yml	Update python 3.11 and remove 3.7 for Linux (#15214 )	2023-03-27 14:46:30 -07:00
orttraining-linux-gpu-amd-e2e-test-ci-pipeline.yml	Add explicit 'set +x' before printing a vso[] command to avoid output getting parsed again with a trailing quote. (#15986 )	2023-05-17 19:30:28 -07:00
orttraining-linux-gpu-ci-pipeline.yml	Add support for cuda 11.8 and python 3.11 for training (#15548 )	2023-04-20 12:56:45 -07:00
orttraining-linux-gpu-distributed-e2e-test-pipeline.yml
orttraining-linux-gpu-docker-release-pipeline.yml
orttraining-linux-gpu-ortmodule-distributed-test-ci-pipeline.yml	Add support for cuda 11.8 and python 3.11 for training (#15548 )	2023-04-20 12:56:45 -07:00
orttraining-linux-gpu-ortmodule-test-clear-cache-pipeline.yml	Rename onnxruntime-Linux-CPU-2019 machine pool (#15691 )	2023-04-27 12:46:18 -07:00
orttraining-linux-gpu-training-apis.yml	Add support for cuda 11.8 and python 3.11 for training (#15548 )	2023-04-20 12:56:45 -07:00
orttraining-linux-nightly-ortmodule-test-pipeline.yml
orttraining-mac-ci-pipeline.yml	Cjian/multi stage packaging pipeline (#14993 )	2023-03-24 23:39:15 -07:00
orttraining-pai-ci-pipeline.yml	Add explicit 'set +x' before printing a vso[] command to avoid output getting parsed again with a trailing quote. (#15986 )	2023-05-17 19:30:28 -07:00
orttraining-py-packaging-pipeline-cpu.yml	Add support for cuda 11.8 and python 3.11 for training (#15548 )	2023-04-20 12:56:45 -07:00
orttraining-py-packaging-pipeline-cuda.yml	Add support for cuda 11.8 and python 3.11 for training (#15548 )	2023-04-20 12:56:45 -07:00
orttraining-py-packaging-pipeline-rocm.yml	[ROCm] remove ROCm5.2.3, ROCm5.3, ROCm5.4 from pipeline (#16004 )	2023-05-19 10:29:01 +08:00
post-merge-jobs.yml	Update Win-CPU-2021 to onnxruntime-Win-CPU-2022 (#15967 )	2023-05-17 08:29:27 -07:00
py-package-build-pipeline.yml	Update python 3.11 and remove 3.7 for Linux (#15214 )	2023-03-27 14:46:30 -07:00
py-package-test-pipeline.yml	Disable XNNPack EP's tests in Windows CI pipeline (#15406 )	2023-04-13 12:19:32 -07:00
py-packaging-pipeline.yml	Update python 3.11 and remove 3.7 for Linux (#15214 )	2023-03-27 14:46:30 -07:00
qnn-ep-nuget-packaging-pipeline.yml	[QNN EP] Update default QNN SDK version to 2.10 for QNN NuGet pipeline (#15899 )	2023-05-15 09:17:42 -07:00
sign_ov_ep_binaries.yml
snpe-ep-nuget-packaging-pipeline.yml	Cjian/windows update python3.11 (#15243 )	2023-03-28 22:15:47 -07:00
web-ci-pipeline.yml	Refactor web-ci pipeline and delete eager mode CI pipeline (#15416 )	2023-04-10 10:41:04 -07:00
web-packaging-pipeline.yml	Update Win-CPU-2021 to onnxruntime-Win-CPU-2022 (#15967 )	2023-05-17 08:29:27 -07:00
win-ci-fuzz-testing.yml	Update Win-CPU-2021 to onnxruntime-Win-CPU-2022 (#15967 )	2023-05-17 08:29:27 -07:00
win-ci-pipeline.yml	Implement mutex-free spin lock for task queue (#14834 )	2023-05-19 10:12:10 -07:00
win-gpu-ci-pipeline.yml	Change CUDA pipelines to download CUDA SDK in every build job (#15915 )	2023-05-17 17:31:51 -07:00
win-gpu-reduce-op-ci-pipeline.yml	Change CUDA pipelines to download CUDA SDK in every build job (#15915 )	2023-05-17 17:31:51 -07:00
win-gpu-tensorrt-ci-pipeline.yml	Unify cuda & trt version on few CIs (#15943 )	2023-05-15 09:54:30 -07:00
win-qnn-arm64-ci-pipeline.yml	[QNN EP] Update default QNN SDK to version 2.10.0 (#15818 )	2023-05-05 13:01:21 -07:00
win-qnn-ci-pipeline.yml	[QNN EP] Update default QNN SDK to version 2.10.0 (#15818 )	2023-05-05 13:01:21 -07:00