onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-18 18:52:16 +00:00

History

Xavier Dupré e726151b5c Introduce float 8 types (#14731 ) ### Description The PR implements FloatE4M3FN, FloatE5M2, FloatE4MEFNUZ, FloatE5M2FNUZ as described in PR https://github.com/onnx/onnx/pull/4805. It uses CUDA API to cast float/half to float8 if CUDA>=11.8, a custom implementation if CUDA<11.8. * It implements, Cast, QuantizeLinear, DequantizeLinear for all types on CPU, only for types FloatE4M3FN, FloatE5M2 on CUDA. * It extends the supported types for control flow operator, Shape, Reshape, Identity, If, Loop, Scan, Reshape * It implements Equal(19). * Cast, QuantizeLinear, DequantizeLinear operators now support a parameter `saturate` only valid for float 8 types. It is true by default. In that case, any value out of range is converted into the maximum float 8 value. If false, it is infinite. * QuantizeLinear, DequantizeLinear now supports multiple scales on CUDA (and ROCm by extension), scale = 1D tensor with one scale per channel ### Motivation and Context Supports latest onnx version. Fixes [AB#15395](https://aiinfra.visualstudio.com/6a833879-cd9b-44a4-a9de-adc2d818f13c/_workitems/edit/15395) --------- Co-authored-by: Xavier Dupre <xadupre@microsoft.com@orttrainingdev8.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net> Co-authored-by: Randy Shuai <rashuai@microsoft.com> Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com> Co-authored-by: Scott McKay <Scott.McKay@microsoft.com>		2023-05-30 13:25:58 -07:00
..
external	eigen.cmake use url info from deps.txt (#16129 )	2023-05-30 11:07:20 -07:00
patches	FlatBuffers fails to compile with gcc13. (#15787 )	2023-05-11 11:20:19 -07:00
tensorboard
adjust_global_compile_flags.cmake	Cleanup WASM cmake code (#15996 )	2023-05-20 18:07:39 -07:00
CMakeLists.txt	Introduce float 8 types (#14731 )	2023-05-30 13:25:58 -07:00
CMakeSettings.json
codeconv.runsettings
deps.txt	Implement openAI endpoint invoker for nuget (#15797 )	2023-05-11 22:04:02 -07:00
EnableVisualStudioCodeAnalysis.props
gdk_toolchain.cmake
Info.plist.in
libonnxruntime.pc.cmake.in
nuget_helpers.cmake
onnxruntime.cmake	Cleanup WASM cmake code (#15996 )	2023-05-20 18:07:39 -07:00
onnxruntime_codegen_tvm.cmake
onnxruntime_common.cmake	Cleanup WASM cmake code (#15996 )	2023-05-20 18:07:39 -07:00
onnxruntime_compile_triton_kernel.cmake	integrate triton into ort (#15862 )	2023-05-17 09:35:28 +08:00
onnxruntime_config.h.in	Adust GetVersionString() GetBuildInfoString() signatures and move them to OrtApi (#15921 )	2023-05-13 13:45:07 -07:00
onnxruntime_csharp.cmake
onnxruntime_flatbuffers.cmake	Rework some external targets to ease building with `-DFETCHCONTENT_FULLY_DISCONNECTED=ON` (#15323 )	2023-04-03 17:45:12 -07:00
onnxruntime_framework.cmake	Fix python pipeline for AzureEP without using root (#16023 )	2023-05-22 16:38:47 -07:00
onnxruntime_fuzz_test.cmake
onnxruntime_graph.cmake	Create dedicated build for training api (#14136 )	2023-01-10 20:58:04 -08:00
onnxruntime_ios.toolchain.cmake
onnxruntime_java.cmake	Update build option for training in java to enable_training_api (#15638 )	2023-04-24 11:53:08 -07:00
onnxruntime_java_unittests.cmake	Update build option for training in java to enable_training_api (#15638 )	2023-04-24 11:53:08 -07:00
onnxruntime_kernel_explorer.cmake	[ROCm] add hipblaslt into GemmFastGelu TunableOp (#15945 )	2023-05-23 11:07:09 +08:00
onnxruntime_language_interop_ops.cmake
onnxruntime_mlas.cmake	Cleanup WASM cmake code (#15996 )	2023-05-20 18:07:39 -07:00
onnxruntime_nodejs.cmake
onnxruntime_objectivec.cmake	Add iOS Swift Package Manager support (#15297 )	2023-04-20 16:18:35 +10:00
onnxruntime_opschema_lib.cmake
onnxruntime_optimizer.cmake	Optimize SCE loss compute (#15401 )	2023-04-13 13:02:12 +08:00
onnxruntime_providers.cmake	Cleanup WASM cmake code (#15996 )	2023-05-20 18:07:39 -07:00
onnxruntime_pyop.cmake
onnxruntime_python.cmake	[DML EP] Fix issue with --dml_path build option (#15972 )	2023-05-24 19:20:40 -05:00
onnxruntime_rocm_hipify.cmake	[ROCm] add beam search support (#15625 )	2023-04-26 17:53:33 +08:00
onnxruntime_session.cmake
onnxruntime_snpe_provider.cmake
onnxruntime_training.cmake
onnxruntime_unittests.cmake	Avoid trt deprecated api warnings shown as errors during ORT-TRT build (#16035 )	2023-05-24 13:19:27 -07:00
onnxruntime_util.cmake
onnxruntime_webassembly.cmake	Support WebNN EP (#15698 )	2023-05-08 21:25:10 -07:00
precompiled_header.cmake
Sdl.ruleset	Add a Github workflow for Prefast (#15763 )	2023-05-03 11:42:51 -07:00
set_winapi_family_desktop.h
target_delayload.cmake
uwp_stubs.h	Run clang-format in CI (#15524 )	2023-04-18 09:26:58 -07:00
wcos_rules_override.cmake
winml.cmake	Use target name for flatbuffers (#13991 )	2022-12-20 11:44:02 -08:00
winml_cppwinrt.cmake
winml_sdk_helpers.cmake
winml_unittests.cmake