saymrwulf/onnxruntime: ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-03 03:58:54 +00:00

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

Find a file

Adrian Tsai 3d37a3c1d3 Merged PR 5807585: Remove support for strided 64-bit emulation in DML's Cast kernel A model from one of our partners regressed with a failure to evaluate due to the addition of strided 64-bit emulation in the DML EP for the Cast operator. Specifically, the model uses a Cast from int32 to int64 to produce the input shape to a Reshape node. When supplied with a shape dimension of -1 (int32 0xffffffff), the strided emulation in Cast ends up producing an int64 result of 0x00000000ffffffff. This is then fed into the Reshape operator, where it produces an incorrect tensor shape and a failure during evaluation. Generally speaking we assume that using strided 64-bit emulation is safe if a node's inputs came from the DML EP itself. This isn't true in the general case for Cast, however - casting negative signed values can and will produce incorrect outputs with strided emulation. After this change, Cast nodes with 64-bit types will fall back to CPU unless running on a GPU that native supports 64-bit datatypes. Related work items: #31768166		2021-03-18 00:42:32 +00:00
.github	Don't mark issues that are marked as enhancement as stale (#6134 )	2020-12-14 18:57:40 -08:00
cgmanifests	Upgrade TensorRT to v7.2.2 (#6452 )	2021-02-18 04:30:47 -08:00
cmake	Capitalize DLL name	2021-03-17 11:01:14 -07:00
csharp	Update packaging pipelines(#6664 )	2021-02-17 09:53:36 -08:00
dockerfiles	Setup perf in docker and add features (#6582 )	2021-02-25 09:31:03 -08:00
docs	Update docs/ONNX_Runtime_for_Mobile_Platforms.md with info about op type reduction. (#6747 )	2021-02-23 10:25:23 -08:00
include/onnxruntime/core	[CoreML EP] Add options to enable CoreML EP only on hardware with Apple Neural Engine (#6765 )	2021-02-22 18:55:27 -08:00
java	[Java] Adds extra providers (#6770 )	2021-02-24 10:25:05 -08:00
nodejs	Removed BUILD.md from master as source now lives in gh-pages (#6709 )	2021-02-19 11:34:21 -08:00
onnxruntime	Merged PR 5807585: Remove support for strided 64-bit emulation in DML's Cast kernel	2021-03-18 00:42:32 +00:00
orttraining	Merge remote-tracking branch 'upstream/master' into dmldev_temp	2021-02-25 12:02:34 -08:00
package/rpm	Bumping up version to 1.7 (#6736 )	2021-02-17 19:07:38 -08:00
samples	Removed BUILD.md from master as source now lives in gh-pages (#6709 )	2021-02-19 11:34:21 -08:00
server	Remove nGraph Execution Provider (#5858 )	2020-11-19 16:47:55 -08:00
tools	Setup perf in docker and add features (#6582 )	2021-02-25 09:31:03 -08:00
winml	Minor WinML model test skip name change	2021-02-17 14:27:58 -08:00
.clang-format	Initial bootstrap commit.	2018-11-19 16:48:22 -08:00
.clang-tidy	Add remaining build options and make minor changes in documentation (#39 )	2018-11-27 19:59:40 -08:00
.dockerignore	Update dockerfiles (#5929 )	2020-11-25 15:38:22 -08:00
.flake8	Add ability to track per operator types in reduced build config. (#6428 )	2021-01-29 07:59:51 +10:00
.gitattributes	Initial bootstrap commit.	2018-11-19 16:48:22 -08:00
.gitignore	Add robust dependency check for Python package (#6436 )	2021-02-21 15:11:28 -08:00
.gitmodules	Upgrade TensorRT to v7.2.2 (#6452 )	2021-02-18 04:30:47 -08:00
build.amd64.1411.bat	Initial bootstrap commit.	2018-11-19 16:48:22 -08:00
build.bat	Initial bootstrap commit.	2018-11-19 16:48:22 -08:00
build.sh	Add iOS test pipeline and a sample app. (#5298 )	2020-09-29 13:53:11 -07:00
CODEOWNERS	Update code owners for pytorch frontend team (#6329 )	2021-02-02 11:09:10 -08:00
CONTRIBUTING.md	Removed BUILD.md from master as source now lives in gh-pages (#6709 )	2021-02-19 11:34:21 -08:00
LICENSE	Remove year from license (#6658 )	2021-02-12 00:25:56 -08:00
NuGet.config	Delete nuget extra configs (#6477 )	2021-01-27 20:25:45 -08:00
ort.wprp	Add Tracelogging for profiling (#1639 )	2019-11-11 21:34:10 -08:00
packages.config	Update DirectML 1.4.1 to 1.4.2 for ORT 1.7 (#6780 )	2021-02-23 10:52:10 -08:00
README.md	Add direct link to build instructions on readme (#6729 )	2021-02-19 10:56:50 -08:00
requirements-dev.txt	Add ability to track per operator types in reduced build config. (#6428 )	2021-01-29 07:59:51 +10:00
requirements-doc.txt	Update readme.rst for pypi, change documentation style (#1663 )	2019-10-19 18:26:34 -07:00
requirements.txt	Remove cerberus from wheel package (#4919 )	2020-08-26 09:00:03 -07:00
setup.py	Add Python 3.9 to pypi metadata	2021-02-12 20:00:17 -08:00
ThirdPartyNotices.txt	Merge CPU packaging pipelines (#6480 )	2021-02-04 08:38:56 -08:00
VERSION_NUMBER	Bumping up version to 1.7 (#6736 )	2021-02-17 19:07:38 -08:00

README.md

ONNX Runtime is a cross-platform inference and training machine-learning accelerator compatible with deep learning frameworks, PyTorch and TensorFlow/Keras, as well as classical machine learning libraries such as scikit-learn, and more.

ONNX Runtime uses the portable ONNX computation graph format, backed by execution providers optimized for operating systems, drivers and hardware.

Common use cases for ONNX Runtime:

Improve inference performance for a wide variety of ML models
Reduce time and cost of training large models
Train in Python but deploy into a C#/C++/Java app
Run with optimized performance on different hardware and operating systems
Support models created in several different frameworks

ONNX Runtime inference APIs are stable and production-ready since the 1.0 release in October 2019 and can enable faster customer experiences and lower costs.

ONNX Runtime training feature was introduced in May 2020 in preview. This feature supports acceleration of PyTorch training on multi-node NVIDIA GPUs for transformer models. Additional updates for this feature are coming soon.