saymrwulf/onnxruntime: ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-20 19:12:24 +00:00

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

Find a file

Scott McKay 097bab8d1e Cleanup a change to ExecutionFrame a little (#7576 ) * Reduce the binary size growth from this change. Minimal build grew by 7KB from this checkin. Firstly simplify the checking logic a little. Same checks are still done - just without using an extra layer of helpers. The issue being addressed by the original change only applies if you have a graph output where the shape wasn't able to be inferred. e.g. Reshape node with dynamic input causes downstream shapes to be unknown. If that is not the case, MergeShapeInfo in graph.cc would have resolved any differences between a specified output shape and the inferred output shape during Graph::Resolve. The issue does not apply to the execution frame used by the optimizer as the only time it would create a graph output is if it could constant fold all the way through, so MergeShapeInfo would have handled any difference in that case as well. Due to these considerations, wiring a logger in at the IExecutionFrame level isn't necessary if VerifyOutputSizes optionally overridden by an implementation that cares. * Address PR comments		2021-05-06 19:29:34 +10:00
.github	Don't mark issues that are marked as enhancement as stale (#6134 )	2020-12-14 18:57:40 -08:00
cgmanifests	pick onnx release candidate (#7177 )	2021-04-22 23:57:09 -07:00
cmake	Additional cmake changes for OpenVINO build (#7579 )	2021-05-05 23:54:53 -07:00
csharp	Update SessionOptions.cs (#7540 )	2021-05-04 01:51:35 -07:00
dockerfiles	Install and use conda on ortmodule CI pipelines (#7530 )	2021-05-03 15:52:22 -07:00
docs	Android package infrastructure (#7430 )	2021-04-30 14:23:54 +10:00
include/onnxruntime/core	Add support for setting shape inference function on fused nodes (#7007 )	2021-05-05 13:32:07 +10:00
java	Add android test app to validate Java API for ORT-Mobile Android (#7477 )	2021-05-04 15:39:14 -07:00
js	[js] fix library bundling and some trivial improvement (#7550 )	2021-05-03 18:31:55 -07:00
objectivec	Update Objective-C API (#7567 )	2021-05-05 15:56:55 -07:00
onnxruntime	Cleanup a change to ExecutionFrame a little (#7576 )	2021-05-06 19:29:34 +10:00
orttraining	Fix compiler warnings treated as errors in GistEncodeDecode. (#7568 )	2021-05-05 09:05:11 -07:00
package/rpm	Bumping up version to 1.7 (#6736 )	2021-02-17 19:07:38 -08:00
samples	Introduce ORTModule training API to ONNX Runtime	2021-03-10 10:48:10 -08:00
server	Update ORT server build pipeline (#7030 )	2021-03-16 18:02:09 -07:00
tools	Ignore invalid input argument to install_os_deps.sh (#7566 )	2021-05-05 14:33:31 -07:00
winml	updated sampleTolerance of model fp16_inception_v1 for GPU execution provider (#7533 )	2021-05-03 12:08:31 -07:00
.clang-format	Initial bootstrap commit.	2018-11-19 16:48:22 -08:00
.clang-tidy	Add remaining build options and make minor changes in documentation (#39 )	2018-11-27 19:59:40 -08:00
.dockerignore	Update dockerfiles (#5929 )	2020-11-25 15:38:22 -08:00
.flake8	Sync ORTModule branch with master and fix tests (#6526 )	2021-02-02 08:59:56 -08:00
.gitattributes	Initial bootstrap commit.	2018-11-19 16:48:22 -08:00
.gitignore	Add auto doc gen for ORTModule API during CI build (#7046 )	2021-03-22 10:20:33 -07:00
.gitmodules	build ONNXRuntime into WebAssembly (#6478 )	2021-04-06 16:18:10 -07:00
build.amd64.1411.bat	Initial bootstrap commit.	2018-11-19 16:48:22 -08:00
build.bat	Initial bootstrap commit.	2018-11-19 16:48:22 -08:00
build.sh	Add iOS test pipeline and a sample app. (#5298 )	2020-09-29 13:53:11 -07:00
CODEOWNERS	Update code owners for pytorch frontend team (#6329 )	2021-02-02 11:09:10 -08:00
CONTRIBUTING.md	Add README for docs (#6626 )	2021-03-12 15:14:40 -08:00
LICENSE	Remove year from license (#6658 )	2021-02-12 00:25:56 -08:00
NuGet.config	Sync ORTModule branch with master and fix tests (#6526 )	2021-02-02 08:59:56 -08:00
ort.wprp	Add Tracelogging for profiling (#1639 )	2019-11-11 21:34:10 -08:00
packages.config	Update DirectML version to 1.5.1 and enable ARM/ARM64 builds with DML (#7511 )	2021-04-30 00:49:30 -07:00
README.md	build ONNXRuntime into WebAssembly (#6478 )	2021-04-06 16:18:10 -07:00
requirements-dev.txt	Sync ORTModule branch with master and fix tests (#6526 )	2021-02-02 08:59:56 -08:00
requirements-doc.txt	Add auto doc gen for ORTModule API during CI build (#7046 )	2021-03-22 10:20:33 -07:00
requirements-training.txt	Add missing Python dependencies for ORT training (#7104 )	2021-03-23 18:43:19 -07:00
requirements.txt	Quantization calibration refactor (#6893 )	2021-03-19 01:09:11 -07:00
setup.py	Update DirectML version to 1.5.1 and enable ARM/ARM64 builds with DML (#7511 )	2021-04-30 00:49:30 -07:00
ThirdPartyNotices.txt	Enable CoreML EP for minimal extended mode (#7266 )	2021-04-08 17:45:22 -07:00
VERSION_NUMBER	Bumping up version to 1.7 (#6736 )	2021-02-17 19:07:38 -08:00

README.md

ONNX Runtime is a cross-platform inference and training machine-learning accelerator compatible with deep learning frameworks, PyTorch and TensorFlow/Keras, as well as classical machine learning libraries such as scikit-learn, and more.

ONNX Runtime uses the portable ONNX computation graph format, backed by execution providers optimized for operating systems, drivers and hardware.

Common use cases for ONNX Runtime:

Improve inference performance for a wide variety of ML models
Reduce time and cost of training large models
Train in Python but deploy into a C#/C++/Java app
Run with optimized performance on different hardware and operating systems
Support models created in several different frameworks

ONNX Runtime inference APIs are stable and production-ready since the 1.0 release in October 2019 and can enable faster customer experiences and lower costs.

ONNX Runtime training feature was introduced in May 2020 in preview. This feature supports acceleration of PyTorch training on multi-node NVIDIA GPUs for transformer models. Additional updates for this feature are coming soon.