saymrwulf/onnxruntime: ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-04 04:07:22 +00:00

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

Find a file

Fanny Nina Paravecino c3c4db2c1b Upgrade GIST memory compression nodes, kernels, optimizer rule, and cli (#6262 ) * Add gist nodes, kernels, optimizer rule, and cli * Add Gist CUDA kernels * Added/updated gist compression cli to bert, gpt2, mnist * Fix decode priority generator for large models * Fix hardcoded decode priority generator, update gist training test * Fix incomplete if/else sequence for CI build * Added MSFP15 for gist compression type * fix Msfp15 bug * Resolved azure pipeline errors - unsupported ORT_RETURN macro format, cudastream argument * Resolved hardcoded cudastream argument, Pack8 zero error * Resolved PR comments - except gist tests * Added TypeInference to Gist Nodes, To attribute to Gist Decoder, Updated Gist Test Cases * Reverted error in merge commit * Updated logger usage in Gist rule, Updated GistPackMSFP15 compressed tensor's explaination * Converted onnxruntime::make_unique to std::make_unique based on PR 7502 Co-authored-by: Fanny Nina Paravecino <faninapa@microsoft.com> Co-authored-by: Aayush Ankit <aayushankit@microsoft.com> Co-authored-by: Aayush Ankit <Aayush-Ankit@users.noreply.github.com> Co-authored-by: Fanny Nina Paravecino <fanny.nina@microsoft.com>		2021-05-04 10:33:35 -07:00
.github
cgmanifests	pick onnx release candidate (#7177 )	2021-04-22 23:57:09 -07:00
cmake	compatibility was broken for myriad config parameter (#7349 )	2021-05-03 21:13:12 -07:00
csharp	Update SessionOptions.cs (#7540 )	2021-05-04 01:51:35 -07:00
dockerfiles	Install and use conda on ortmodule CI pipelines (#7530 )	2021-05-03 15:52:22 -07:00
docs	Android package infrastructure (#7430 )	2021-04-30 14:23:54 +10:00
include/onnxruntime/core	"Sticky" allocation of worker threads (#7551 )	2021-05-03 18:28:13 +01:00
java	Add static code analyzer to Windows CPU/GPU CI builds and fix the warnings (#7489 )	2021-04-29 11:54:57 -07:00
js	[js] fix library bundling and some trivial improvement (#7550 )	2021-05-03 18:31:55 -07:00
objectivec	Initial Objective-C API (#7366 )	2021-04-27 10:06:30 -07:00
onnxruntime	Upgrade GIST memory compression nodes, kernels, optimizer rule, and cli (#6262 )	2021-05-04 10:33:35 -07:00
orttraining	Upgrade GIST memory compression nodes, kernels, optimizer rule, and cli (#6262 )	2021-05-04 10:33:35 -07:00
package/rpm	Bumping up version to 1.7 (#6736 )	2021-02-17 19:07:38 -08:00
samples	Introduce ORTModule training API to ONNX Runtime	2021-03-10 10:48:10 -08:00
server	Update ORT server build pipeline (#7030 )	2021-03-16 18:02:09 -07:00
tools	Validate that the conversion script from the python package can be used to convert models. (#7517 )	2021-05-04 16:25:04 +10:00
winml	updated sampleTolerance of model fp16_inception_v1 for GPU execution provider (#7533 )	2021-05-03 12:08:31 -07:00
.clang-format
.clang-tidy
.dockerignore
.flake8	Add ability to track per operator types in reduced build config. (#6428 )	2021-01-29 07:59:51 +10:00
.gitattributes
.gitignore	Add auto doc gen for ORTModule API during CI build (#7046 )	2021-03-22 10:20:33 -07:00
.gitmodules	build ONNXRuntime into WebAssembly (#6478 )	2021-04-06 16:18:10 -07:00
build.amd64.1411.bat
build.bat
build.sh
CODEOWNERS	Update code owners for pytorch frontend team (#6329 )	2021-02-02 11:09:10 -08:00
CONTRIBUTING.md	Add README for docs (#6626 )	2021-03-12 15:14:40 -08:00
LICENSE	Remove year from license (#6658 )	2021-02-12 00:25:56 -08:00
NuGet.config	Delete nuget extra configs (#6477 )	2021-01-27 20:25:45 -08:00
ort.wprp
packages.config	Update DirectML version to 1.5.1 and enable ARM/ARM64 builds with DML (#7511 )	2021-04-30 00:49:30 -07:00
README.md	build ONNXRuntime into WebAssembly (#6478 )	2021-04-06 16:18:10 -07:00
requirements-dev.txt	Add ability to track per operator types in reduced build config. (#6428 )	2021-01-29 07:59:51 +10:00
requirements-doc.txt	Add auto doc gen for ORTModule API during CI build (#7046 )	2021-03-22 10:20:33 -07:00
requirements-training.txt	Add missing Python dependencies for ORT training (#7104 )	2021-03-23 18:43:19 -07:00
requirements.txt	Quantization calibration refactor (#6893 )	2021-03-19 01:09:11 -07:00
setup.py	Update DirectML version to 1.5.1 and enable ARM/ARM64 builds with DML (#7511 )	2021-04-30 00:49:30 -07:00
ThirdPartyNotices.txt	Enable CoreML EP for minimal extended mode (#7266 )	2021-04-08 17:45:22 -07:00
VERSION_NUMBER	Bumping up version to 1.7 (#6736 )	2021-02-17 19:07:38 -08:00

README.md

ONNX Runtime is a cross-platform inference and training machine-learning accelerator compatible with deep learning frameworks, PyTorch and TensorFlow/Keras, as well as classical machine learning libraries such as scikit-learn, and more.

ONNX Runtime uses the portable ONNX computation graph format, backed by execution providers optimized for operating systems, drivers and hardware.

Common use cases for ONNX Runtime:

Improve inference performance for a wide variety of ML models
Reduce time and cost of training large models
Train in Python but deploy into a C#/C++/Java app
Run with optimized performance on different hardware and operating systems
Support models created in several different frameworks

ONNX Runtime inference APIs are stable and production-ready since the 1.0 release in October 2019 and can enable faster customer experiences and lower costs.

ONNX Runtime training feature was introduced in May 2020 in preview. This feature supports acceleration of PyTorch training on multi-node NVIDIA GPUs for transformer models. Additional updates for this feature are coming soon.