onnxruntime/README.md

<p align="center"><img width="50%" src="docs/images/ONNX_Runtime_logo_dark.png" /></p>

[![Build Status](https://dev.azure.com/onnxruntime/onnxruntime/_apis/build/status/Windows%20CPU%20CI%20Pipeline?label=Windows+CPU)](https://dev.azure.com/onnxruntime/onnxruntime/_build/latest?definitionId=9)
[![Build Status](https://dev.azure.com/onnxruntime/onnxruntime/_apis/build/status/Windows%20GPU%20CI%20Pipeline?label=Windows+GPU)](https://dev.azure.com/onnxruntime/onnxruntime/_build/latest?definitionId=10)
[![Build Status](https://dev.azure.com/onnxruntime/onnxruntime/_apis/build/status/Linux%20CPU%20CI%20Pipeline?label=Linux+CPU)](https://dev.azure.com/onnxruntime/onnxruntime/_build/latest?definitionId=11)
[![Build Status](https://dev.azure.com/onnxruntime/onnxruntime/_apis/build/status/Linux%20GPU%20CI%20Pipeline?label=Linux+GPU)](https://dev.azure.com/onnxruntime/onnxruntime/_build/latest?definitionId=12)
[![Build Status](https://dev.azure.com/onnxruntime/onnxruntime/_apis/build/status/MacOS%20CI%20Pipeline?label=MacOS+CPU)](https://dev.azure.com/onnxruntime/onnxruntime/_build/latest?definitionId=13)

**ONNX Runtime** is an open-source scoring engine for Open Neural Network Exchange (ONNX) models. 

ONNX is an open format for machine learning (ML) models that is supported by various ML and DNN frameworks and tools. This format makes it easier to interoperate between frameworks and to maximize the reach of your hardware optimization investments. Learn more about ONNX on [https://onnx.ai](https://onnx.ai) or view the [Github Repo](https://github.com/onnx/onnx). 
 
# Why use ONNX Runtime 
ONNX Runtime is an open architecture that is continually evolving to adapt to and address the newest developments and challenges in AI and Deep Learning. We will keep ONNX Runtime up to date with the ONNX standard, supporting all ONNX releases with future compatibliity while maintaining backwards compatibility with prior releases.

ONNX Runtime continuously strives to provide top performance for a broad and growing number of usage scenarios in Machine Learning. Our investments focus on these 3 core areas:
1. Run any ONNX model 
2. High performance
3. Cross platform

## Run any ONNX model

### Alignment with ONNX Releases
ONNX Runtime provides comprehensive support of the ONNX spec and can be used to run all models based on ONNX v1.2.1 and higher. See ONNX version release details [here](https://github.com/onnx/onnx/releases).

As of January 2019, ONNX Runtime supports ONNX 1.3. We will soon add support for the recently released ONNX 1.4.

### Traditional ML support
ONNX Runtime fully supports the ONNX-ML profile of the ONNX spec for traditional ML scenarios. 

## High Performance 
You can use ONNX Runtime with both CPU and GPU hardware. You can also plug in additional execution providers to ONNX Runtime. With many graph optimizations and various accelerators, ONNX Runtime can often provide lower latency and higher efficiency compared to other runtimes. This provides smoother end-to-end customer experiences and lower costs from improved machine utilization.

Currently ONNX Runtime supports CUDA and MKL-DNN (with option to build with MKL) for computation acceleration. To add an execution provider, please refer to [this page](docs/AddingExecutionProvider.md).

We are continuously working to integrate new execution providers to provide improvements in latency and efficiency. We have ongoing collaborations to integrate the following with ONNX Runtime:
	* Intel MKL-DNN and nGraph
	* NVIDIA TensorRT

## Cross Platform 
ONNX Runtime offers:
* APIs for Python, C#, and C
* Available for Linux, Windows, and Mac 

See API documentation and package installation instructions [below](#Installation). 

Looking ahead: To broaden the reach of the runtime, we will continue investments to make ONNX Runtime available and compatible with more platforms. These include but are not limited to:
* C# for Mac
* [ARM](BUILD.md##arm-builds)

# Getting Started 
If you need a model:  
* Check out the [ONNX Model Zoo](https://github.com/onnx/models) for ready-to-use pre-trained models. 
* To get an ONNX model by exporting from various frameworks, see [ONNX Tutorials](https://github.com/onnx/tutorials).

If you already have an ONNX model, just [install the runtime](#Installation) for your machine to try it out. One easy way to deploy the model on the cloud is by using [Azure Machine Learning](https://azure.microsoft.com/en-us/services/machine-learning-service). See detailed instructions [here](https://docs.microsoft.com/en-us/azure/machine-learning/service/how-to-build-deploy-onnx). 

# Installation
## APIs and Official Builds
| API Documentation | CPU package | GPU package |
|-----|-------------|-------------|
| [Python](https://aka.ms/onnxruntime-python) | [Windows/Linux/Mac](https://pypi.org/project/onnxruntime/)| [Windows/Linux](https://pypi.org/project/onnxruntime-gpu/)<br>(Compatible with CUDA9.1 and cuDNN 7.3) |
| [C#](docs/CSharp_API.md) | [Windows/Linux](https://www.nuget.org/packages/Microsoft.ML.OnnxRuntime/)<br>Mac - Coming Soon| [Windows/Linux](https://www.nuget.org/packages/Microsoft.ML.OnnxRuntime.Gpu/)<br>Mac - Coming Soon|
| [C](docs/C_API.md) | [Windows/Linux](https://www.nuget.org/packages/Microsoft.ML.OnnxRuntime/)<br>Mac - Coming Soon | [Windows/Linux](https://www.nuget.org/packages/Microsoft.ML.OnnxRuntime.Gpu/)<br>Mac - Coming Soon |
| [C++](onnxruntime/core/session/inference_session.h) | TBD | TBD |

## System Requirements
* The OnnxRuntime binaries in CPU packages use OpenMP and depends on the library being available at runtime in the system. For Windows, OpenMP support comes as part of VC runtime. For Linux, the system must have the libgomp.so.1 installed. 
* The GPU builds require the CUDA9.1 and cuDNN 7.3 runtime libraries being installed in the system. 

## Build Details
For details on the build configurations and information on how to create a build, see [Build ONNX Runtime](BUILD.md).

## Versioning
See more details on API and ABI Versioning and ONNX Compatibility in [Versioning](docs/Versioning.md).

# Design and Key Features
For an overview of the high level architecture and key decisions in the technical design of ONNX Runtime, see [Engineering Design](docs/HighLevelDesign.md).

ONNX Runtime is built with an extensible design that makes it versatile to support a wide array of models with high performance.

* [Add a custom operator/kernel](docs/AddingCustomOp.md)
* [Add an execution provider](docs/AddingExecutionProvider.md)
* [Add a new graph
transform](include/onnxruntime/core/optimizer/graph_transformer.h)
* [Add a new rewrite rule](include/onnxruntime/core/optimizer/rewrite_rule.h)

# Contribute
We welcome your contributions! Please see the [contribution guidelines](CONTRIBUTING.md).

## Feedback
For any feedback or to report a bug, please file a [GitHub Issue](https://github.com/Microsoft/onnxruntime/issues).

## Code of Conduct
This project has adopted the [Microsoft Open Source Code of Conduct](https://opensource.microsoft.com/codeofconduct/).
For more information see the [Code of Conduct FAQ](https://opensource.microsoft.com/codeofconduct/faq/)
or contact [opencode@microsoft.com](mailto:opencode@microsoft.com) with any additional questions or comments.

# License
[MIT License](LICENSE)
-												Miscellaneous fixes (#123)


											
										
										
											2018-12-07 06:21:04 +00:00
+								<p align="center"><img width="50%" src="docs/images/ONNX_Runtime_logo_dark.png" /></p>
-												update label for build badges (#301)


											
										
										
											2019-01-10 00:54:41 +00:00
+								[![Build Status](https://dev.azure.com/onnxruntime/onnxruntime/_apis/build/status/Windows%20CPU%20CI%20Pipeline?label=Windows+CPU)](https://dev.azure.com/onnxruntime/onnxruntime/_build/latest?definitionId=9)
 								[![Build Status](https://dev.azure.com/onnxruntime/onnxruntime/_apis/build/status/Windows%20GPU%20CI%20Pipeline?label=Windows+GPU)](https://dev.azure.com/onnxruntime/onnxruntime/_build/latest?definitionId=10)
 								[![Build Status](https://dev.azure.com/onnxruntime/onnxruntime/_apis/build/status/Linux%20CPU%20CI%20Pipeline?label=Linux+CPU)](https://dev.azure.com/onnxruntime/onnxruntime/_build/latest?definitionId=11)
 								[![Build Status](https://dev.azure.com/onnxruntime/onnxruntime/_apis/build/status/Linux%20GPU%20CI%20Pipeline?label=Linux+GPU)](https://dev.azure.com/onnxruntime/onnxruntime/_build/latest?definitionId=12)
 								[![Build Status](https://dev.azure.com/onnxruntime/onnxruntime/_apis/build/status/MacOS%20CI%20Pipeline?label=MacOS+CPU)](https://dev.azure.com/onnxruntime/onnxruntime/_build/latest?definitionId=13)
-												Initial bootstrap commit.

											
										
										
											2018-11-20 00:48:22 +00:00
-												Reduce whitespace at top of readme for easier reading (#148)

* Reduce whitespace at top of readme for easier reading

* Update README.md

											
										
										
											2018-12-11 21:52:47 +00:00
+								**ONNX Runtime** is an open-source scoring engine for Open Neural Network Exchange (ONNX) models.
-												Initial bootstrap commit.

											
										
										
											2018-11-20 00:48:22 +00:00
-												Faxu documentation (#16)

* Update README.md

* Update CONTRIBUTING.md

* Update README.md

* Update README.md

											
										
										
											2018-11-27 10:28:55 +00:00
+								ONNX is an open format for machine learning (ML) models that is supported by various ML and DNN frameworks and tools. This format makes it easier to interoperate between frameworks and to maximize the reach of your hardware optimization investments. Learn more about ONNX on [https://onnx.ai](https://onnx.ai) or view the [Github Repo](https://github.com/onnx/onnx).
 								# Why use ONNX Runtime
-												Faxu patch 1 (#63)

* Delete Roadmap.md

* Update README.md

* Update README.md

* Update README.md

											
										
										
											2018-12-01 06:10:26 +00:00
+								ONNX Runtime is an open architecture that is continually evolving to adapt to and address the newest developments and challenges in AI and Deep Learning. We will keep ONNX Runtime up to date with the ONNX standard, supporting all ONNX releases with future compatibliity while maintaining backwards compatibility with prior releases.
 								ONNX Runtime continuously strives to provide top performance for a broad and growing number of usage scenarios in Machine Learning. Our investments focus on these 3 core areas:
 . Run any ONNX model
 . High performance
 . Cross platform
-												Faxu documentation (#16)

* Update README.md

* Update CONTRIBUTING.md

* Update README.md

* Update README.md

											
										
										
											2018-11-27 10:28:55 +00:00
+								## Run any ONNX model
-												Faxu patch 1 (#63)

* Delete Roadmap.md

* Update README.md

* Update README.md

* Update README.md

											
										
										
											2018-12-01 06:10:26 +00:00
 								### Alignment with ONNX Releases
-												Faxu documentation (#16)

* Update README.md

* Update CONTRIBUTING.md

* Update README.md

* Update README.md

											
										
										
											2018-11-27 10:28:55 +00:00
+								ONNX Runtime provides comprehensive support of the ONNX spec and can be used to run all models based on ONNX v1.2.1 and higher. See ONNX version release details [here](https://github.com/onnx/onnx/releases).
-												Initial bootstrap commit.

											
										
										
											2018-11-20 00:48:22 +00:00
-												Addl TPN updates (#403)

* Updated TPN

* Update batch_norm_op_test.cc

* Update ThirdPartyNotices.txt

* Update ThirdPartyNotices.txt

* Update readme with package links

* Update README.md

* Update README.md

* Update README.md

* Merged Ryan and TPN changes into single PR

* minor fix

* added mkldnn to GPU pipeline. Required by C# library as it is the default execution provider

											
										
										
											2019-01-31 01:28:17 +00:00
+								As of January 2019, ONNX Runtime supports ONNX 1.3. We will soon add support for the recently released ONNX 1.4.
-												Faxu patch 1 (#63)

* Delete Roadmap.md

* Update README.md

* Update README.md

* Update README.md

											
										
										
											2018-12-01 06:10:26 +00:00
 								### Traditional ML support
 								ONNX Runtime fully supports the ONNX-ML profile of the ONNX spec for traditional ML scenarios.
 								## High Performance
 								You can use ONNX Runtime with both CPU and GPU hardware. You can also plug in additional execution providers to ONNX Runtime. With many graph optimizations and various accelerators, ONNX Runtime can often provide lower latency and higher efficiency compared to other runtimes. This provides smoother end-to-end customer experiences and lower costs from improved machine utilization.
 								Currently ONNX Runtime supports CUDA and MKL-DNN (with option to build with MKL) for computation acceleration. To add an execution provider, please refer to [this page](docs/AddingExecutionProvider.md).
 								We are continuously working to integrate new execution providers to provide improvements in latency and efficiency. We have ongoing collaborations to integrate the following with ONNX Runtime:
 									* Intel MKL-DNN and nGraph
 									* NVIDIA TensorRT
-												Faxu documentation (#16)

* Update README.md

* Update CONTRIBUTING.md

* Update README.md

* Update README.md

											
										
										
											2018-11-27 10:28:55 +00:00
+								## Cross Platform
 								ONNX Runtime offers:
-												Addl TPN updates (#403)

* Updated TPN

* Update batch_norm_op_test.cc

* Update ThirdPartyNotices.txt

* Update ThirdPartyNotices.txt

* Update readme with package links

* Update README.md

* Update README.md

* Update README.md

* Merged Ryan and TPN changes into single PR

* minor fix

* added mkldnn to GPU pipeline. Required by C# library as it is the default execution provider

											
										
										
											2019-01-31 01:28:17 +00:00
+								* APIs for Python, C#, and C
-												Faxu documentation (#16)

* Update README.md

* Update CONTRIBUTING.md

* Update README.md

* Update README.md

											
										
										
											2018-11-27 10:28:55 +00:00
+								* Available for Linux, Windows, and Mac
 								See API documentation and package installation instructions [below](#Installation).
-												Faxu patch 1 (#63)

* Delete Roadmap.md

* Update README.md

* Update README.md

* Update README.md

											
										
										
											2018-12-01 06:10:26 +00:00
+								Looking ahead: To broaden the reach of the runtime, we will continue investments to make ONNX Runtime available and compatible with more platforms. These include but are not limited to:
-												Addl TPN updates (#403)

* Updated TPN

* Update batch_norm_op_test.cc

* Update ThirdPartyNotices.txt

* Update ThirdPartyNotices.txt

* Update readme with package links

* Update README.md

* Update README.md

* Update README.md

* Merged Ryan and TPN changes into single PR

* minor fix

* added mkldnn to GPU pipeline. Required by C# library as it is the default execution provider

											
										
										
											2019-01-31 01:28:17 +00:00
+								* C# for Mac
-												Add Dockerfile and page for ARM builds. (#83)


											
										
										
											2018-12-04 03:13:59 +00:00
+								* [ARM](BUILD.md##arm-builds)
-												Faxu patch 1 (#63)

* Delete Roadmap.md

* Update README.md

* Update README.md

* Update README.md

											
										
										
											2018-12-01 06:10:26 +00:00
-												Faxu documentation (#16)

* Update README.md

* Update CONTRIBUTING.md

* Update README.md

* Update README.md

											
										
										
											2018-11-27 10:28:55 +00:00
+								# Getting Started
 								If you need a model:
 								* Check out the [ONNX Model Zoo](https://github.com/onnx/models) for ready-to-use pre-trained models.
 								* To get an ONNX model by exporting from various frameworks, see [ONNX Tutorials](https://github.com/onnx/tutorials).
-												Bug bash (#43)

* Update README.md

* Update Versioning.md

* Update rename_manylinux.sh

Remove duplicate word

* Update README.md

Remove a 'the' as ONNX Runtime is a proper noun.

* Update CUDA version to 9.1 cudnn version to 7.1

* Update ReleaseManagement.md

* put tensorflow copy-right headers

there are around 10 lines of code is borrowed from tflite.

* Update README.md

Mention C++ API

* Update README.md

Fix link

* Update C_API.md

Fix broken link to onnxruntime_c_api.h

* Update ABI.md

Delete mention of COM and fix 'ONNX Runtime' to be two words

* Update README.md

* Update README.md

* Update C_API.md

											
										
										
											2018-11-28 02:52:50 +00:00
+								If you already have an ONNX model, just [install the runtime](#Installation) for your machine to try it out. One easy way to deploy the model on the cloud is by using [Azure Machine Learning](https://azure.microsoft.com/en-us/services/machine-learning-service). See detailed instructions [here](https://docs.microsoft.com/en-us/azure/machine-learning/service/how-to-build-deploy-onnx).
-												Faxu documentation (#16)

* Update README.md

* Update CONTRIBUTING.md

* Update README.md

* Update README.md

											
										
										
											2018-11-27 10:28:55 +00:00
 								# Installation
 								## APIs and Official Builds
 								| API Documentation | CPU package | GPU package |
-												Initial bootstrap commit.

											
										
										
											2018-11-20 00:48:22 +00:00
+								|-----|-------------|-------------|
-												TPN update and link fix (#483)

* TPN update

* Update TPN

* Update link

											
										
										
											2019-02-18 06:29:10 +00:00
+								| [Python](https://aka.ms/onnxruntime-python) | [Windows/Linux/Mac](https://pypi.org/project/onnxruntime/)| [Windows/Linux](https://pypi.org/project/onnxruntime-gpu/)<br>(Compatible with CUDA9.1 and cuDNN 7.3) |
-												Addl TPN updates (#403)

* Updated TPN

* Update batch_norm_op_test.cc

* Update ThirdPartyNotices.txt

* Update ThirdPartyNotices.txt

* Update readme with package links

* Update README.md

* Update README.md

* Update README.md

* Merged Ryan and TPN changes into single PR

* minor fix

* added mkldnn to GPU pipeline. Required by C# library as it is the default execution provider

											
										
										
											2019-01-31 01:28:17 +00:00
+								| [C#](docs/CSharp_API.md) | [Windows/Linux](https://www.nuget.org/packages/Microsoft.ML.OnnxRuntime/)<br>Mac - Coming Soon| [Windows/Linux](https://www.nuget.org/packages/Microsoft.ML.OnnxRuntime.Gpu/)<br>Mac - Coming Soon|
 								| [C](docs/C_API.md) | [Windows/Linux](https://www.nuget.org/packages/Microsoft.ML.OnnxRuntime/)<br>Mac - Coming Soon | [Windows/Linux](https://www.nuget.org/packages/Microsoft.ML.OnnxRuntime.Gpu/)<br>Mac - Coming Soon |
-												Faxu patch 1 (#63)

* Delete Roadmap.md

* Update README.md

* Update README.md

* Update README.md

											
										
										
											2018-12-01 06:10:26 +00:00
+								| [C++](onnxruntime/core/session/inference_session.h) | TBD | TBD |
-												Initial bootstrap commit.

											
										
										
											2018-11-20 00:48:22 +00:00
-												Updated System requirements in README.md (#466)

* Updated System requirements in README.md

* spell correct

											
										
										
											2019-02-12 17:58:20 +00:00
+								## System Requirements
 								* The OnnxRuntime binaries in CPU packages use OpenMP and depends on the library being available at runtime in the system. For Windows, OpenMP support comes as part of VC runtime. For Linux, the system must have the libgomp.so.1 installed.
 								* The GPU builds require the CUDA9.1 and cuDNN 7.3 runtime libraries being installed in the system.
-												Faxu documentation (#16)

* Update README.md

* Update CONTRIBUTING.md

* Update README.md

* Update README.md

											
										
										
											2018-11-27 10:28:55 +00:00
+								## Build Details
 								For details on the build configurations and information on how to create a build, see [Build ONNX Runtime](BUILD.md).
-												Initial bootstrap commit.

											
										
										
											2018-11-20 00:48:22 +00:00
-												Faxu documentation (#16)

* Update README.md

* Update CONTRIBUTING.md

* Update README.md

* Update README.md

											
										
										
											2018-11-27 10:28:55 +00:00
+								## Versioning
 								See more details on API and ABI Versioning and ONNX Compatibility in [Versioning](docs/Versioning.md).
-												Initial bootstrap commit.

											
										
										
											2018-11-20 00:48:22 +00:00
-												Faxu documentation (#16)

* Update README.md

* Update CONTRIBUTING.md

* Update README.md

* Update README.md

											
										
										
											2018-11-27 10:28:55 +00:00
+								# Design and Key Features
 								For an overview of the high level architecture and key decisions in the technical design of ONNX Runtime, see [Engineering Design](docs/HighLevelDesign.md).
 								ONNX Runtime is built with an extensible design that makes it versatile to support a wide array of models with high performance.
-												Bug bash (#43)

* Update README.md

* Update Versioning.md

* Update rename_manylinux.sh

Remove duplicate word

* Update README.md

Remove a 'the' as ONNX Runtime is a proper noun.

* Update CUDA version to 9.1 cudnn version to 7.1

* Update ReleaseManagement.md

* put tensorflow copy-right headers

there are around 10 lines of code is borrowed from tflite.

* Update README.md

Mention C++ API

* Update README.md

Fix link

* Update C_API.md

Fix broken link to onnxruntime_c_api.h

* Update ABI.md

Delete mention of COM and fix 'ONNX Runtime' to be two words

* Update README.md

* Update README.md

* Update C_API.md

											
										
										
											2018-11-28 02:52:50 +00:00
+								* [Add a custom operator/kernel](docs/AddingCustomOp.md)
 								* [Add an execution provider](docs/AddingExecutionProvider.md)
-												Faxu documentation (#16)

* Update README.md

* Update CONTRIBUTING.md

* Update README.md

* Update README.md

											
										
										
											2018-11-27 10:28:55 +00:00
+								* [Add a new graph
-												The files of graph_transformer.h and rewrite_rule.h has been moved. (#446)


											
										
										
											2019-02-06 21:30:39 +00:00
+								transform](include/onnxruntime/core/optimizer/graph_transformer.h)
 								* [Add a new rewrite rule](include/onnxruntime/core/optimizer/rewrite_rule.h)
-												Faxu documentation (#16)

* Update README.md

* Update CONTRIBUTING.md

* Update README.md

* Update README.md

											
										
										
											2018-11-27 10:28:55 +00:00
 								# Contribute
 								We welcome your contributions! Please see the [contribution guidelines](CONTRIBUTING.md).
-												Initial bootstrap commit.

											
										
										
											2018-11-20 00:48:22 +00:00
-												Faxu documentation (#16)

* Update README.md

* Update CONTRIBUTING.md

* Update README.md

* Update README.md

											
										
										
											2018-11-27 10:28:55 +00:00
+								## Feedback
 								For any feedback or to report a bug, please file a [GitHub Issue](https://github.com/Microsoft/onnxruntime/issues).
-												Initial bootstrap commit.

											
										
										
											2018-11-20 00:48:22 +00:00
-												Faxu documentation (#16)

* Update README.md

* Update CONTRIBUTING.md

* Update README.md

* Update README.md

											
										
										
											2018-11-27 10:28:55 +00:00
+								## Code of Conduct
-												Initial bootstrap commit.

											
										
										
											2018-11-20 00:48:22 +00:00
+								This project has adopted the [Microsoft Open Source Code of Conduct](https://opensource.microsoft.com/codeofconduct/).
 								For more information see the [Code of Conduct FAQ](https://opensource.microsoft.com/codeofconduct/faq/)
 								or contact [opencode@microsoft.com](mailto:opencode@microsoft.com) with any additional questions or comments.
 								# License
-												Faxu documentation (#16)

* Update README.md

* Update CONTRIBUTING.md

* Update README.md

* Update README.md

											
										
										
											2018-11-27 10:28:55 +00:00
+								[MIT License](LICENSE)