mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-05-14 20:48:00 +00:00

History

S. Manohar Karlapalem 6d4f2f5bf9 OpenVINO EP v2.0 (#3585 ) * Added FP16 transformations * Revert "Added CMAKE_BUILD_TYPE to make building dynamic" This reverts commit d3e17af1af655cfdc4d2fec33f52055caa525e85. * Added FP16 transformations for FP16 builds * Backend logic cleanup Cleans the backend(intel_graph.) code in the following ways:- 1. Minimize global usage: Since all the IR graphs need to be re-generated on every Infer, it is bad practice to rely on globals for their saving and usage as there would be multiple readers and writers to the same global variable leading to incorrect usages or contentions. This change replaces globals with locals where possible. This change also fixes an existing bug with due to incorrect global usage. 2. Remove all unused functions. 3. Remove all unused headers and prepocessor directives. removed commented out code * Disabled default optimization for Intel EP Signed-off-by: suryasidd <surya.siddharth.pemmaraju@intel.com> * Fix missed plugins.xml for python bindings * Fixed the build after latest master changes Signed-off-by: suryasidd <surya.siddharth.pemmaraju@intel.com> * Disabled unsupported ops for accelerators Signed-off-by: suryasidd <surya.siddharth.pemmaraju@intel.com> * Added some more disabled ops Signed-off-by: suryasidd <surya.siddharth.pemmaraju@intel.com> * Added environment variable to enable debugging Signed-off-by: suryasidd <surya.siddharth.pemmaraju@intel.com> * Added more debug statements Signed-off-by: suryasidd <surya.siddharth.pemmaraju@intel.com> * Fixed unsupported ops list for GPU and VPU Signed-off-by: suryasidd <surya.siddharth.pemmaraju@intel.com> * Fixed unsqueeze unit tests Signed-off-by: suryasidd <surya.siddharth.pemmaraju@intel.com> * Added error message to the status Signed-off-by: suryasidd <surya.siddharth.pemmaraju@intel.com> * Overwrite Model proto with shape info from data Overwrites the shape info of Model proto with the shape from actual input data. Needed for inferring models with Dynamic shapes. * Removed print statement and disabled where op Signed-off-by: suryasidd <surya.siddharth.pemmaraju@intel.com> * Disabled Reshape with Empty initializer * Added more debug statements for 1P * Don't allow 1D inputs with symbol for dimension * Disabled some 3rd phase ops * Disabled split and added zero dimension check for OutputDefs * Cleanup zero dimensionality check * Added different data type check for inputs and initializers * Added conditions for Mod, Cast and Pad * Removed unused variable * Disabled scan and added conditions for squeeze * Added changes for fixing all C++ unit tests * Implements Backend Manager class for caching Backend Manager provides a layer of indirection between EP interface and OV backend that provides caching services for models with symbolic dims in input shapes. * clean up commented blocks * clang-formatting * Read I/O type info from ModleProto Read the tensor element type information from ModelProto object, as FusedNode is no longer available. * code cleanup * clang-formatting * Added print statement for jenkins * Disabled some python tests * Changed the path of convert fp32 to fp16 hpp * Added conditions for BatchNorm in GetCapability * Fixed failed tests * Revert "Added conditions for BatchNorm in GetCapability" This reverts commit c3c28c3b00d27892c42546b35dacdd807a48ee90. * Added Intel to onnxruntime backends * pick up vars set by OV package setupvars.sh * Added conditions for Identity * remove a few cout prints * Added conditions for GPU_FP32 unit tests * Revert "pick up vars set by OV package setupvars.sh" This reverts commit 8199e029c03eae21a1a7ef6bfdc93d00e5d0198b. * Commented out fatal message for protobuf * Might need to be removed * Add interface class for current backend * moved common logic to base class * simplified cpu backend * Removed unused headers * use vectors to save i/o tensors for windows compatibility * move utils fxns to backend_utils namespace * rename ov_backend to ibackend * Factory pattern for backend creation * rename CPU backend to Basic backend * renamed to vad-M and added to factory list * Added conditions for VPU * Added print statements * Changed the logic for checking for symbolic shapes * Modified logic for zero dimension check * Removed VPU single dimension condition * Removed comments * Modified logic in DimensionCheck method * Remove legacy OpenVINO EP Remove all the legacy code for OpenVINO EP. UEP code will take its place going forward. This change does NOT remove OVEP files in the following areas asa they will be reused by UEP:- 1. Documentation: All .md files 2. Docker releated files 3. Python bindings 4. Java bindings 5. C# bindings 6. ORT Server 7. CI pipeline setup files * Rename Intel EP to OpenVINO EP * Added unique names to the subgraphs * Removed subgraphs with only constant inputs * Modified subgraph partitioning algorithm to remove const input subgraphs * Apply suggestion to onnxruntime/core/providers/openvino/openvino_execution_provider.cc * Tracking output names to fix the output order bug * Changed output names to a unordered map * Modified logic to check for symbolic input shapes * Fixed a bug in Reshape check * Added empty model path to Model constructor * Made necessary changes to cmake to build from the binary package * Changed INTEL_CVSDK_DIR to INTEL_OPENVINO_DIR * Enable dyn device selection with C++ API * Added Round operator to unsupported list * Modified subgraph partition logic for MYRIAD * Removed supported ops from the list * Enable dyn dev selection in Py API's * Add documentation for dynamic device selection * Use MYRIAD \|\| HDDL instead of VPU * Removed temporary cast of Int64 to FP32 * Disabled unit Tests for CPU_FP32 and GPU_FP32 * Removed default "CPU" from unit tests to allow overriding * Removed ops Concat, Squeeze, Unsqueeze from unsupported list * Get the device id from info * Removed overwriting device_id and precision * Enabled ConvTranspose and EyeLike * Reordered unsupported ops in alphabetical order * Fixed syntax error * Fixed syntax error * Code clean-up: Handle exceptions, logs and formatting Code formatted according to ORT coding guidelines. * remove debug print from pybind code * updated docs with ops and models * formatting prints * Added default values for c and j for openvino * Overriding the values set for c and j to be 1 * BACKEND_OPENVINO should be empty if openvino is not in build * Overriding c value with default for perftest * fix VAD-M device string bug * Add IE error details to exceptions * Use IE specific device names in EP * Add VAD-F (FPGA) device support * Removed unecessary libraries from whl package * Code changes for Windows compatibility * Add VAD-F option to python API * [revert before merge] cmake changes for RC * Enable Windows build in CMake * Unset macro OPTIONAL for windows builds inference_engine.hpp's include chain defines a macro 'OPTIONAL' which conflicts with onnx project's headers when using MSVC. So would need to explictly unset it for MSVC. * Use a single copy of plugin/IE::Core Defined as a static member in Backend manager * Remove restriction of single subgraphs for myriad * Passed subgraph name to Backend to enhance log statements * Disabled zero dimension conditions * Disabled concat to remove zero dims * Enabled building ngraph as part of ORT * Removed serializing and added versioning * Fix CPU_FP32 unit tests * Removed unecessary condition * add ngraph.so.0.0 to .whl * Check for zero dimensions only for inputs and outputs * Restrict loading only 10 subgraphs on myriad * Build ngraph.dll within UEP. Doesn't link yet * Rename Linux included libngraph.so to libovep_ngraph.so Renames locally built libngraph.so containing ONNX importer to libovep_ngraph.so in order to avoid linkage conflicts with libngraph.so supplied by OpenVINO binary installer. Applies only for Linux builds. * use output_name cmake properties for lib name * fix .so name format in lib_name.patch * CMake code cleanup * Rename WIN32 included ngraph.dll to ovep_ngraph.dll To avoid conflict with ngraph.dll distributed by openvino. * Added myriad config for networks without 4 dimensions * Loading the 10 max clusters for inference on myriad * Refactor code and add Batching support Encapsulate subgraph settings into context structs. Add batching support for completely supported models. * Disabled some broken tests * use input_indexes to avoid batch-checking initializers * Avoid static initialization order error on WOS * Added candy to broken tests * InternalCI changes for 2020.2 * Updated DLDT instructions * Unsaved changed in install_openvino.sh * Changes after manual check * Remove custom ngraph onnx_import build for WOS ONNX Importer on WOS does not have protobuf issue. * Remove FP32ToFP16 ngraph pass This conversion is performed implicitly within IE. * Surround debug logic by #ifndef NDEBUG * remove invalid TODO comments * removed references to ngrpah-ep * clang-formatting * remove commented code * comment edits * updating copyright year to that of first OpenVINO-EP release * remove redundant log msg * Modified operator and topology support * Update build instructions * doc formatting * Fixed clip unit tests * Revert "Remove FP32ToFP16 ngraph pass" This reverts commit ec962ca5f315a5658ad980e740196f19de2639c1. * Applying FP16 transformation only for GPU FP16 * Fixed GPU FP32 python tests * automatically use full protobuf * disable onnxrt server for now * Disabled upsample * update dockerfile instructions * Removed MO paths and added ngraph path * Remove OVEP from ORT Server docs Will put it back in after validation * Updated path to Ngraph lib * Disabled Resize and some other python tests * Removed unnecesary header files * Use commit SHA to fetch ngraph repo * Avoid un-needed file changes due to version update * Fixed clip tests * Fixed Pow, max and min onnx tests * build.md doc typo * Update cmake patch command for ngraph src * remove dead cmake code for onnxruntime_USE_OPENVINO_BINARY * use spaces instead of tab * remove commented code * Add info about protobuf version * edit debug env var and enable for WIN32 * specify only version tag of 2020.2 for dockerbuilds * remove unnecessary file changes * Pass empty string as default argument to C# tests * Use ${OPENVINO_VERSION} to name openvino install directory in CI builds * Enabled unnecessarily disabled tests * Fixed ngraph protobuf patch * Fixed error in protobuf patch * Revert "Use ${OPENVINO_VERSION} to name openvino install directory in CI builds" This reverts commit 89e72adb8bf3b9712f5c81c5e13fe68c6c0df002. * Remove unsetting OPTIONAL macro This is no longer used in recent ONNX update onnx/onnx@da13be2, so this unset workaround is no longer necessary. * Use a null string default argument for C# API * Set OpenVINO version yml files and pass to CI Docker builds Git Tag info for DLDT as well as install directory are set using this value. This reverts commit 9fa9c20348ed72ae360a95c98e9b074d2f9fafc5. * Documentation: recommendation and instructions for disabling ORT graph optimizations * more doc updates * Reduced the number of models according to CI time constraints Co-authored-by: ynimmaga <yamini.nimmagadda@intel.com> Co-authored-by: suryasidd <surya.siddharth.pemmaraju@intel.com> Co-authored-by: Mikhail Treskin <mikhail.treskin@intel.com> Co-authored-by: mbencer <mateusz.bencer@intel.com> Co-authored-by: Aravind <aravindx.gunda@intel.com> Co-authored-by: suryasidd <48925384+suryasidd@users.noreply.github.com>		2020-04-24 04:06:02 -07:00
..
scripts	TensorRT dockerfile updates (#3016 )	2020-02-13 14:34:00 -08:00
Dockerfile.arm32v7	Treat attribute warning as non-error on cross compiling ARM (#1261 )	2019-06-23 17:59:38 -07:00
Dockerfile.cuda	update dockerfiles/README (#2336 )	2019-11-06 16:54:10 -08:00
Dockerfile.ngraph	update dockerfiles/README (#2336 )	2019-11-06 16:54:10 -08:00
Dockerfile.nuphar	[NupharEP] refine parallel schedule control (#2514 )	2019-12-02 17:40:51 -08:00
Dockerfile.openvino	OpenVINO EP v2.0 (#3585 )	2020-04-24 04:06:02 -07:00
Dockerfile.server	Fix onnxruntime server docker file build failure (#3219 )	2020-03-15 14:46:46 -07:00
Dockerfile.source	merge two RUN to avoid making docker image too larger	2019-10-30 09:57:56 -07:00
Dockerfile.tensorrt	TensorRT dockerfile updates (#3016 )	2020-02-13 14:34:00 -08:00
LICENSE-IMAGE.txt	Dockerfiles for TensorRT, CUDA, build from source (#922 )	2019-07-09 02:03:55 -07:00
README.md	OpenVINO EP v2.0 (#3585 )	2020-04-24 04:06:02 -07:00

README.md

Docker Containers for ONNX Runtime

Dockerfiles

CPU Dockerfile, Instructions
CUDA + CUDNN: Dockerfile, Instructions
nGraph: Dockerfile, Instructions
TensorRT: Dockerfile, Instructions
OpenVINO: Dockerfile, Instructions
Nuphar: Dockerfile, Instructions
ARM 32v7: Dockerfile, Instructions
ONNX-Ecosystem (CPU + Converters): Dockerfile, Instructions
ONNX Runtime Server: Dockerfile, Instructions

Published Microsoft Container Registry (MCR) Images

Use docker pull with any of the images and tags below to pull an image and try for yourself. Note that the CPU, CUDA, and TensorRT images include additional dependencies like miniconda for compatibility with AzureML image deployment.

Example: Run docker pull mcr.microsoft.com/azureml/onnxruntime:latest-cuda to pull the latest released docker image with ONNX Runtime GPU, CUDA, and CUDNN support.

Build Flavor	Base Image	ONNX Runtime Docker Image tags	Latest
Source (CPU)	mcr.microsoft.com/azureml/onnxruntime	:v0.4.0, :v0.5.0, v0.5.1, :v1.0.0, :v1.2.0	:latest
CUDA (GPU)	mcr.microsoft.com/azureml/onnxruntime	:v0.4.0-cuda10.0-cudnn7, :v0.5.0-cuda10.1-cudnn7, v0.5.1-cuda10.1-cudnn7, :v1.0.0-cuda10.1-cudnn7, v1.2.0-cuda10.1-cudnn7	:latest-cuda
TensorRT (x86)	mcr.microsoft.com/azureml/onnxruntime	:v0.4.0-tensorrt19.03, :v0.5.0-tensorrt19.06, v1.0.0-tensorrt19.09, v1.2.0-tensorrt20.01	:latest-tensorrt
OpenVino (VAD-M)	mcr.microsoft.com/azureml/onnxruntime	:v0.5.0-openvino-r1.1-vadm, v1.0.0-openvino-r1.1-vadm	:latest-openvino-vadm
OpenVino (MYRIAD)	mcr.microsoft.com/azureml/onnxruntime	:v0.5.0-openvino-r1.1-myriad, :v1.0.0-openvino-r1.1-myriad	:latest-openvino-myriad
OpenVino (CPU)	mcr.microsoft.com/azureml/onnxruntime	:v1.0.0-openvino-r1.1-cpu	:latest-openvino-cpu
nGraph	mcr.microsoft.com/azureml/onnxruntime	:v1.0.0-ngraph-v0.26.0	:latest-ngraph
Nuphar	mcr.microsoft.com/azureml/onnxruntime		:latest-nuphar
Server	mcr.microsoft.com/onnxruntime/server	:v0.4.0, :v0.5.0, v0.5.1, v1.0.0	:latest

Building and using Docker images

CPU

Ubuntu 16.04, CPU, Python Bindings

Build the docker image from the Dockerfile in this repository.

docker build -t onnxruntime-source -f Dockerfile.source .

Run the Docker image

docker run -it onnxruntime-source

CUDA

Ubuntu 16.04, CUDA 10.0, CuDNN 7

Build the docker image from the Dockerfile in this repository.

docker build -t onnxruntime-cuda -f Dockerfile.cuda .

Run the Docker image

docker run -it onnxruntime-cuda

nGraph

Public Preview

Ubuntu 16.04, Python Bindings

Build the docker image from the Dockerfile in this repository.

docker build -t onnxruntime-ngraph -f Dockerfile.ngraph .

Run the Docker image

docker run -it onnxruntime-ngraph

TensorRT

Ubuntu 18.04, CUDA 10.2, TensorRT 7.0.0

Build the docker image from the Dockerfile in this repository.

docker build -t onnxruntime-trt -f Dockerfile.tensorrt .

Run the Docker image

docker run -it onnxruntime-trt

OpenVINO

Public Preview

Ubuntu 16.04, Python Bindings

Build the onnxruntime image for one of the accelerators supported below.

Retrieve your docker image in one of the following ways.
- To build your docker image, download the OpenVINO online installer version 2020.2 for Linux from this link and copy the OpenVINO tar file to the same directory before building the Docker image. The online installer size is 16MB and the components needed for the accelerators are mentioned in the dockerfile. Providing the docker build argument DEVICE enables the onnxruntime build for that particular device. You can also provide arguments ONNXRUNTIME_REPO and ONNXRUNTIME_BRANCH to test that particular repo and branch. Default repository is http://github.com/microsoft/onnxruntime and default branch is master.
```
docker build -t onnxruntime --build-arg DEVICE=$DEVICE .
```
- Pull the official image from DockerHub.

DEVICE: Specifies the hardware target for building OpenVINO Execution Provider. Below are the options for different Intel target devices.

Device Option	Target Device
`CPU_FP32` \|	Intel CPUs \|
`GPU_FP32` \|	ntel Integrated Graphics \|
`GPU_FP16` \|	Intel Integrated Graphics \|
`MYRIAD_FP16` \|	Intel Movidius^TM USB sticks \|
`VAD-M_FP16` \|	Intel Vision Accelerator Design based on Movidius^TM MyriadX VPUs \|

OpenVINO on CPU

Retrieve your docker image in one of the following ways.
- Build the docker image from the DockerFile in this repository.
```
docker build -t onnxruntime-cpu --build-arg DEVICE=CPU_FP32 --network host .
```
- Pull the official image from DockerHub.
```
# Will be available with next release
```
Run the docker image
```
 docker run -it onnxruntime-cpu
```

OpenVINO on GPU

Retrieve your docker image in one of the following ways.
- Build the docker image from the DockerFile in this repository.
```
 docker build -t onnxruntime-gpu --build-arg DEVICE=GPU_FP32 --network host .
```
- Pull the official image from DockerHub.
```
  # Will be available with next release
```

Run the docker image

docker run -it --device /dev/dri:/dev/dri onnxruntime-gpu:latest

OpenVINO on Myriad VPU Accelerator

Retrieve your docker image in one of the following ways.
- Build the docker image from the DockerFile in this repository.
```
 docker build -t onnxruntime-myriad --build-arg DEVICE=MYRIAD_FP16 --network host .
```
- Pull the official image from DockerHub.
```
 # Will be available with next release
```
Install the Myriad rules drivers on the host machine according to the reference in here

Run the docker image by mounting the device drivers

docker run -it --network host --privileged -v /dev:/dev  onnxruntime-myriad:latest

OpenVINO on VAD-M Accelerator Version

Retrieve your docker image in one of the following ways.
- Build the docker image from the DockerFile in this repository.
```
 docker build -t onnxruntime-vadr --build-arg DEVICE=VAD-M_FP16 --network host .
```
- Pull the official image from DockerHub.
```
 # Will be available with next release
```
Install the HDDL drivers on the host machine according to the reference in here

Run the docker image by mounting the device drivers

docker run -it --device --mount type=bind,source=/var/tmp,destination=/var/tmp --device /dev/ion:/dev/ion  onnxruntime-hddl:latest

ARM 32v7

Public Preview

The Dockerfile used in these instructions specifically targets Raspberry Pi 3/3+ running Raspbian Stretch. The same approach should work for other ARM devices, but may require some changes to the Dockerfile such as choosing a different base image (Line 0: FROM ...).

Install DockerCE on your development machine by following the instructions here
Create an empty local directory
```
mkdir onnx-build
cd onnx-build
```
Save the Dockerfile to your new directory
- Dockerfile.arm32v7
Run docker build

This will build all the dependencies first, then build ONNX Runtime and its Python bindings. This will take several hours.
```
docker build -t onnxruntime-arm32v7 -f Dockerfile.arm32v7 .
```
Note the full path of the .whl file
- Reported at the end of the build, after the # Build Output line.
- It should follow the format onnxruntime-0.3.0-cp35-cp35m-linux_armv7l.whl, but version number may have changed. You'll use this path to extract the wheel file later.
Check that the build succeeded

Upon completion, you should see an image tagged onnxruntime-arm32v7 in your list of docker images:
```
docker images
```
Extract the Python wheel file from the docker image

(Update the path/version of the .whl file with the one noted in step 5)
```
docker create -ti --name onnxruntime_temp onnxruntime-arm32v7 bash
docker cp onnxruntime_temp:/code/onnxruntime/build/Linux/MinSizeRel/dist/onnxruntime-0.3.0-cp35-cp35m-linux_armv7l.whl .
docker rm -fv onnxruntime_temp
```
This will save a copy of the wheel file, onnxruntime-0.3.0-cp35-cp35m-linux_armv7l.whl, to your working directory on your host machine.
Copy the wheel file (onnxruntime-0.3.0-cp35-cp35m-linux_armv7l.whl) to your Raspberry Pi or other ARM device

On device, install the ONNX Runtime wheel file

sudo apt-get update
sudo apt-get install -y python3 python3-pip
pip3 install numpy

# Install ONNX Runtime
# Important: Update path/version to match the name and location of your .whl file
pip3 install onnxruntime-0.3.0-cp35-cp35m-linux_armv7l.whl

Test installation by following the instructions here

Nuphar

Public Preview

Ubuntu 16.04, Python Bindings

Build the docker image from the Dockerfile in this repository.

docker build -t onnxruntime-nuphar -f Dockerfile.nuphar .

Run the Docker image

docker run -it onnxruntime-nuphar

ONNX Runtime Server

Public Preview

Ubuntu 16.04

Build the docker image from the Dockerfile in this repository

docker build -t {docker_image_name} -f Dockerfile.server .

Run the ONNXRuntime server with the image created in step 1

docker run -v {localModelAbsoluteFolder}:{dockerModelAbsoluteFolder} -p {your_local_port}:8001 {imageName} --model_path {dockerModelAbsolutePath}

Send HTTP requests to the container running ONNX Runtime Server

Send HTTP requests to the docker container through the binding local port. Here is the full usage document.

curl  -X POST -d "@request.json" -H "Content-Type: application/json" http://0.0.0.0:{your_local_port}/v1/models/mymodel/versions/3:predict

Device Option	Target Device
`CPU_FP32` \|	Intel CPUs \|
`GPU_FP32` \|	ntel Integrated Graphics \|
`GPU_FP16` \|	Intel Integrated Graphics \|
`MYRIAD_FP16` \|	Intel Movidius^TM USB sticks \|
`VAD-M_FP16` \|	Intel Vision Accelerator Design based on Movidius^TM MyriadX VPUs \|