onnxruntime/docs/execution_providers/Vitis-AI-ExecutionProvider.md
edelaye 64b5f7edf6
Initial release of Vitis-AI Execution Provider (#3771)
* Initial release of Vitis-AI Execution Provider

* Add documentation, fix for onnxruntime::Model changes and use stringstream instead of file dump for model passing

* - Add Vitis-AI docker file
- Add online quantization flow Vitis-AI execution provider
- Fix remarks

* - Add fatal error build message for Vitis-AI cmake build on Windows
- Fix pep8 issue in build.py
- Add Vitis-AI execution provider example in docs

Co-authored-by: Elliott Delaye <elliott@xilinx.com>
Co-authored-by: Jorn Tuyls <jornt@xilinx.com>
Co-authored-by: Jorn Tuyls <jtuyls@users.noreply.github.com>
2020-05-19 05:32:32 -07:00

4.4 KiB

Vitis-AI Execution Provider

Vitis-AI is Xilinx's development stack for hardware-accelerated AI inference on Xilinx platforms, including both edge devices and Alveo cards. It consists of optimized IP, tools, libraries, models, and example designs. It is designed with high efficiency and ease of use in mind, unleashing the full potential of AI acceleration on Xilinx FPGA and ACAP.

Build

For build instructions, please see the BUILD page. Please setup the hardware environment before starting the build: Hardware setup.

System requirements

The following table lists system requirements for running docker containers as well as Alveo cards.

Component Requirement
Motherboard PCI Express 3.0-compliant with one dual-width x16 slot
System Power Supply 225W
Operating System Ubuntu 16.04, 18.04
CentOS 7.4, 7.5
RHEL 7.4, 7.5
CPU Intel i3/i5/i7/i9/Xeon 64-bit CPU
GPU Optional to accelerate quantization NVIDIA GPU with a compute capability > 3.0
CUDA Driver Optional to accelerate quantization nvidia-410
FPGA Xilinx Alveo U200 or U250
Docker Version 19.03.1

Hardware setup

  1. Clone the Vitis AI repository:

    git clone https://github.com/xilinx/vitis-ai
    
  2. Install the Docker, and add the user to the docker group. Link the user to docker installation instructions from the following docker's website:

  3. Any GPU instructions will have to be separated from Vitis AI.

  4. Set up Vitis AI to target Alveo cards. To target Alveo cards with Vitis AI for machine learning workloads, you must install the following software components:

    • Xilinx Runtime (XRT)
    • Alveo Deployment Shells (DSAs)
    • Xilinx Resource Manager (XRM) (xbutler)
    • Xilinx Overlaybins (Accelerators to Dynamically Load - binary programming files)

    While it is possible to install all of these software components individually, a script has been provided to automatically install them at once. To do so:

    • Run the following commands:
      cd Vitis-AI/alveo/packages
      sudo su
      ./install.sh
      
    • Power cycle the system.
  5. Build and start the ONNXRuntime Vitis-AI Docker Container.

    cd {onnxruntime-root}/dockerfiles
    docker build -t onnxruntime-vitisai -f Dockerfile.vitisai .
    ./scripts/docker_run_vitisai.sh
    

    Setup inside container

    source /opt/xilinx/xrt/setup.sh
    conda activate vitis-ai-tensorflow
    

Samples

For python, you can base yourself on the following example:

# Import pyxir before onnxruntime
import pyxir
import pyxir.frontend.onnx
import pyxir.contrib.dpuv1.dpuv1

import onnxruntime

# Add other imports 
# ...

# Load inputs and do preprocessing
# ...

# Create an inference session using the Vitis-AI execution provider
session = onnxruntime.InferenceSession('[model_file].onnx', None,["VitisAIExecutionProvider"])

# First N (default = 128) inputs are used for quantization calibration and will
#   be executed on the CPU
imput_name = [...]
outputs = [session.run([], {input_name: calib_inputs[i]})[0] for i in range(128)]

# Afterwards, computations will be accelerated on the FPGA
input_data = [...]
result = session.run([], {input_name: input_data})