onnxruntime/tools/ci_build/github/azure-pipelines
Weixing Zhang aec4cb489e
ROCm EP for AMD GPU (#5480)
The ROCm EP is designed and implemented based on AMD GPU software stack named ROCm. Here is the link for the details about ROCm: https://rocmdocs.amd.com/en/latest/

ROCm EP was created based on the following things:
1. AMD GPU programming language: HIP
2. AMD GPU HIP language runtime: amdhip64
3. BLAS: rocBLAS, hipBLAS
4. DNN: miOpen
5. Collective Communication library: RCCL
6. cub: hipCub
7. …

Current status:
BERT-L and GPT2 training can be ran on AMD GPU with data parallel.

Next:
1. Make more GPU code be sharable between ROCm EP and CUDA EP since HIP language and HIP runtime API are very close to CUDA.
2. Continue improving the implementation.
3. Continue GPU kernel optimization.
4. Support model parallelism on ROCm EP.
……

The rocm kernels have been removed from this commit and will be in a separate PR. Since the original PR was too big(~180 files), it was suggested to split the PR into two parts, one is rocm-kernels, the other is non rocm kernels.  

Co-authored-by: Weixing Zhang <wezhan@microsoft.com>
Co-authored-by: sabreshao <sabre.shao@amd.com>
Co-authored-by: anghostcici <11013544+anghostcici@users.noreply.github.com>
Co-authored-by: Suffian Khan <sukha@microsoft.com>
Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>
2020-10-29 17:13:04 -07:00
..
nodejs Update ONNX commit (#5487) 2020-10-21 07:22:20 -07:00
nuget Remove MKLML build config (#5559) 2020-10-21 13:11:25 -07:00
templates Publish no-openmp python packages to test pypi (#5610) 2020-10-28 19:49:53 -07:00
android-x86_64-crosscompile-ci-pipeline.yml
c-api-packaging-pipelines.yml Update ONNX commit (#5487) 2020-10-21 07:22:20 -07:00
centos-ci-pipeline.yml
featurizers-py-packaging-pipeline.yml
java-api-packaging-pipelines-gpu.yml Update ONNX commit (#5487) 2020-10-21 07:22:20 -07:00
java-api-packaging-pipelines.yml Update ONNX commit (#5487) 2020-10-21 07:22:20 -07:00
linux-ci-pipeline.yml Remove MKLML build config (#5559) 2020-10-21 13:11:25 -07:00
linux-cpu-minimal-build-ci-pipeline.yml Update ONNX commit (#5487) 2020-10-21 07:22:20 -07:00
linux-dnnl-ci-pipeline.yml Update ONNX commit (#5487) 2020-10-21 07:22:20 -07:00
linux-gpu-ci-pipeline.yml Update ONNX commit (#5487) 2020-10-21 07:22:20 -07:00
linux-gpu-cuda-11-pipeline.yml Remove MKLML build config (#5559) 2020-10-21 13:11:25 -07:00
linux-gpu-tensorrt-ci-pipeline.yml
linux-multi-gpu-ci-pipeline.yml Update ONNX commit (#5487) 2020-10-21 07:22:20 -07:00
linux-multi-gpu-tensorrt-ci-pipeline.yml
linux-ngraph-ci-pipeline.yml
linux-nocontribops-ci-pipeline.yml Update ONNX commit (#5487) 2020-10-21 07:22:20 -07:00
linux-nuphar-ci-pipeline.yml
linux-openvino-ci-pipeline.yml
linux-openvino-nightly-pipeline.yml
linux-ort-srv-ci-pipeline.yml
linux-ort-srv-nightly-pipeline.yml
linux-pytorch-custom-ops-ci-pipeline.yml
mac-ci-pipeline.yml
mac-ios-ci-pipeline.yml
mac-nocontribops-ci-pipeline.yml
orttraining-linux-ci-pipeline.yml Remove MKLML build config (#5559) 2020-10-21 13:11:25 -07:00
orttraining-linux-gpu-ci-pipeline.yml
orttraining-linux-gpu-docker-release-pipeline.yml
orttraining-linux-gpu-e2e-test-ci-pipeline.yml Liqun/remove number matching (#5606) 2020-10-27 21:27:37 -07:00
orttraining-linux-gpu-frontend-test-ci-pipeline.yml Liqun/remove number matching (#5606) 2020-10-27 21:27:37 -07:00
orttraining-linux-gpu-perf-test-ci-pipeline.yml
orttraining-mac-ci-pipeline.yml
orttraining-pai-ci-pipeline.yml ROCm EP for AMD GPU (#5480) 2020-10-29 17:13:04 -07:00
orttraining-py-packaging-pipeline.yml
orttraining-win-ci-pipeline.yml
orttraining-win-gpu-ci-pipeline.yml
post-merge-jobs.yml
py-packaging-pipeline.yml
win-arm-crosscompile-ci-pipeline.yml
win-arm64-crosscompile-ci-pipeline.yml
win-ci-pipeline.yml
win-gpu-ci-pipeline.yml
win-gpu-cuda-11-pipeline.yml
win-gpu-reduce-op-ci-pipeline.yml
win-gpu-tensorrt-ci-pipeline.yml
win-ngraph-ci-pipeline.yml
win-nocontribops-ci-pipeline.yml
win-x86-nocontribops-ci-pipeline.yml