onnxruntime/tools/ci_build/github/azure-pipelines
Suffian Khan e6de0eb813
Add nightly pipeline for MI100 to run convergence and batch size test similar to V100. (#6611)
* Partial updating of ROCM reduction code.

* Update reduction_all.cu

* Add reduce template parameters.

* miopen common

* Reuse CUDA's reduction_functions.cc

* Reduction ops.

* Update remaining reduction ops to use MIOpen. double datatype is not supported, so disable those typed kernels.

* Disable a couple more unsupported tests.

* Code formatting.

* Delete ROCM-specific reduction code that is identical to CUDA reduction code.

* Fix scratch buffer early free.

* Fix merge conflict.

* first attempt nightly amd ci pipeline

* try fix bad yaml file

* try again with corrected model directory

* add convergence test as well

* update reference loss for amd mi100

* include mi100 test results csv

* update the mi100 convergence test reference values

* update batch sizes for mi100 32g

* fix gpu sku for run_convergence_test.py

* undo unrelated changes to master

* pr comments

* pr comment

Co-authored-by: Jesse Benson <jesseb@microsoft.com>
2021-02-12 13:22:06 -08:00
nodejs Add python 3.8/3.9 support for Windows GPU and Linux ARM64 (#6615) 2021-02-11 16:43:35 -08:00
nuget Add python 3.8/3.9 support for Windows GPU and Linux ARM64 (#6615) 2021-02-11 16:43:35 -08:00
templates Add python 3.8/3.9 support for Windows GPU and Linux ARM64 (#6615) 2021-02-11 16:43:35 -08:00
android-x86_64-crosscompile-ci-pipeline.yml Add python 3.8/3.9 support for Windows GPU and Linux ARM64 (#6615) 2021-02-11 16:43:35 -08:00
c-api-packaging-pipelines.yml Add python 3.8/3.9 support for Windows GPU and Linux ARM64 (#6615) 2021-02-11 16:43:35 -08:00
centos-ci-pipeline.yml Add python 3.8/3.9 support for Windows GPU and Linux ARM64 (#6615) 2021-02-11 16:43:35 -08:00
clean-build-docker-image-cache-pipeline.yml Update build docker image cache cleanup (#6048) 2020-12-07 13:07:19 -08:00
featurizers-py-packaging-pipeline.yml
linux-ci-pipeline.yml Add python 3.8/3.9 support for Windows GPU and Linux ARM64 (#6615) 2021-02-11 16:43:35 -08:00
linux-cpu-minimal-build-ci-pipeline.yml Add CI build with type reduction enabled (#6622) 2021-02-10 13:31:51 -08:00
linux-dnnl-ci-pipeline.yml Add python 3.8/3.9 support for Windows GPU and Linux ARM64 (#6615) 2021-02-11 16:43:35 -08:00
linux-gpu-ci-pipeline.yml Update GPU packaging pipelines to cuda11 and fix the other build break issues (#6585) 2021-02-05 16:58:37 -08:00
linux-gpu-cuda-11-pipeline.yml Update GPU packaging pipelines to cuda11 and fix the other build break issues (#6585) 2021-02-05 16:58:37 -08:00
linux-gpu-tensorrt-ci-perf-pipeline.yml Tensorrt perf tool (#5436) 2020-11-06 12:27:42 -08:00
linux-gpu-tensorrt-ci-pipeline.yml Add python 3.8/3.9 support for Windows GPU and Linux ARM64 (#6615) 2021-02-11 16:43:35 -08:00
linux-multi-gpu-ci-pipeline.yml Update GPU packaging pipelines to cuda11 and fix the other build break issues (#6585) 2021-02-05 16:58:37 -08:00
linux-multi-gpu-tensorrt-ci-pipeline.yml Cache build docker images in container registry. (#5811) 2020-11-17 17:02:24 -08:00
linux-nocontribops-ci-pipeline.yml Add python 3.8/3.9 support for Windows GPU and Linux ARM64 (#6615) 2021-02-11 16:43:35 -08:00
linux-nuphar-ci-pipeline.yml Add python 3.8/3.9 support for Windows GPU and Linux ARM64 (#6615) 2021-02-11 16:43:35 -08:00
linux-openvino-ci-pipeline.yml Add python 3.8/3.9 support for Windows GPU and Linux ARM64 (#6615) 2021-02-11 16:43:35 -08:00
linux-openvino-nightly-pipeline.yml Cache build docker images in container registry. (#5811) 2020-11-17 17:02:24 -08:00
linux-ort-srv-ci-pipeline.yml Add python 3.8/3.9 support for Windows GPU and Linux ARM64 (#6615) 2021-02-11 16:43:35 -08:00
linux-pytorch-custom-ops-ci-pipeline.yml
mac-ci-pipeline.yml deprecate omp in ci 2021-02-08 19:10:13 -08:00
mac-coreml-ci-pipeline.yml [CoreML EP] Add CI for CoreML EP (macOS) and add coreml_flags for EP options (#6481) 2021-01-28 12:25:46 -08:00
mac-ios-ci-pipeline.yml
mac-nocontribops-ci-pipeline.yml
orttraining-linux-ci-pipeline.yml Add python 3.8/3.9 support for Windows GPU and Linux ARM64 (#6615) 2021-02-11 16:43:35 -08:00
orttraining-linux-gpu-amd-e2e-test-ci-pipeline.yml Add nightly pipeline for MI100 to run convergence and batch size test similar to V100. (#6611) 2021-02-12 13:22:06 -08:00
orttraining-linux-gpu-ci-pipeline.yml Cache build docker images in container registry. (#5811) 2020-11-17 17:02:24 -08:00
orttraining-linux-gpu-distributed-e2e-test-pipeline.yml merge e2e with distributed pipeline (#6443) 2021-01-28 14:17:47 -08:00
orttraining-linux-gpu-distributed-test-ci-pipeline.yml Increase the distributes tests pipeline timeout to 120 minutes (#6479) 2021-01-28 12:04:26 -08:00
orttraining-linux-gpu-docker-release-pipeline.yml
orttraining-linux-gpu-e2e-test-ci-pipeline.yml Cache build docker images in container registry. (#5811) 2020-11-17 17:02:24 -08:00
orttraining-linux-gpu-perf-test-ci-pipeline.yml Cache build docker images in container registry. (#5811) 2020-11-17 17:02:24 -08:00
orttraining-mac-ci-pipeline.yml
orttraining-pai-ci-pipeline.yml Fix AMD GPU pipeline by adjusting reference /opt/rocm-3.9.0 => /opt/rocm (#6063) 2020-12-08 08:53:20 -08:00
orttraining-py-packaging-pipeline.yml Clean up builds (#6015) 2020-12-04 15:13:17 -08:00
orttraining-win-ci-pipeline.yml
orttraining-win-gpu-ci-pipeline.yml
post-merge-jobs.yml
py-packaging-pipeline.yml
win-ci-fuzz-testing.yml Add python 3.8/3.9 support for Windows GPU and Linux ARM64 (#6615) 2021-02-11 16:43:35 -08:00
win-ci-pipeline.yml Add python 3.8/3.9 support for Windows GPU and Linux ARM64 (#6615) 2021-02-11 16:43:35 -08:00
win-gpu-ci-pipeline.yml Add python 3.8/3.9 support for Windows GPU and Linux ARM64 (#6615) 2021-02-11 16:43:35 -08:00
win-gpu-cuda-11-pipeline.yml Add python 3.8/3.9 support for Windows GPU and Linux ARM64 (#6615) 2021-02-11 16:43:35 -08:00
win-gpu-reduce-op-ci-pipeline.yml Change how ONNX get installed 2021-02-10 14:41:26 -08:00
win-gpu-tensorrt-ci-pipeline.yml Add python 3.8/3.9 support for Windows GPU and Linux ARM64 (#6615) 2021-02-11 16:43:35 -08:00