onnxruntime/tools/ci_build/github
Suffian Khan e6de0eb813
Add nightly pipeline for MI100 to run convergence and batch size test similar to V100. (#6611)
* Partial updating of ROCM reduction code.

* Update reduction_all.cu

* Add reduce template parameters.

* miopen common

* Reuse CUDA's reduction_functions.cc

* Reduction ops.

* Update remaining reduction ops to use MIOpen. double datatype is not supported, so disable those typed kernels.

* Disable a couple more unsupported tests.

* Code formatting.

* Delete ROCM-specific reduction code that is identical to CUDA reduction code.

* Fix scratch buffer early free.

* Fix merge conflict.

* first attempt nightly amd ci pipeline

* try fix bad yaml file

* try again with corrected model directory

* add convergence test as well

* update reference loss for amd mi100

* include mi100 test results csv

* update the mi100 convergence test reference values

* update batch sizes for mi100 32g

* fix gpu sku for run_convergence_test.py

* undo unrelated changes to master

* pr comments

* pr comment

Co-authored-by: Jesse Benson <jesseb@microsoft.com>
2021-02-12 13:22:06 -08:00
android Add python 3.8/3.9 support for Windows GPU and Linux ARM64 (#6615) 2021-02-11 16:43:35 -08:00
azure-pipelines Add nightly pipeline for MI100 to run convergence and batch size test similar to V100. (#6611) 2021-02-12 13:22:06 -08:00
linux Add python 3.8/3.9 support for Windows GPU and Linux ARM64 (#6615) 2021-02-11 16:43:35 -08:00
pai Support double for operators ReduceMax, ReduceMin (#6265) 2021-02-08 19:14:26 -08:00
windows Add python 3.8/3.9 support for Windows GPU and Linux ARM64 (#6615) 2021-02-11 16:43:35 -08:00
download_test_data.py
Doxyfile_csharp.cfg Document all C# API public interfaces (#5853) 2020-11-20 14:03:55 -08:00