onnxruntime/orttraining/tools/ci_test
PeixuanZuo cb4bf4f5c8
[ROCm] Move ROCm build step on CPU only machine (#16596)
- Move the ROCm build step to a CPU-only machine
- Add performance data for the huggingface bert-large model on the
MI200
- At the beginning of the test step, check the agent's GPU usage and
kill the threads occupying the GPU, which may be left over from previous
tasks that exited abnormally.
- Use different docker images for the build and test steps. The
images differ in the `uid` and `user` used when building the docker
image and creating the docker container.
2023-07-10 11:55:10 +08:00
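The GPU cleanup described in the commit message could look roughly like the sketch below. It is not the code from the commit: the function names `parse_gpu_pids` and `kill_stale_processes` are hypothetical, the report format (one line per entry, starting with a numeric PID) is an assumption, and treating the leftover work as OS processes that can be signalled is also an assumption. How the report text is obtained from the GPU tooling is deliberately left out.

```python
import os
import re
import signal
from typing import Callable, Iterable, List


def parse_gpu_pids(report: str) -> List[int]:
    """Collect PIDs from a text report.

    Assumption: each entry occupies one line whose first token is the PID.
    Header or separator lines that do not start with an integer are skipped.
    """
    pids = []
    for line in report.splitlines():
        match = re.match(r"\s*(\d+)\b", line)
        if match:
            pids.append(int(match.group(1)))
    return pids


def kill_stale_processes(
    pids: Iterable[int],
    kill: Callable[[int, int], None] = os.kill,
) -> List[int]:
    """Send SIGKILL to each PID and return the PIDs that were signalled.

    The `kill` parameter is injectable so the logic can be exercised
    without touching real processes.
    """
    killed = []
    for pid in pids:
        if pid == os.getpid():
            # Never signal the cleanup script itself.
            continue
        try:
            kill(pid, signal.SIGKILL)
            killed.append(pid)
        except (ProcessLookupError, PermissionError):
            # The process already exited or belongs to another user.
            continue
    return killed
```

In a CI test step, something like this would run before the first test so that a previous job that exited abnormally cannot hold GPU memory. Injecting `kill` keeps the sketch testable on a machine without a GPU.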
..
results [ROCm] Move ROCm build step on CPU only machine (#16596) 2023-07-10 11:55:10 +08:00
compare_huggingface.py Adopt linrtunner as the linting tool - take 2 (#15085) 2023-03-24 15:29:03 -07:00
compare_results.py Adopt linrtunner as the linting tool - take 2 (#15085) 2023-03-24 15:29:03 -07:00
download_azure_blob_archive.py Bump ruff in CI (#15533) 2023-04-17 10:11:44 -07:00
run_batch_size_test.py [ROCm] reduce batch size to fix CI error (#15714) 2023-05-16 13:10:02 +08:00
run_bert_perf_test.py Adopt linrtunner as the linting tool - take 2 (#15085) 2023-03-24 15:29:03 -07:00
run_convergence_test.py Adopt linrtunner as the linting tool - take 2 (#15085) 2023-03-24 15:29:03 -07:00
run_gpt2_perf_test.py Adopt linrtunner as the linting tool - take 2 (#15085) 2023-03-24 15:29:03 -07:00