onnxruntime/tools/ci_build/github/azure-pipelines/orttraining-linux-nightly-ortmodule-test-pipeline.yml
Ashwini Khade f71ac9859e
Update acpt image in the training pipeline (#14855)
### Description
Current pipeline refers to an old image which is causing test failures.
Updating the image to the latest one.



### Motivation and Context
<!-- - Why is this change required? What problem does it solve?
Fixes pipeline failure:
https://dev.azure.com/onnxruntime/onnxruntime/_build?definitionId=198
- If it fixes an open issue, please link to the issue here. -->
2023-03-07 14:10:32 -08:00

28 lines
1 KiB
YAML

trigger: none
jobs:
- job: Onnxruntime_Linux_Nightly_ORTModule_tests
timeoutInMinutes: 120
pool: 'Onnxruntime-Linux-GPU-NC6sv3'
steps:
- checkout: self
clean: true
submodules: recursive
# Entry point for all ortmodule training tests
- script: |
docker run \
--gpus all \
--rm \
--volume $(Build.SourcesDirectory)/orttraining/orttraining/test/python:/onnxruntime_src \
--volume $(Build.SourcesDirectory)/tools/ci_build/github/linux/docker/scripts/training/ortmodule/stage1/requirements_torch_nightly:/requirements_torch_nightly \
ptebic.azurecr.io/internal/azureml/aifx/nightly-ubuntu2004-cu117-py38-torch200dev \
bash -c "python3 -m pip install -r /requirements_torch_nightly/requirements.txt && python3 -m pytest -sv /onnxruntime_src/orttraining_test_ortmodule_api.py"
displayName: 'Run ORTModule Tests'
condition: succeededOrFailed()
timeoutInMinutes: 120
- template: templates/component-governance-component-detection-steps.yml
parameters:
condition: 'succeeded'