transformers/.github/workflows
Funtowicz Morgan 2d71307dc0
Integrate AMD GPU in CI/CD environment (#26007)
* Add a Dockerfile for PyTorch + ROCm based on official AMD released artifact

* Add a new artifact single-amdgpu testing on main

* Attempt to test the workflow without merging.

* Changed BERT to check if things are triggered

* Meet the dependencies graph on workflow

* Revert BERT changes

* Add check_runners_amdgpu to correctly mount and check availability

* Rename setup to setup_gpu for CUDA and add setup_amdgpu for AMD

* Fix all the needs.setup -> needs.setup_[gpu|amdgpu] dependencies

* Fix setup dependency graph to use check_runner_amdgpu

* Let's do the runner status check only on AMDGPU target

* Update the Dockerfile.amd to put ourselves in / rather than /var/lib

* Restore the whole setup for CUDA too.

* Let's redisable them

* Change BERT to trigger tests

* Restore BERT

* Add torchaudio with rocm 5.6 to AMD Dockerfile (#26050)

fix dockerfile

Co-authored-by: Felix Marty <felix@hf.co>

* Place AMD GPU tests in a separate workflow (correct branch) (#26105)

AMDGPU CI lives in an other workflow

* Fix invalid job name is dependencies.

* Remove tests multi-amdgpu for now.

* Use single-amdgpu

* Use --net=host for now.

* Remote host networking.

* Removed duplicated check_runners_amdgpu step

* Let's tag machine-types with mi210 for now.

* Machine type should be only mi210

* Remove unnecessary push.branches item

* Apply review suggestions moving from `x-amdgpu` to `x-gpu` introducing `amd-gpu` and `miXXX` labels.

* Remove amdgpu from step names.

* finalize

* delete

---------

Co-authored-by: fxmarty <9808326+fxmarty@users.noreply.github.com>
Co-authored-by: Felix Marty <felix@hf.co>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-09-20 14:48:49 +02:00
..
add-model-like.yml Skip Add model like job (#24865) 2023-07-17 15:52:04 -04:00
build-docker-images.yml Integrate AMD GPU in CI/CD environment (#26007) 2023-09-20 14:48:49 +02:00
build-nightly-ci-docker-images.yml Fix DeepSpeed stuff in the nightly CI (#23478) 2023-05-19 20:31:55 +02:00
build-past-ci-docker-images.yml Byebye pytorch 1.9 (#24080) 2023-06-16 16:38:23 +02:00
build_documentation.yml [doc build] Use secrets (#24079) 2023-06-07 17:33:39 +02:00
build_pr_documentation.yml [i18n-fr] Translate index page to French (#21458) 2023-02-06 12:25:49 -05:00
check_runner_status.yml Make Slack CI reporting stronger (#21823) 2023-02-28 17:12:44 +01:00
check_tiny_models.yml Make tiny model creation + pipeline testing more robust (#22500) 2023-04-06 17:45:55 +02:00
delete_doc_comment.yml [doc build] Use secrets (#24079) 2023-06-07 17:33:39 +02:00
delete_doc_comment_trigger.yml Update delete_doc_comment_trigger.yml (#24084) 2023-06-07 17:55:48 +02:00
doctests.yml Add flax installation in daily doctest workflow (#25860) 2023-08-30 15:13:50 +02:00
model-templates.yml Update some GH action versions (#20537) 2022-12-06 16:54:40 +01:00
release-conda.yml
self-nightly-past-ci-caller.yml Byebye pytorch 1.9 (#24080) 2023-06-16 16:38:23 +02:00
self-nightly-scheduled.yml Fix confusing transformers installation in CI (#23465) 2023-05-19 22:10:18 +02:00
self-past.yml Fix confusing transformers installation in CI (#23465) 2023-05-19 22:10:18 +02:00
self-push-amd.yml Integrate AMD GPU in CI/CD environment (#26007) 2023-09-20 14:48:49 +02:00
self-push-caller.yml Replace set-output by $GITHUB_OUTPUT (#20547) 2022-12-05 18:25:13 +01:00
self-push.yml Update workflow files (#23658) 2023-05-22 21:26:51 +02:00
self-scheduled.yml Unpin DeepSpeed and require DS >= 0.9.3 (#24541) 2023-06-28 14:01:22 +02:00
stale.yml Sets the stalebot to 10 AM CEST (#25678) 2023-08-23 14:21:07 +02:00
TROUBLESHOOT.md
update_metdata.yml pin pandas==2.0.3 (#25875) 2023-08-30 18:10:01 +02:00
upload_pr_documentation.yml [doc build] Use secrets (#24079) 2023-06-07 17:33:39 +02:00