transformers/docker/transformers-all-latest-gpu/Dockerfile

FROM nvidia/cuda:12.1.0-cudnn8-devel-ubuntu22.04
LABEL maintainer="Hugging Face"

ARG DEBIAN_FRONTEND=noninteractive

# Use login shell to read variables from `~/.profile` (to pass dynamic created variables between RUN commands)
SHELL ["sh", "-lc"]

# The following `ARG` are mainly used to specify the versions explicitly & directly in this docker file, and not meant
# to be used as arguments for docker build (so far).

ARG PYTORCH='2.4.0'
# (not always a valid torch version)
ARG INTEL_TORCH_EXT='2.3.0'
# Example: `cu102`, `cu113`, etc.
ARG CUDA='cu121'

RUN apt update
RUN apt install -y git libsndfile1-dev tesseract-ocr espeak-ng python3 python3-pip ffmpeg git-lfs
RUN git lfs install
RUN python3 -m pip install --no-cache-dir --upgrade pip

ARG REF=main
RUN git clone https://github.com/huggingface/transformers && cd transformers && git checkout $REF

# 1. Put several commands in a single `RUN` to avoid image/layer exporting issue. Could be revised in the future.
# 2. Regarding `torch` part, We might need to specify proper versions for `torchvision` and `torchaudio`.
#    Currently, let's not bother to specify their versions explicitly (so installed with their latest release versions).
RUN python3 -m pip install --no-cache-dir -U tensorflow==2.13 protobuf==3.20.3 "tensorflow_text<2.16" "tensorflow_probability<0.22" && python3 -m pip install --no-cache-dir -e ./transformers[dev,onnxruntime] && [ ${#PYTORCH} -gt 0 -a "$PYTORCH" != "pre" ] && VERSION='torch=='$PYTORCH'.*' ||  VERSION='torch'; echo "export VERSION='$VERSION'" >> ~/.profile && echo torch=$VERSION && [ "$PYTORCH" != "pre" ] && python3 -m pip install --no-cache-dir -U $VERSION torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/$CUDA || python3 -m pip install --no-cache-dir -U --pre torch torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/nightly/$CUDA

RUN python3 -m pip uninstall -y flax jax

RUN python3 -m pip install --no-cache-dir intel_extension_for_pytorch==$INTEL_TORCH_EXT -f https://developer.intel.com/ipex-whl-stable-cpu

RUN python3 -m pip install --no-cache-dir git+https://github.com/facebookresearch/detectron2.git pytesseract
RUN python3 -m pip install -U "itsdangerous<2.1.0"

RUN python3 -m pip install --no-cache-dir git+https://github.com/huggingface/accelerate@main#egg=accelerate

RUN python3 -m pip install --no-cache-dir git+https://github.com/huggingface/peft@main#egg=peft

# For bettertransformer
RUN python3 -m pip install --no-cache-dir git+https://github.com/huggingface/optimum@main#egg=optimum

# For video model testing
RUN python3 -m pip install --no-cache-dir av==9.2.0

# Some slow tests require bnb
RUN python3 -m pip install --no-cache-dir bitsandbytes

# Some tests require quanto
RUN python3 -m pip install --no-cache-dir quanto

# `quanto` will install `ninja` which leads to many `CUDA error: an illegal memory access ...` in some model tests
# (`deformable_detr`, `rwkv`, `mra`)
RUN python3 -m pip uninstall -y ninja

# For `dinat` model
# The `XXX` part in `torchXXX` needs to match `PYTORCH` (to some extent)
RUN python3 -m pip install --no-cache-dir natten==0.15.1+torch220$CUDA -f https://shi-labs.com/natten/wheels

# For `nougat` tokenizer
RUN python3 -m pip install --no-cache-dir python-Levenshtein

# For `FastSpeech2ConformerTokenizer` tokenizer
RUN python3 -m pip install --no-cache-dir g2p-en

# When installing in editable mode, `transformers` is not recognized as a package.
# this line must be added in order for python to be aware of transformers.
RUN cd transformers && python3 setup.py develop
Drop support for Python 3.8 (#34314) * drop python 3.8 * update docker files --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2024-10-24 09:16:55 +00:00			`FROM nvidia/cuda:12.1.0-cudnn8-devel-ubuntu22.04`
[Test refactor 5/5] Build docker images (#15729) 2022-02-23 20:48:19 +00:00			`LABEL maintainer="Hugging Face"`

			`ARG DEBIAN_FRONTEND=noninteractive`

Enable PyTorch nightly build CI (#17335) * nightly build pytorch CI * fix working dir * change time and event name Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2022-06-17 14:42:27 +00:00			# Use login shell to read variables from `~/.profile` (to pass dynamic created variables between RUN commands)
			`SHELL ["sh", "-lc"]`

Explicit versions in docker files (#17586) * Update docker file Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2022-06-08 13:04:22 +00:00			# The following `ARG` are mainly used to specify the versions explicitly & directly in this docker file, and not meant
			`# to be used as arguments for docker build (so far).`

use torch 2.4 in 2 CI jobs (#32302) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2024-07-29 20:12:21 +00:00			`ARG PYTORCH='2.4.0'`
Explicit versions in docker files (#17586) * Update docker file Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2022-06-08 13:04:22 +00:00			`# (not always a valid torch version)`
Use `torch 2.3` for CI (#30837) 2.3 Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2024-05-15 17:31:52 +00:00			`ARG INTEL_TORCH_EXT='2.3.0'`
Explicit versions in docker files (#17586) * Update docker file Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2022-06-08 13:04:22 +00:00			# Example: `cu102`, `cu113`, etc.
Use `torch 2.3` for CI (#30837) 2.3 Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2024-05-15 17:31:52 +00:00			`ARG CUDA='cu121'`
Explicit versions in docker files (#17586) * Update docker file Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2022-06-08 13:04:22 +00:00
[Test refactor 5/5] Build docker images (#15729) 2022-02-23 20:48:19 +00:00			`RUN apt update`
CLI: add stricter automatic checks to `pt-to-tf` (#17588) * Stricter pt-to-tf checks; Update docker image for related tests * check all attributes in the output Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> 2022-06-08 09:45:10 +00:00			`RUN apt install -y git libsndfile1-dev tesseract-ocr espeak-ng python3 python3-pip ffmpeg git-lfs`
			`RUN git lfs install`
[Test refactor 5/5] Build docker images (#15729) 2022-02-23 20:48:19 +00:00			`RUN python3 -m pip install --no-cache-dir --upgrade pip`

Rename master to main for notebooks links and leftovers (#16397) 2022-03-25 13:12:23 +00:00			`ARG REF=main`
[Test refactor 5/5] Build docker images (#15729) 2022-02-23 20:48:19 +00:00			`RUN git clone https://github.com/huggingface/transformers && cd transformers && git checkout $REF`

Fix docker image build for `Latest PyTorch + TensorFlow [dev]` (#29764) * update * update --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2024-03-21 12:14:29 +00:00			# 1. Put several commands in a single `RUN` to avoid image/layer exporting issue. Could be revised in the future.
			# 2. Regarding `torch` part, We might need to specify proper versions for `torchvision` and `torchaudio`.
			`# Currently, let's not bother to specify their versions explicitly (so installed with their latest release versions).`
pin `tensorflow_probability<0.22` in docker files (#34381) 0.21 Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2024-10-28 10:59:46 +00:00			RUN python3 -m pip install --no-cache-dir -U tensorflow==2.13 protobuf==3.20.3 "tensorflow_text<2.16" "tensorflow_probability<0.22" && python3 -m pip install --no-cache-dir -e ./transformers[dev,onnxruntime] && [ ${#PYTORCH} -gt 0 -a "$PYTORCH" != "pre" ] && VERSION='torch=='$PYTORCH'.*' \|\| VERSION='torch'; echo "export VERSION='$VERSION'" >> ~/.profile && echo torch=$VERSION && [ "$PYTORCH" != "pre" ] && python3 -m pip install --no-cache-dir -U $VERSION torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/$CUDA \|\| python3 -m pip install --no-cache-dir -U --pre torch torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/nightly/$CUDA
Enable PyTorch nightly build CI (#17335) * nightly build pytorch CI * fix working dir * change time and event name Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2022-06-17 14:42:27 +00:00
[Test refactor 5/5] Build docker images (#15729) 2022-02-23 20:48:19 +00:00			`RUN python3 -m pip uninstall -y flax jax`
Use latest stable PyTorch/DeepSpeed for Push & Scheduled CI (#17417) * update versions Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2022-06-07 09:53:05 +00:00
Fix daily CI image build (#27307) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2023-11-06 10:27:22 +00:00			`RUN python3 -m pip install --no-cache-dir intel_extension_for_pytorch==$INTEL_TORCH_EXT -f https://developer.intel.com/ipex-whl-stable-cpu`
Use latest stable PyTorch/DeepSpeed for Push & Scheduled CI (#17417) * update versions Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2022-06-07 09:53:05 +00:00
Change the import of kenlm from github to pypi (#19770) * Change the import of kenlm from github to pypi * Change the import of kenlm from github to pypi in circleci config * Fix code quality issues * Fix isort issue, add kenlm in extras for audio * Add kenlm to deps * Add kenlm to deps * Commit 'make fixup' changes * Remove version from kenlm deps * commit make fixup changes * Remove manual installation of kenlm * Remove manual installation of kenlm * Remove manual installation of kenlm 2022-10-26 15:06:46 +00:00			`RUN python3 -m pip install --no-cache-dir git+https://github.com/facebookresearch/detectron2.git pytesseract`
[Test refactor 5/5] Build docker images (#15729) 2022-02-23 20:48:19 +00:00			`RUN python3 -m pip install -U "itsdangerous<2.1.0"`

install dev. version of accelerate (#17243) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2022-05-13 17:47:09 +00:00			`RUN python3 -m pip install --no-cache-dir git+https://github.com/huggingface/accelerate@main#egg=accelerate`

[`PEFT`] Peft integration alternative design (#25077) * a draft version * v2 integration * fix * make it more generic and works for IA3 * add set adapter and multiple adapters support * fixup * adapt a bit * oops * oops * oops * adapt more * fix * add more refactor * now works with model class * change it to instance method as it causes issues with `jit`. * add CR * change method name * add `add_adapter` method * clean up * Update src/transformers/adapters/peft_mixin.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * add moe utils * fixup * Update src/transformers/adapters/peft_mixin.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * adapt * oops * fixup * add is_peft_available * remove `requires_backend` * trainer compatibility * fixup + docstring * more details * trigger CI * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/modeling_utils.py * fixup + is_main_process * added `save_peft_format` in save_pretrained * up * fix nits here and there * nits here and there. * docs * revert `encoding="utf-8"` * comment * added slow tests before the PEFT release. * fixup and nits * let's be on the safe zone * added more comments * v1 docs * add remaining docs * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * move to `lib_integrations` * fixup * this time fixup * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * address final comments * refactor to use `token` * add PEFT to DockerFile for slow tests. * added pipeline support. --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> 2023-08-18 17:08:03 +00:00			`RUN python3 -m pip install --no-cache-dir git+https://github.com/huggingface/peft@main#egg=peft`

Use torch 2.2 for daily CI (model tests) (#29208) * Use torch 2.2 for daily CI (model tests) * update * update --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2024-02-23 13:37:08 +00:00			`# For bettertransformer`
GPTQ integration (#25062) * GTPQ integration * Add tests for gptq * support for more quantization model * fix style * typo * fix method * Update src/transformers/modeling_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * add dataclass and fix quantization_method * fix doc * Update tests/quantization/gptq/test_gptq.py Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * modify dataclass * add gtpqconfig import * fix typo * fix tests * remove dataset as req arg * remove tokenizer import * add offload cpu quantization test * fix check dataset * modify dockerfile * protect trainer * style * test for config * add more log * overwrite torch_dtype * draft doc * modify quantization_config docstring * fix class name in docstring * Apply suggestions from code review Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * more warning * fix 8bit kwargs tests * peft compatibility * remove var * fix is_gptq_quantized * remove is_gptq_quantized * fix wrap * Update src/transformers/modeling_utils.py Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * add exllama * skip test * overwrite float16 * style * fix skip test * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * fix docsting formatting * add doc * better test --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> 2023-08-10 20:06:29 +00:00			`RUN python3 -m pip install --no-cache-dir git+https://github.com/huggingface/optimum@main#egg=optimum`
Add methods to PreTrainedModel to use PyTorch's BetterTransformer (#21259) * fix mess * better documentation * typo * fix doc * update * add test * fix test * more tests * Update src/transformers/modeling_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * move to utils * Apply suggestions from code review Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com> * nit --------- Co-authored-by: younesbelkada <younesbelkada@gmail.com> Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com> 2023-04-27 09:03:42 +00:00
Use PyAV instead of Decord in examples (#21572) * Use PyAV instead of Decord * Get frame indices * Fix number of frames * Update src/transformers/models/videomae/image_processing_videomae.py * Fix up * Fix copies * Update timesformer doctests * Update docstrings 2023-03-02 12:30:38 +00:00			`# For video model testing`
removes decord (#33987) * removes decord dependency optimize np Revert "optimize" This reverts commit faa136b51ec4ec5858e5b0ae40eb7ef89a88b475. helpers as documentation pydoc missing keys * make fixup * require_av --------- Co-authored-by: ad <hi@arnaudiaz.com> 2024-10-17 15:27:34 +00:00			`RUN python3 -m pip install --no-cache-dir av==9.2.0`
[VideoMAE] Add model to doc tests (#18523) * Add videomae to doc tests * Add pip install decord Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local> 2022-08-08 17:28:51 +00:00
FIX: re-add bnb on docker image (#30427) Update Dockerfile 2024-04-23 13:32:54 +00:00			`# Some slow tests require bnb`
			`RUN python3 -m pip install --no-cache-dir bitsandbytes`

Quantized KV Cache (#30483) * clean-up * Update src/transformers/cache_utils.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/cache_utils.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/cache_utils.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * fixup * Update tests/quantization/quanto_integration/test_quanto.py Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Update src/transformers/generation/configuration_utils.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * more suggestions * mapping if torch available * run tests & add 'support_quantized' flag * fix jamba test * revert, will be fixed by another PR * codestyle * HQQ and versatile cache classes * final update * typo * make tests happy --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> 2024-05-23 12:25:20 +00:00			`# Some tests require quanto`
			`RUN python3 -m pip install --no-cache-dir quanto`

Remove `ninja` from docker image build (#31080) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2024-05-28 09:36:26 +00:00			# `quanto` will install `ninja` which leads to many `CUDA error: an illegal memory access ...` in some model tests
			# (`deformable_detr`, `rwkv`, `mra`)
			`RUN python3 -m pip uninstall -y ninja`

Update docker files to use official torch 2.0.0 (#22357) * update docker files to use official torch 2.0.0 --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2023-03-24 13:29:05 +00:00			# For `dinat` model
Fix natten install in docker (#30161) * fix dinat in docker * update --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2024-04-10 15:45:49 +00:00			# The `XXX` part in `torchXXX` needs to match `PYTORCH` (to some extent)
			`RUN python3 -m pip install --no-cache-dir natten==0.15.1+torch220$CUDA -f https://shi-labs.com/natten/wheels`
Install `natten` with CUDA version (#20546) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2022-12-05 14:08:32 +00:00
Install `python-Levenshtein` for `nougat` in CI image (#27465) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2023-11-13 15:38:13 +00:00			# For `nougat` tokenizer
			`RUN python3 -m pip install --no-cache-dir python-Levenshtein`

More fixes for doctest (#30265) * fix * update * update * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> 2024-04-16 09:58:55 +00:00			# For `FastSpeech2ConformerTokenizer` tokenizer
			`RUN python3 -m pip install --no-cache-dir g2p-en`

[Test refactor 5/5] Build docker images (#15729) 2022-02-23 20:48:19 +00:00			# When installing in editable mode, `transformers` is not recognized as a package.
			`# this line must be added in order for python to be aware of transformers.`
			`RUN cd transformers && python3 setup.py develop`