<!---
Copyright 2020 The HuggingFace Team. All rights reserved.

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
-->

# Examples

Version 2.9 of 🤗 Transformers introduced a new [`Trainer`](https://github.com/huggingface/transformers/blob/master/src/transformers/trainer.py) class for PyTorch, and its equivalent [`TFTrainer`](https://github.com/huggingface/transformers/blob/master/src/transformers/trainer_tf.py) for TF 2.
Running the examples requires PyTorch 1.3.1+ or TensorFlow 2.2+.

Here is the list of all our examples:
- **grouped by task** (all official examples work for multiple models),
- with information on whether they are **built on top of `Trainer`/`TFTrainer`** (if not, they still work, but they
  might lack some features),
- with information on whether they leverage the [🤗 Datasets](https://github.com/huggingface/datasets) library,
- with links to **Colab notebooks** to walk through the scripts and run them easily,
- with links to **Cloud deployments** to run large-scale training in the Cloud with little to no setup.


## Important note

**Important**

To make sure you can successfully run the latest versions of the example scripts, you have to **install the library from source** and install some example-specific requirements.
Execute the following steps in a new virtual environment:

```bash
git clone https://github.com/huggingface/transformers
cd transformers
pip install .
pip install -r ./examples/requirements.txt
```

Alternatively, you can run the examples as they were at the release that matches your installed version of Transformers by checking out the corresponding tag (for instance, v3.4.0):
```bash
git checkout tags/v3.4.0
```
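
To find the tag that matches your installation, a small sketch like the following may help. It assumes release tags follow the `v<version>` pattern shown above; `matching_tag` is a hypothetical helper name, not part of the library.

```python
from importlib.metadata import PackageNotFoundError, version


def matching_tag(installed_version: str) -> str:
    """Build the git ref for a release, assuming tags are named `v<version>`."""
    return f"tags/v{installed_version}"


try:
    # Look up the installed package version without importing the library itself.
    print("git checkout", matching_tag(version("transformers")))
except PackageNotFoundError:
    print("transformers is not installed in this environment")
```

This is only meaningful for a released version; an installation from source may report a development version for which no tag exists.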

## The Big Table of Tasks

| Task | Example datasets | Trainer support | TFTrainer support | 🤗 Datasets | Colab
|---|---|:---:|:---:|:---:|:---:|
| [**`language-modeling`**](https://github.com/huggingface/transformers/tree/master/examples/language-modeling)       | Raw text        | ✅ | -  | ✅ | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/huggingface/blog/blob/master/notebooks/01_how_to_train.ipynb)
| [**`text-classification`**](https://github.com/huggingface/transformers/tree/master/examples/text-classification)   | GLUE, XNLI      | ✅ | ✅ | ✅ | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/huggingface/notebooks/blob/master/examples/text_classification.ipynb)
| [**`token-classification`**](https://github.com/huggingface/transformers/tree/master/examples/token-classification) | CoNLL NER       | ✅ | ✅ | ✅ | -
| [**`multiple-choice`**](https://github.com/huggingface/transformers/tree/master/examples/multiple-choice)           | SWAG, RACE, ARC | ✅ | ✅ | - | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/ViktorAlm/notebooks/blob/master/MPC_GPU_Demo_for_TF_and_PT.ipynb)
| [**`question-answering`**](https://github.com/huggingface/transformers/tree/master/examples/question-answering)     | SQuAD           | ✅ | ✅ | - | -
| [**`text-generation`**](https://github.com/huggingface/transformers/tree/master/examples/text-generation)           | -               | n/a | n/a | - | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/huggingface/blog/blob/master/notebooks/02_how_to_generate.ipynb)
| [**`distillation`**](https://github.com/huggingface/transformers/tree/master/examples/distillation)                 | All             | - | -  | - | -
| [**`summarization`**](https://github.com/huggingface/transformers/tree/master/examples/seq2seq)                     | CNN/Daily Mail  | ✅  | - | - | -
| [**`translation`**](https://github.com/huggingface/transformers/tree/master/examples/seq2seq)                       | WMT             | ✅  | - | - | -
| [**`bertology`**](https://github.com/huggingface/transformers/tree/master/examples/bertology)                       | -               | - | - | - | -
| [**`adversarial`**](https://github.com/huggingface/transformers/tree/master/examples/adversarial)                   | HANS            | ✅ | - | - | -


<br>

## One-click Deploy to Cloud (wip)

**Coming soon!**

## Running on TPUs

When using TensorFlow, TPUs are supported out of the box as a `tf.distribute.Strategy`.

When using PyTorch, we support TPUs thanks to `pytorch/xla`. For more context and information on how to set up your TPU environment, refer to Google's documentation and to the
very detailed [pytorch/xla README](https://github.com/pytorch/xla/blob/master/README.md).

In this repo, we provide a very simple launcher script named [xla_spawn.py](https://github.com/huggingface/transformers/tree/master/examples/xla_spawn.py) that lets you run our example scripts on multiple TPU cores without any boilerplate.
Just pass a `--num_cores` flag to this script, then your regular training script with its arguments (this is similar to the `torch.distributed.launch` helper for torch.distributed). 
Note that this approach does not work for examples that use `pytorch-lightning`.

For example, for `run_glue`:

```bash
python examples/xla_spawn.py --num_cores 8 \
	examples/text-classification/run_glue.py \
	--model_name_or_path bert-base-cased \
	--task_name mnli \
	--data_dir ./data/glue_data/MNLI \
	--output_dir ./models/tpu \
	--overwrite_output_dir \
	--do_train \
	--do_eval \
	--num_train_epochs 1 \
	--save_steps 20000
```
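
The command above has three parts: the launcher's own flag, the path to the training script, and the training script's arguments. The sketch below illustrates how a launcher of this kind can separate them with `argparse`. It is an illustration under assumptions, not the actual contents of `xla_spawn.py`, and `parse_launcher_args` is a hypothetical name.

```python
from argparse import REMAINDER, ArgumentParser


def parse_launcher_args(argv):
    """Split a launcher command line into its own flags and the script's arguments."""
    parser = ArgumentParser(description="Illustrative launcher argument parser")
    # Flag consumed by the launcher itself.
    parser.add_argument("--num_cores", type=int, default=1)
    # First positional argument: the training script to run.
    parser.add_argument("training_script", type=str)
    # Everything after the script path is passed through untouched.
    parser.add_argument("training_script_args", nargs=REMAINDER)
    return parser.parse_args(argv)


args = parse_launcher_args(
    ["--num_cores", "8", "examples/text-classification/run_glue.py", "--task_name", "mnli", "--do_train"]
)
print(args.num_cores, args.training_script, args.training_script_args)
```

Because the launcher flag must come before the script path, anything placed after the path is treated as an argument of the training script.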

Feedback, additional use cases, and benchmarks involving TPUs are welcome. Please share them with the community.

## Logging & Experiment tracking

You can easily log and monitor your runs. The following integrations are currently supported:

* [TensorBoard](https://www.tensorflow.org/tensorboard)
* [Weights & Biases](https://docs.wandb.com/library/integrations/huggingface)
* [Comet ML](https://www.comet.ml/docs/python-sdk/huggingface/)

### Weights & Biases

To use Weights & Biases, install the wandb package with:

```bash
pip install wandb
```

Then log in from the command line:

```bash
wandb login
```

If you are in Jupyter or Colab, log in with:

```python
import wandb
wandb.login()
```

Whenever you use `Trainer` or `TFTrainer` classes, your losses, evaluation metrics, model topology and gradients (for `Trainer` only) will automatically be logged.
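
The integration can be adjusted through environment variables set before the `Trainer` is created. The sketch below assumes the `WANDB_PROJECT` and `WANDB_WATCH` variables read by the integration; `configure_wandb` is a hypothetical helper and the project name is a placeholder. Check the W&B integration documentation linked above for the values supported by your installed version.

```python
import os


def configure_wandb(project: str, watch: str = "gradients") -> dict:
    """Set W&B environment variables and return the values that were applied."""
    settings = {"WANDB_PROJECT": project, "WANDB_WATCH": watch}
    os.environ.update(settings)
    return settings


# "my-glue-runs" is a placeholder project name; "false" turns off gradient logging.
applied = configure_wandb("my-glue-runs", watch="false")
print(applied)
```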

When using 🤗 Transformers with PyTorch Lightning, runs can be tracked through `WandbLogger`. Refer to the related [documentation & examples](https://docs.wandb.com/library/integrations/lightning).

### Comet.ml

To use `comet_ml`, install the Python package with:

```bash
pip install comet_ml
```

Or, if you are in a Conda environment:

```bash
conda install -c comet_ml -c anaconda -c conda-forge comet_ml
```