transformers/examples/research_projects
Suraj Patil d3bd9ac728
[Flax] improve large model init and loading (#16148)
* begin do_init

* add params_shape_tree

* raise error if params are accessed when do_init is False

* don't allow do_init=False when keys are missing

* make shape tree a property

* assign self._params at the end

* add test for do_init

* add do_init arg to all flax models

* fix param setting

* disable do_init for composite models

* update test

* add do_init in FlaxBigBirdForMultipleChoice

* better names and errors

* improve test

* style

* add a warning when do_init=False

* remove extra if

* set params after _required_params

* add test for from_pretrained

* do_init => _do_init

* change warning to info

* fix typo

* add params in init_weights

* add params to gpt neo init

* add params to init_weights

* update do_init test

* Trigger CI

* Apply suggestions from code review

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* update template

* trigger CI

* style

* style

* fix template

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2022-04-19 14:19:55 +02:00
adversarial Update namespaces inside torch.utils.data to the latest. (#13167) 2021-08-19 14:29:51 +02:00
bert-loses-patience Fix tiny typo (#15884) 2022-03-02 15:37:05 +01:00
bertabs
bertology
codeparrot Jia multi gpu eval (#16428) 2022-04-11 11:24:32 +02:00
decision_transformer Decision transformer gym (#15845) 2022-03-23 16:18:43 -04:00
deebert
distillation Fix minor comment typos (#15740) 2022-02-21 12:41:27 +01:00
fsner Updates the default branch from master to main (#16326) 2022-03-23 03:46:59 -04:00
jax-projects [Flax] improve large model init and loading (#16148) 2022-04-19 14:19:55 +02:00
longform-qa
luke Add (M)Luke model training for Token Classification in the examples (#14880) 2022-01-31 07:58:18 -05:00
lxmert Upgrade black to version ~=22.0 (#15565) 2022-02-09 09:28:57 -05:00
mlm_wwm [urls to hub] Replace outdated model tags with their now-canonical pipeline types (#14617) 2021-12-06 04:35:01 -05:00
mm-imdb Updates the default branch from master to main (#16326) 2022-03-23 03:46:59 -04:00
movement-pruning Updates the default branch from master to main (#16326) 2022-03-23 03:46:59 -04:00
onnx/summarization Upgrade black to version ~=22.0 (#15565) 2022-02-09 09:28:57 -05:00
performer [urls to hub] Replace outdated model tags with their now-canonical pipeline types (#14617) 2021-12-06 04:35:01 -05:00
pplm [research_projects] deal with security alerts (#15594) 2022-02-11 14:31:09 -05:00
quantization-qdqbert Qdqbert example add benchmark script with ORT-TRT (#16592) 2022-04-12 11:13:59 -04:00
rag Fix example logs repeating themselves (#16669) 2022-04-11 16:25:16 -04:00
rag-end2end-retriever Updates the default branch from master to main (#16326) 2022-03-23 03:46:59 -04:00
robust-speech-event Updates the default branch from master to main (#16326) 2022-03-23 03:46:59 -04:00
self-training-text-classification Add self training code for text classification (#16738) 2022-04-13 12:03:24 -04:00
seq2seq-distillation Updates the default branch from master to main (#16326) 2022-03-23 03:46:59 -04:00
tapex Add TAPEX (#16473) 2022-04-08 10:57:51 +02:00
visual_bert Updates the default branch from master to main (#16326) 2022-03-23 03:46:59 -04:00
wav2vec2 [examples] max samples can't be bigger than the len of dataset (#16501) 2022-03-30 12:33:16 -07:00
xtreme-s [research] link to the XTREME-S paper (#16519) 2022-03-31 23:26:50 +04:00
zero-shot-distillation Updates the default branch from master to main (#16326) 2022-03-23 03:46:59 -04:00
README.md

Research projects

This folder contains various research projects using 🤗 Transformers. These projects are not maintained, and each one requires the specific version of 🤗 Transformers indicated in the requirements file of its folder. Updating a project to the most recent version of the library will require some work.

To use any of them, run the following command inside the folder of your choice:

pip install -r requirements.txt

If you need help with any of them, contact the author(s) listed at the top of the README of each folder.
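
Because each project depends on its own version of 🤗 Transformers, it can help to check which version a folder's requirements file asks for before installing. The following is a minimal sketch, not part of the repository: the helper name `pinned_transformers_requirement` and the sample requirements text (including the version number) are hypothetical, and the parsing assumes plain `name==version` style lines rather than `git+https://...` references.

```python
import re
from typing import Optional


def pinned_transformers_requirement(requirements_text: str) -> Optional[str]:
    """Return the first requirement line naming the transformers package, or None.

    Hypothetical helper for illustration; it only understands simple
    requirement lines such as "transformers==X.Y.Z" or "transformers>=X.Y".
    """
    for raw_line in requirements_text.splitlines():
        # Drop trailing comments and surrounding whitespace.
        line = raw_line.split("#", 1)[0].strip()
        if not line:
            continue
        # The package name is the leading run of letters, digits, '.', '_' or '-'.
        match = re.match(r"[A-Za-z0-9._-]+", line)
        if match and match.group(0).lower() == "transformers":
            return line
    return None


# Hypothetical contents of a project's requirements.txt (version is made up).
sample = """\
# example only
torch>=1.4.0
transformers==3.5.1
datasets
"""

print(pinned_transformers_requirement(sample))  # prints: transformers==3.5.1
```

In practice the text would come from reading the `requirements.txt` of the chosen folder. A result of `None` means the file has no simple `transformers` line, in which case the project's README is the place to look.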