transformers

mirror of https://github.com/saymrwulf/transformers.git synced 2026-05-15 21:01:19 +00:00

History

Eduardo Gonzalez Ponferrada df5a4094a6 Add Data2Vec (#15507 ) * Add data2vec model cloned from roberta * Add checkpoint conversion script * Fix copies * Update docs * Add checkpoint conversion script * Remove fairseq data2vec_text script and fix format * Add comment on where to get data2vec_text.py * Remove mock implementation cheat.py and fix style * Fix copies * Remove TF and Flax classes from init * Add back copy from fairseq data2vec_text.py and fix style * Update model name in docs/source/index.mdx to be CamelCase * Revert model name in table to lower-case to get check_table test to pass * Update src/transformers/models/data2vec/__init__.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/models/data2vec/convert_data2vec_original_pytorch_checkpoint_to_pytorch.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/models/data2vec/modeling_data2vec.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/models/data2vec/modeling_data2vec.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/models/data2vec/modeling_data2vec.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/models/data2vec/modeling_data2vec.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update docs/source/model_doc/data2vec.mdx Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update docs/source/model_doc/data2vec.mdx Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/auto/configuration_auto.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/data2vec/configuration_data2vec.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/data2vec/modeling_data2vec.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/data2vec/modeling_data2vec.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/data2vec/modeling_data2vec.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update tests/test_modeling_data2vec.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/data2vec/configuration_data2vec.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/data2vec/modeling_data2vec.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update documentation * Copy-paste Data2VecConfig from BertConfig * Update config checkpoint to point to edugp/data2vec-nlp-base. Fix style and repo-consistency * Update config special tokens to match RoBERTa * Split multiple assertions and add individual error messages * Rename Data2VecModel to Data2VecForTextModel * Add Data2Vec to _toctree.yml * Rename Data2VecEmbeddings to Data2VecForTextEmbeddings * Add initial Data2VecForAudio model (unfinished). Only matching fairseq's implementation up to the feature encoder (before positional encoding). * finish audio model * finish audio file * Update names and fix style, quality and repo consistency * Remove Data2VecAudioForPretraining. Add tests for Data2VecAudio, mimicking the Wav2Vec2 test suite. Fix bias initilization in positional conv layers. Move back configurations for audio and text to separate files. * add inputs to logits to data2vec' * correct autio models * correct config auto * correct tok auto * Update utils/tests_fetcher.py * delete unnecessary files * delete unnecessary files * further renaming * make all tests pass * finish * remove useless test file * Update tests/test_modeling_common.py * Update utils/check_repo.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/models/data2vec/modeling_data2vec_text.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Fix copies * Update docs * Remove fairseq data2vec_text script and fix format * Add comment on where to get data2vec_text.py * Remove mock implementation cheat.py and fix style * Fix copies * Remove TF and Flax classes from init * Add back copy from fairseq data2vec_text.py and fix style * Update model name in docs/source/index.mdx to be CamelCase * Revert model name in table to lower-case to get check_table test to pass * Update documentation * Update src/transformers/models/data2vec/__init__.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/models/data2vec/convert_data2vec_original_pytorch_checkpoint_to_pytorch.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/models/data2vec/modeling_data2vec.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/models/data2vec/modeling_data2vec.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/models/data2vec/modeling_data2vec.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/models/data2vec/modeling_data2vec.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/models/auto/configuration_auto.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/data2vec/configuration_data2vec.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/data2vec/modeling_data2vec.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/data2vec/modeling_data2vec.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/data2vec/modeling_data2vec.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update tests/test_modeling_data2vec.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/data2vec/configuration_data2vec.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/data2vec/modeling_data2vec.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Copy-paste Data2VecConfig from BertConfig * Update config checkpoint to point to edugp/data2vec-nlp-base. Fix style and repo-consistency * Update config special tokens to match RoBERTa * Split multiple assertions and add individual error messages * Rename Data2VecModel to Data2VecForTextModel * Add Data2Vec to _toctree.yml * Rename Data2VecEmbeddings to Data2VecForTextEmbeddings * Add initial Data2VecForAudio model (unfinished). Only matching fairseq's implementation up to the feature encoder (before positional encoding). * finish audio model * finish audio file * add inputs to logits to data2vec' * Update names and fix style, quality and repo consistency * Remove Data2VecAudioForPretraining. Add tests for Data2VecAudio, mimicking the Wav2Vec2 test suite. Fix bias initilization in positional conv layers. Move back configurations for audio and text to separate files. * correct autio models * correct config auto * correct tok auto * delete unnecessary files * delete unnecessary files * Update utils/tests_fetcher.py * further renaming * make all tests pass * finish * remove useless test file * Update tests/test_modeling_common.py * Update utils/check_repo.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/models/data2vec/modeling_data2vec_text.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Move data2vec tests to new structure * Fix test imports for text tests * Remove fairseq files * Change paper link to arxiv * Modify Data2Vec documentation to reflect that the encoder is not shared across the audio and text models in the current implementation. * Update text model checkpoint to be facebook/data2vec-text-base * Add 'Copy from' statements and update paper links and docs * fix copy from statements * improve copied from * correct more copied from statements * finish copied from stuff * make style * add model to README * add to master Co-authored-by: Eduardo Gonzalez Ponferrada <eduardo@ferrumhealth.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>		2022-03-01 11:09:20 +01:00
..
albert.mdx	Convert model files from rst to mdx (#14865 )	2021-12-22 03:27:30 -05:00
auto.mdx	Add SimMIM (#15586 )	2022-02-17 19:44:55 +01:00
bart.mdx	Doc styler examples (#14953 )	2021-12-27 19:07:46 -05:00
barthez.mdx	Convert model files from rst to mdx (#14865 )	2021-12-22 03:27:30 -05:00
bartpho.mdx	Doc styler examples (#14953 )	2021-12-27 19:07:46 -05:00
beit.mdx	Convert model files from rst to mdx (#14865 )	2021-12-22 03:27:30 -05:00
bert-generation.mdx	Map model_type and doc pages names (#14944 )	2022-01-03 05:08:55 -05:00
bert-japanese.mdx	Map model_type and doc pages names (#14944 )	2022-01-03 05:08:55 -05:00
bert.mdx	Convert rst to mdx bert (#14806 )	2021-12-17 11:13:34 -05:00
bertweet.mdx	Doc styler examples (#14953 )	2021-12-27 19:07:46 -05:00
big_bird.mdx	Map model_type and doc pages names (#14944 )	2022-01-03 05:08:55 -05:00
bigbird_pegasus.mdx	Convert model files from rst to mdx (#14865 )	2021-12-22 03:27:30 -05:00
blenderbot-small.mdx	Map model_type and doc pages names (#14944 )	2022-01-03 05:08:55 -05:00
blenderbot.mdx	Map model_type and doc pages names (#14944 )	2022-01-03 05:08:55 -05:00
bort.mdx	Convert model files from rst to mdx (#14865 )	2021-12-22 03:27:30 -05:00
byt5.mdx	Doc styler examples (#14953 )	2021-12-27 19:07:46 -05:00
camembert.mdx	Convert model files from rst to mdx (#14865 )	2021-12-22 03:27:30 -05:00
canine.mdx	Doc styler examples (#14953 )	2021-12-27 19:07:46 -05:00
clip.mdx	Doc styler examples (#14953 )	2021-12-27 19:07:46 -05:00
convbert.mdx	Convert model files from rst to mdx (#14865 )	2021-12-22 03:27:30 -05:00
convnext.mdx	Add TFConvNextModel (#15750 )	2022-02-25 18:19:16 +01:00
cpm.mdx	Convert model files from rst to mdx (#14865 )	2021-12-22 03:27:30 -05:00
ctrl.mdx	🖍 remove broken link (#15615 )	2022-02-11 12:33:55 -06:00
data2vec.mdx	Add Data2Vec (#15507 )	2022-03-01 11:09:20 +01:00
deberta-v2.mdx	Map model_type and doc pages names (#14944 )	2022-01-03 05:08:55 -05:00
deberta.mdx	Convert model files from rst to mdx (#14865 )	2021-12-22 03:27:30 -05:00
deit.mdx	Add SimMIM (#15586 )	2022-02-17 19:44:55 +01:00
detr.mdx	Doc new front (#14590 )	2021-12-01 14:13:02 -05:00
dialogpt.mdx	Convert model files from rst to mdx (#14865 )	2021-12-22 03:27:30 -05:00
distilbert.mdx	Convert model files from rst to mdx (#14865 )	2021-12-22 03:27:30 -05:00
dpr.mdx	Convert model files from rst to mdx (#14865 )	2021-12-22 03:27:30 -05:00
electra.mdx	Add `ElectraForCausalLM` -> Enable Electra encoder-decoder model (#14729 )	2021-12-27 12:37:52 +01:00
encoder-decoder.mdx	Map model_type and doc pages names (#14944 )	2022-01-03 05:08:55 -05:00
flaubert.mdx	Convert model files from rst to mdx (#14865 )	2021-12-22 03:27:30 -05:00
fnet.mdx	Convert model files from rst to mdx (#14865 )	2021-12-22 03:27:30 -05:00
fsmt.mdx	Convert model files from rst to mdx (#14865 )	2021-12-22 03:27:30 -05:00
funnel.mdx	Convert model files from rst to mdx (#14865 )	2021-12-22 03:27:30 -05:00
gpt2.mdx	Convert model files from rst to mdx (#14865 )	2021-12-22 03:27:30 -05:00
gpt_neo.mdx	Doc styler examples (#14953 )	2021-12-27 19:07:46 -05:00
gptj.mdx	Doc styler examples (#14953 )	2021-12-27 19:07:46 -05:00
herbert.mdx	Doc styler examples (#14953 )	2021-12-27 19:07:46 -05:00
hubert.mdx	Convert model files from rst to mdx (#14865 )	2021-12-22 03:27:30 -05:00
ibert.mdx	Convert model files from rst to mdx (#14865 )	2021-12-22 03:27:30 -05:00
imagegpt.mdx	Map model_type and doc pages names (#14944 )	2022-01-03 05:08:55 -05:00
layoutlm.mdx	Doc styler examples (#14953 )	2021-12-27 19:07:46 -05:00
layoutlmv2.mdx	Doc styler examples (#14953 )	2021-12-27 19:07:46 -05:00
layoutxlm.mdx	Doc styler examples (#14953 )	2021-12-27 19:07:46 -05:00
led.mdx	Convert model files from rst to mdx (#14865 )	2021-12-22 03:27:30 -05:00
longformer.mdx	Doc styler examples (#14953 )	2021-12-27 19:07:46 -05:00
luke.mdx	Doc styler examples (#14953 )	2021-12-27 19:07:46 -05:00
lxmert.mdx	Convert model files from rst to mdx (#14865 )	2021-12-22 03:27:30 -05:00
m2m_100.mdx	Doc styler examples (#14953 )	2021-12-27 19:07:46 -05:00
marian.mdx	Doc styler examples (#14953 )	2021-12-27 19:07:46 -05:00
mbart.mdx	Doc styler examples (#14953 )	2021-12-27 19:07:46 -05:00
megatron-bert.mdx	Map model_type and doc pages names (#14944 )	2022-01-03 05:08:55 -05:00
megatron_gpt2.mdx	Convert rst files (#14888 )	2021-12-22 16:14:35 -05:00
mluke.mdx	Doc styler examples (#14953 )	2021-12-27 19:07:46 -05:00
mobilebert.mdx	Convert rst files (#14888 )	2021-12-22 16:14:35 -05:00
mpnet.mdx	Convert rst files (#14888 )	2021-12-22 16:14:35 -05:00
mt5.mdx	Convert rst files (#14888 )	2021-12-22 16:14:35 -05:00
nystromformer.mdx	Add Nystromformer (#14659 )	2022-01-11 14:25:49 +01:00
openai-gpt.mdx	Map model_type and doc pages names (#14944 )	2022-01-03 05:08:55 -05:00
pegasus.mdx	Doc styler examples (#14953 )	2021-12-27 19:07:46 -05:00
perceiver.mdx	Fix Perceiver docs (#14879 )	2021-12-22 14:18:03 +01:00
phobert.mdx	Convert rst files (#14888 )	2021-12-22 16:14:35 -05:00
plbart.mdx	Add PLBart (#13269 )	2022-02-18 14:17:09 +01:00
poolformer.mdx	Add model specific output classes to PoolFormer model docs (#15746 )	2022-02-25 13:43:56 +01:00
prophetnet.mdx	Convert rst files (#14888 )	2021-12-22 16:14:35 -05:00
qdqbert.mdx	Doc styler examples (#14953 )	2021-12-27 19:07:46 -05:00
rag.mdx	Convert rst files (#14888 )	2021-12-22 16:14:35 -05:00
realm.mdx	Add FastTokenizer to REALM (#15211 )	2022-01-19 15:19:36 +01:00
reformer.mdx	Doc styler examples (#14953 )	2021-12-27 19:07:46 -05:00
rembert.mdx	Convert rst files (#14888 )	2021-12-22 16:14:35 -05:00
retribert.mdx	Convert rst files (#14888 )	2021-12-22 16:14:35 -05:00
roberta.mdx	Convert rst files (#14888 )	2021-12-22 16:14:35 -05:00
roformer.mdx	Add Flax RoFormer (#15005 )	2022-01-04 13:23:10 +01:00
segformer.mdx	make table into valid Markdown table syntax (#15337 )	2022-01-26 07:10:00 -05:00
sew-d.mdx	Map model_type and doc pages names (#14944 )	2022-01-03 05:08:55 -05:00
sew.mdx	Convert rst files (#14888 )	2021-12-22 16:14:35 -05:00
speech-encoder-decoder.mdx	Flax Speech-Encoder-Decoder Model (#15613 )	2022-02-28 12:22:36 +01:00
speech_to_text.mdx	Add TFSpeech2Text (#15113 )	2022-02-08 16:27:23 +00:00
speech_to_text_2.mdx	Fix typo in speech2text2 doc (#15617 )	2022-02-15 13:54:34 +01:00
splinter.mdx	Convert rst files (#14888 )	2021-12-22 16:14:35 -05:00
squeezebert.mdx	Convert rst files (#14888 )	2021-12-22 16:14:35 -05:00
swin.mdx	Add SimMIM (#15586 )	2022-02-17 19:44:55 +01:00
t5.mdx	[t5/t0/mt5 models] faster/leaner custom layer norm (#14656 )	2022-02-15 16:49:57 -08:00
t5v1.1.mdx	Doc styler examples (#14953 )	2021-12-27 19:07:46 -05:00
tapas.mdx	Doc styler examples (#14953 )	2021-12-27 19:07:46 -05:00
transfo-xl.mdx	Map model_type and doc pages names (#14944 )	2022-01-03 05:08:55 -05:00
trocr.mdx	Fix doc examples (#15257 )	2022-01-20 21:51:51 +01:00
unispeech-sat.mdx	Map model_type and doc pages names (#14944 )	2022-01-03 05:08:55 -05:00
unispeech.mdx	Convert rst files (#14888 )	2021-12-22 16:14:35 -05:00
vilt.mdx	Add ViLT (#14895 )	2022-01-19 19:51:59 +01:00
vision-encoder-decoder.mdx	Make Swin work with VisionEncoderDecoderModel (#15527 )	2022-02-14 17:33:35 +01:00
vision-text-dual-encoder.mdx	Map model_type and doc pages names (#14944 )	2022-01-03 05:08:55 -05:00
visual_bert.mdx	Doc styler examples (#14953 )	2021-12-27 19:07:46 -05:00
vit.mdx	Add SimMIM (#15586 )	2022-02-17 19:44:55 +01:00
vit_mae.mdx	Add link (#15588 )	2022-02-09 23:33:39 +01:00
wav2vec2.mdx	Time stamps for CTC models (#15687 )	2022-02-22 19:26:44 +01:00
wav2vec2_phoneme.mdx	Wav2Vec2 meets phonemes (#14353 )	2021-12-17 19:56:44 +01:00
wavlm.mdx	Convert rst files (#14888 )	2021-12-22 16:14:35 -05:00
xglm.mdx	Add XGLM models (#14876 )	2022-01-28 18:55:23 +01:00
xlm-prophetnet.mdx	Map model_type and doc pages names (#14944 )	2022-01-03 05:08:55 -05:00
xlm-roberta-xl.mdx	Add support for XLM-R XL and XXL models by modeling_xlm_roberta_xl.py (#13727 )	2022-01-29 13:42:37 +01:00
xlm-roberta.mdx	Map model_type and doc pages names (#14944 )	2022-01-03 05:08:55 -05:00
xlm.mdx	Convert rst files (#14888 )	2021-12-22 16:14:35 -05:00
xlnet.mdx	Convert rst files (#14888 )	2021-12-22 16:14:35 -05:00
xls_r.mdx	Convert rst files (#14888 )	2021-12-22 16:14:35 -05:00
xlsr_wav2vec2.mdx	Convert rst files (#14888 )	2021-12-22 16:14:35 -05:00
yoso.mdx	Add YOSO (#15091 )	2022-01-26 19:18:29 +01:00