transformers/docs/source/model_doc
NielsRogge 408b2d2bd0
Add TrOCR + VisionEncoderDecoderModel (#13874)
* First draft

* Update self-attention of RoBERTa as proposition

* Improve conversion script

* Add TrOCR decoder-only model

* More improvements

* Make forward pass with pretrained weights work

* More improvements

* Some more improvements

* More improvements

* Make conversion work

* Clean up print statements

* Add documentation, processor

* Add test files

* Small improvements

* Some more improvements

* Make fix-copies, improve docs

* Make all vision encoder decoder model tests pass

* Make conversion script support other models

* Update URL for OCR image

* Update conversion script

* Fix style & quality

* Add support for the large-printed model

* Fix some issues

* Add print statement for debugging

* Add print statements for debugging

* Make possible fix for sinusoidal embedding

* Further debugging

* Potential fix v2

* Add more print statements for debugging

* Add more print statements for debugging

* Deubg more

* Comment out print statements

* Make conversion of large printed model possible, address review comments

* Make it possible to convert the stage1 checkpoints

* Clean up code, apply suggestions from code review

* Apply suggestions from code review, use Microsoft models in tests

* Rename encoder_hidden_size to cross_attention_hidden_size

* Improve docs
2021-10-13 10:28:56 +02:00
..
albert.rst albert flax (#13294) 2021-08-30 17:29:27 +02:00
auto.rst Image Segmentation pipeline (#13828) 2021-10-08 09:59:53 +02:00
bart.rst
barthez.rst
beit.rst beit-flax (#13515) 2021-09-21 13:34:19 +02:00
bert.rst [Flax] Correct flax docs (#12782) 2021-08-04 16:31:23 +02:00
bert_japanese.rst
bertgeneration.rst
bertweet.rst
bigbird.rst
bigbird_pegasus.rst
blenderbot.rst
blenderbot_small.rst Add BlenderBot small tokenizer to the init (#13367) 2021-09-22 19:00:47 -04:00
bort.rst
byt5.rst Improve T5 docs (#13240) 2021-09-01 15:05:40 +02:00
camembert.rst
canine.rst
clip.rst
convbert.rst
cpm.rst
ctrl.rst
deberta.rst Deberta tf (#12972) 2021-08-12 05:01:26 -04:00
deberta_v2.rst Deberta_v2 tf (#13120) 2021-08-31 06:32:47 -04:00
deit.rst
detr.rst
dialogpt.rst
distilbert.rst Fix typo distilbert doc (#13643) 2021-09-20 15:10:33 -04:00
dpr.rst [DPR] Correct init (#13796) 2021-09-30 18:55:20 +02:00
electra.rst
encoderdecoder.rst Add TFEncoderDecoderModel + Add cross-attention to some TF models (#13222) 2021-10-13 00:10:34 +02:00
flaubert.rst
fnet.rst Add FNet (#13045) 2021-09-20 13:24:30 +02:00
fsmt.rst
funnel.rst
gpt.rst
gpt2.rst Add Mistral GPT-2 Stability Tweaks (#13573) 2021-10-04 07:37:09 -04:00
gpt_neo.rst
gptj.rst [docs/gpt-j] fix typo (#13851) 2021-10-04 12:30:50 +02:00
herbert.rst Fixed typo: herBERT -> HerBERT (#13936) 2021-10-08 10:27:32 -04:00
hubert.rst Add Wav2Vec2 & Hubert ForSequenceClassification (#13153) 2021-08-27 20:52:51 +03:00
ibert.rst
layoutlm.rst
layoutlmv2.rst Add LayoutLMv2 + LayoutXLM (#12604) 2021-08-30 12:35:42 +02:00
layoutxlm.rst Add tokenizer docs (#13373) 2021-09-02 09:46:05 +02:00
led.rst Make gradient_checkpointing a training argument (#13657) 2021-09-22 07:51:38 -04:00
longformer.rst
luke.rst
lxmert.rst
m2m_100.rst replace tgt_lang by tgt_text (#13061) 2021-08-09 22:47:05 +05:30
marian.rst Rely on huggingface_hub for common tools (#13100) 2021-08-12 14:59:02 +02:00
mbart.rst fix example (#13387) 2021-09-02 11:32:18 +02:00
megatron_bert.rst
megatron_gpt2.rst
mobilebert.rst
mpnet.rst
mt5.rst Fix mT5 documentation (#13639) 2021-09-20 07:53:31 -04:00
pegasus.rst [Flax] Addition of FlaxPegasus (#13420) 2021-09-14 17:15:19 +02:00
phobert.rst
prophetnet.rst
rag.rst
reformer.rst
rembert.rst Add RemBERT model code to huggingface (#10692) 2021-07-24 11:31:42 -04:00
retribert.rst
roberta.rst Add TFEncoderDecoderModel + Add cross-attention to some TF models (#13222) 2021-10-13 00:10:34 +02:00
roformer.rst
speech_to_text.rst fix: typo spelling grammar (#13212) 2021-08-30 08:09:14 -04:00
speech_to_text_2.rst Add SpeechEncoderDecoder & Speech2Text2 (#13186) 2021-09-01 13:33:31 +02:00
speechencoderdecoder.rst Add SpeechEncoderDecoder & Speech2Text2 (#13186) 2021-09-01 13:33:31 +02:00
splinter.rst Add splinter (#12955) 2021-08-17 08:29:01 -04:00
squeezebert.rst
t5.rst Improve T5 docs (#13240) 2021-09-01 15:05:40 +02:00
t5v1.1.rst Improve T5 docs (#13240) 2021-09-01 15:05:40 +02:00
tapas.rst
transformerxl.rst
trocr.rst Add TrOCR + VisionEncoderDecoderModel (#13874) 2021-10-13 10:28:56 +02:00
visionencoderdecoder.rst Add TrOCR + VisionEncoderDecoderModel (#13874) 2021-10-13 10:28:56 +02:00
visual_bert.rst Fix VisualBERT docs (#13106) 2021-08-13 11:44:04 +05:30
vit.rst Add DINO conversion script (#13265) 2021-08-26 17:25:20 +02:00
wav2vec2.rst Add Wav2Vec2 & Hubert ForSequenceClassification (#13153) 2021-08-27 20:52:51 +03:00
xlm.rst
xlmprophetnet.rst
xlmroberta.rst
xlnet.rst
xlsr_wav2vec2.rst