transformers

mirror of https://github.com/saymrwulf/transformers.git synced 2026-05-14 20:58:08 +00:00

History

NielsRogge 408b2d2bd0 Add TrOCR + VisionEncoderDecoderModel (#13874 ) * First draft * Update self-attention of RoBERTa as proposition * Improve conversion script * Add TrOCR decoder-only model * More improvements * Make forward pass with pretrained weights work * More improvements * Some more improvements * More improvements * Make conversion work * Clean up print statements * Add documentation, processor * Add test files * Small improvements * Some more improvements * Make fix-copies, improve docs * Make all vision encoder decoder model tests pass * Make conversion script support other models * Update URL for OCR image * Update conversion script * Fix style & quality * Add support for the large-printed model * Fix some issues * Add print statement for debugging * Add print statements for debugging * Make possible fix for sinusoidal embedding * Further debugging * Potential fix v2 * Add more print statements for debugging * Add more print statements for debugging * Deubg more * Comment out print statements * Make conversion of large printed model possible, address review comments * Make it possible to convert the stage1 checkpoints * Clean up code, apply suggestions from code review * Apply suggestions from code review, use Microsoft models in tests * Rename encoder_hidden_size to cross_attention_hidden_size * Improve docs		2021-10-13 10:28:56 +02:00
..
albert.rst	albert flax (#13294 )	2021-08-30 17:29:27 +02:00
auto.rst	Image Segmentation pipeline (#13828 )	2021-10-08 09:59:53 +02:00
bart.rst
barthez.rst
beit.rst	beit-flax (#13515 )	2021-09-21 13:34:19 +02:00
bert.rst	[Flax] Correct flax docs (#12782 )	2021-08-04 16:31:23 +02:00
bert_japanese.rst
bertgeneration.rst
bertweet.rst
bigbird.rst
bigbird_pegasus.rst
blenderbot.rst
blenderbot_small.rst	Add BlenderBot small tokenizer to the init (#13367 )	2021-09-22 19:00:47 -04:00
bort.rst
byt5.rst	Improve T5 docs (#13240 )	2021-09-01 15:05:40 +02:00
camembert.rst
canine.rst
clip.rst
convbert.rst
cpm.rst
ctrl.rst
deberta.rst	Deberta tf (#12972 )	2021-08-12 05:01:26 -04:00
deberta_v2.rst	Deberta_v2 tf (#13120 )	2021-08-31 06:32:47 -04:00
deit.rst
detr.rst
dialogpt.rst
distilbert.rst	Fix typo distilbert doc (#13643 )	2021-09-20 15:10:33 -04:00
dpr.rst	[DPR] Correct init (#13796 )	2021-09-30 18:55:20 +02:00
electra.rst
encoderdecoder.rst	Add TFEncoderDecoderModel + Add cross-attention to some TF models (#13222 )	2021-10-13 00:10:34 +02:00
flaubert.rst
fnet.rst	Add FNet (#13045 )	2021-09-20 13:24:30 +02:00
fsmt.rst
funnel.rst
gpt.rst
gpt2.rst	Add Mistral GPT-2 Stability Tweaks (#13573 )	2021-10-04 07:37:09 -04:00
gpt_neo.rst
gptj.rst	[docs/gpt-j] fix typo (#13851 )	2021-10-04 12:30:50 +02:00
herbert.rst	Fixed typo: herBERT -> HerBERT (#13936 )	2021-10-08 10:27:32 -04:00
hubert.rst	Add Wav2Vec2 & Hubert ForSequenceClassification (#13153 )	2021-08-27 20:52:51 +03:00
ibert.rst
layoutlm.rst
layoutlmv2.rst	Add LayoutLMv2 + LayoutXLM (#12604 )	2021-08-30 12:35:42 +02:00
layoutxlm.rst	Add tokenizer docs (#13373 )	2021-09-02 09:46:05 +02:00
led.rst	Make gradient_checkpointing a training argument (#13657 )	2021-09-22 07:51:38 -04:00
longformer.rst
luke.rst
lxmert.rst
m2m_100.rst	replace tgt_lang by tgt_text (#13061 )	2021-08-09 22:47:05 +05:30
marian.rst	Rely on huggingface_hub for common tools (#13100 )	2021-08-12 14:59:02 +02:00
mbart.rst	fix example (#13387 )	2021-09-02 11:32:18 +02:00
megatron_bert.rst
megatron_gpt2.rst
mobilebert.rst
mpnet.rst
mt5.rst	Fix mT5 documentation (#13639 )	2021-09-20 07:53:31 -04:00
pegasus.rst	[Flax] Addition of FlaxPegasus (#13420 )	2021-09-14 17:15:19 +02:00
phobert.rst
prophetnet.rst
rag.rst
reformer.rst
rembert.rst	Add RemBERT model code to huggingface (#10692 )	2021-07-24 11:31:42 -04:00
retribert.rst
roberta.rst	Add TFEncoderDecoderModel + Add cross-attention to some TF models (#13222 )	2021-10-13 00:10:34 +02:00
roformer.rst
speech_to_text.rst	fix: typo spelling grammar (#13212 )	2021-08-30 08:09:14 -04:00
speech_to_text_2.rst	Add SpeechEncoderDecoder & Speech2Text2 (#13186 )	2021-09-01 13:33:31 +02:00
speechencoderdecoder.rst	Add SpeechEncoderDecoder & Speech2Text2 (#13186 )	2021-09-01 13:33:31 +02:00
splinter.rst	Add splinter (#12955 )	2021-08-17 08:29:01 -04:00
squeezebert.rst
t5.rst	Improve T5 docs (#13240 )	2021-09-01 15:05:40 +02:00
t5v1.1.rst	Improve T5 docs (#13240 )	2021-09-01 15:05:40 +02:00
tapas.rst
transformerxl.rst
trocr.rst	Add TrOCR + VisionEncoderDecoderModel (#13874 )	2021-10-13 10:28:56 +02:00
visionencoderdecoder.rst	Add TrOCR + VisionEncoderDecoderModel (#13874 )	2021-10-13 10:28:56 +02:00
visual_bert.rst	Fix VisualBERT docs (#13106 )	2021-08-13 11:44:04 +05:30
vit.rst	Add DINO conversion script (#13265 )	2021-08-26 17:25:20 +02:00
wav2vec2.rst	Add Wav2Vec2 & Hubert ForSequenceClassification (#13153 )	2021-08-27 20:52:51 +03:00
xlm.rst
xlmprophetnet.rst
xlmroberta.rst
xlnet.rst
xlsr_wav2vec2.rst