mirror of
https://github.com/saymrwulf/transformers.git
synced 2026-05-14 20:58:08 +00:00
* First draft * Update self-attention of RoBERTa as proposition * Improve conversion script * Add TrOCR decoder-only model * More improvements * Make forward pass with pretrained weights work * More improvements * Some more improvements * More improvements * Make conversion work * Clean up print statements * Add documentation, processor * Add test files * Small improvements * Some more improvements * Make fix-copies, improve docs * Make all vision encoder decoder model tests pass * Make conversion script support other models * Update URL for OCR image * Update conversion script * Fix style & quality * Add support for the large-printed model * Fix some issues * Add print statement for debugging * Add print statements for debugging * Make possible fix for sinusoidal embedding * Further debugging * Potential fix v2 * Add more print statements for debugging * Add more print statements for debugging * Deubg more * Comment out print statements * Make conversion of large printed model possible, address review comments * Make it possible to convert the stage1 checkpoints * Clean up code, apply suggestions from code review * Apply suggestions from code review, use Microsoft models in tests * Rename encoder_hidden_size to cross_attention_hidden_size * Improve docs |
||
|---|---|---|
| .. | ||
| albert.rst | ||
| auto.rst | ||
| bart.rst | ||
| barthez.rst | ||
| beit.rst | ||
| bert.rst | ||
| bert_japanese.rst | ||
| bertgeneration.rst | ||
| bertweet.rst | ||
| bigbird.rst | ||
| bigbird_pegasus.rst | ||
| blenderbot.rst | ||
| blenderbot_small.rst | ||
| bort.rst | ||
| byt5.rst | ||
| camembert.rst | ||
| canine.rst | ||
| clip.rst | ||
| convbert.rst | ||
| cpm.rst | ||
| ctrl.rst | ||
| deberta.rst | ||
| deberta_v2.rst | ||
| deit.rst | ||
| detr.rst | ||
| dialogpt.rst | ||
| distilbert.rst | ||
| dpr.rst | ||
| electra.rst | ||
| encoderdecoder.rst | ||
| flaubert.rst | ||
| fnet.rst | ||
| fsmt.rst | ||
| funnel.rst | ||
| gpt.rst | ||
| gpt2.rst | ||
| gpt_neo.rst | ||
| gptj.rst | ||
| herbert.rst | ||
| hubert.rst | ||
| ibert.rst | ||
| layoutlm.rst | ||
| layoutlmv2.rst | ||
| layoutxlm.rst | ||
| led.rst | ||
| longformer.rst | ||
| luke.rst | ||
| lxmert.rst | ||
| m2m_100.rst | ||
| marian.rst | ||
| mbart.rst | ||
| megatron_bert.rst | ||
| megatron_gpt2.rst | ||
| mobilebert.rst | ||
| mpnet.rst | ||
| mt5.rst | ||
| pegasus.rst | ||
| phobert.rst | ||
| prophetnet.rst | ||
| rag.rst | ||
| reformer.rst | ||
| rembert.rst | ||
| retribert.rst | ||
| roberta.rst | ||
| roformer.rst | ||
| speech_to_text.rst | ||
| speech_to_text_2.rst | ||
| speechencoderdecoder.rst | ||
| splinter.rst | ||
| squeezebert.rst | ||
| t5.rst | ||
| t5v1.1.rst | ||
| tapas.rst | ||
| transformerxl.rst | ||
| trocr.rst | ||
| visionencoderdecoder.rst | ||
| visual_bert.rst | ||
| vit.rst | ||
| wav2vec2.rst | ||
| xlm.rst | ||
| xlmprophetnet.rst | ||
| xlmroberta.rst | ||
| xlnet.rst | ||
| xlsr_wav2vec2.rst | ||