mirror of
https://github.com/saymrwulf/transformers.git
synced 2026-05-15 21:01:19 +00:00
* First draft * Style and remove mlm * Make forward pass work * More improvements * More improvements * Fix bug * More improvements * More improvements * Add PerceiverTokenizer first draft * Improve conversion script * More improvements * Make conversion script work for the encoder * Make conversion script work with local pickle files * Style & quality, fix-copies * Add dummy input to conversion script * Add absolute position embeddings to TextPreProcessor * Make forward pass of encoder work * More improvements * Move text preprocessor to separate script * More improvements * More improvements * Add post processor * Make MLM model work * Style * Add PerceiverForMaskedLM * Add PerceiverImagePreprocessor * Make style * Make PerceiverForImageClassification work * More improvements * More improvements * Use tokenizer in conversion script * Use PerceiverForMaskedLM in conversion script * Define custom PerceiverModelOutput * Improve PerceiverAttention to make it work for both MLM and image classification * More improvements * More improvements * More improvements to the conversion script * Make conversion script work for both MLM and image classification * Add PerceiverFeatureExtractor * More improvements * Style and quality * Add center cropping * Fix bug * Small fix * Add print statement * Fix bug in image preprocessor * Fix bug with conversion script * Make output position embeddings an nn.Parameter layer instead of nn.Embedding * Comment out print statements * Add position encoding classes * More improvements * Use position_encoding_kwargs * Add PerceiverForImageClassificationFourier * Make style & quality * Add PerceiverForImageClassificationConvProcessing * Style & quality * Add flow model * Move processors to modeling file * Make position encodings modular * Make basic decoder use modular position encodings * Add PerceiverForOpticalFlow to conversion script * Add AudioPreprocessor * Make it possible for the basic decoder to use Fourier position embeddings * Add PerceiverForMultimodalAutoencoding * Improve model for optical flow * Improve _build_network_inputs method * Add print statement * Fix device issue * Fix device of Fourier embeddings * Add print statements for debugging * Add another print statement * Add another print statement * Add another print statement * Add another print statement * Improve PerceiverAudioPreprocessor * Improve conversion script for multimodal modal * More improvements * More improvements * Improve multimodal model * Make forward pass multimodal model work * More improvements * Improve tests * Fix some more tests * Add output dataclasses * Make more tests pass * Add print statements for debuggin * Add tests for image classification * Add PerceiverClassifierOutput * More improvements * Make more tests pass for the optical flow model * Make style & quality * Small improvements * Don't support training for optical flow model for now * Fix _prepare_for_class for tests * Make more tests pass, add some docs * Add multimodal model to tests * Minor fixes * Fix tests * Improve conversion script * Make fixup * Remove pos_dim argument * Fix device issue * Potential fix for OOM * Revert previous commit * Fix test_initialization * Add print statements for debugging * Fix print statement * Add print statement * Add print statement * Add print statement * Add print statement * Add print statement * Add print statement * Remove need for output_shape * Comment out output_shape * Remove unnecessary code * Improve docs * Fix make fixup * Remove PerceiverTextProcessor from init * Improve docs * Small improvement * Apply first batch of suggestions from code review * Apply more suggestions from code review * Update docstrings * Define dicts beforehand for readability * Rename task to architecture in conversion script, include PerceiverModel in tests * Add print statements for debugging * Fix tests on GPU * Remove preprocessors, postprocessors and decoders from main init * Add integration test * Fix docs * Replace einops by torch * Update for new docs frontend * Rename PerceiverForImageClassification * Improve docs * Improve docs * Improve docs of PerceiverModel * Fix some more tests * Improve center_crop * Add PerceiverForSequenceClassification * Small improvements * Fix tests * Add integration test for optical flow model * Clean up * Add tests for tokenizer * Fix tokenizer by adding special tokens properly * Fix CI |
||
|---|---|---|
| .. | ||
| albert.rst | ||
| auto.rst | ||
| bart.rst | ||
| barthez.rst | ||
| bartpho.rst | ||
| beit.rst | ||
| bert.rst | ||
| bert_japanese.rst | ||
| bertgeneration.rst | ||
| bertweet.rst | ||
| bigbird.rst | ||
| bigbird_pegasus.rst | ||
| blenderbot.rst | ||
| blenderbot_small.rst | ||
| bort.rst | ||
| byt5.rst | ||
| camembert.rst | ||
| canine.rst | ||
| clip.rst | ||
| convbert.rst | ||
| cpm.rst | ||
| ctrl.rst | ||
| deberta.rst | ||
| deberta_v2.rst | ||
| deit.rst | ||
| detr.mdx | ||
| dialogpt.rst | ||
| distilbert.rst | ||
| dpr.rst | ||
| electra.rst | ||
| encoderdecoder.rst | ||
| flaubert.rst | ||
| fnet.rst | ||
| fsmt.rst | ||
| funnel.rst | ||
| gpt.rst | ||
| gpt2.rst | ||
| gpt_neo.rst | ||
| gptj.rst | ||
| herbert.rst | ||
| hubert.rst | ||
| ibert.rst | ||
| imagegpt.rst | ||
| layoutlm.rst | ||
| layoutlmv2.rst | ||
| layoutxlm.rst | ||
| led.rst | ||
| longformer.rst | ||
| luke.rst | ||
| lxmert.rst | ||
| m2m_100.rst | ||
| marian.rst | ||
| mbart.rst | ||
| megatron_bert.rst | ||
| megatron_gpt2.rst | ||
| mluke.rst | ||
| mobilebert.rst | ||
| mpnet.rst | ||
| mt5.rst | ||
| pegasus.rst | ||
| perceiver.rst | ||
| phobert.rst | ||
| prophetnet.rst | ||
| qdqbert.rst | ||
| rag.rst | ||
| reformer.rst | ||
| rembert.rst | ||
| retribert.rst | ||
| roberta.rst | ||
| roformer.rst | ||
| segformer.rst | ||
| sew.rst | ||
| sew_d.rst | ||
| speech_to_text.rst | ||
| speech_to_text_2.rst | ||
| speechencoderdecoder.rst | ||
| splinter.rst | ||
| squeezebert.rst | ||
| t5.rst | ||
| t5v1.1.rst | ||
| tapas.rst | ||
| transformerxl.rst | ||
| trocr.rst | ||
| unispeech.rst | ||
| unispeech_sat.rst | ||
| vision_text_dual_encoder.rst | ||
| visionencoderdecoder.rst | ||
| visual_bert.rst | ||
| vit.rst | ||
| wav2vec2.rst | ||
| xlm.rst | ||
| xlmprophetnet.rst | ||
| xlmroberta.rst | ||
| xlnet.rst | ||
| xlsr_wav2vec2.rst | ||