transformers

mirror of https://github.com/saymrwulf/transformers.git synced 2026-05-15 21:01:19 +00:00

History

NielsRogge 65b20b739b Add Perceiver IO (#14487 ) * First draft * Style and remove mlm * Make forward pass work * More improvements * More improvements * Fix bug * More improvements * More improvements * Add PerceiverTokenizer first draft * Improve conversion script * More improvements * Make conversion script work for the encoder * Make conversion script work with local pickle files * Style & quality, fix-copies * Add dummy input to conversion script * Add absolute position embeddings to TextPreProcessor * Make forward pass of encoder work * More improvements * Move text preprocessor to separate script * More improvements * More improvements * Add post processor * Make MLM model work * Style * Add PerceiverForMaskedLM * Add PerceiverImagePreprocessor * Make style * Make PerceiverForImageClassification work * More improvements * More improvements * Use tokenizer in conversion script * Use PerceiverForMaskedLM in conversion script * Define custom PerceiverModelOutput * Improve PerceiverAttention to make it work for both MLM and image classification * More improvements * More improvements * More improvements to the conversion script * Make conversion script work for both MLM and image classification * Add PerceiverFeatureExtractor * More improvements * Style and quality * Add center cropping * Fix bug * Small fix * Add print statement * Fix bug in image preprocessor * Fix bug with conversion script * Make output position embeddings an nn.Parameter layer instead of nn.Embedding * Comment out print statements * Add position encoding classes * More improvements * Use position_encoding_kwargs * Add PerceiverForImageClassificationFourier * Make style & quality * Add PerceiverForImageClassificationConvProcessing * Style & quality * Add flow model * Move processors to modeling file * Make position encodings modular * Make basic decoder use modular position encodings * Add PerceiverForOpticalFlow to conversion script * Add AudioPreprocessor * Make it possible for the basic decoder to use Fourier position embeddings * Add PerceiverForMultimodalAutoencoding * Improve model for optical flow * Improve _build_network_inputs method * Add print statement * Fix device issue * Fix device of Fourier embeddings * Add print statements for debugging * Add another print statement * Add another print statement * Add another print statement * Add another print statement * Improve PerceiverAudioPreprocessor * Improve conversion script for multimodal modal * More improvements * More improvements * Improve multimodal model * Make forward pass multimodal model work * More improvements * Improve tests * Fix some more tests * Add output dataclasses * Make more tests pass * Add print statements for debuggin * Add tests for image classification * Add PerceiverClassifierOutput * More improvements * Make more tests pass for the optical flow model * Make style & quality * Small improvements * Don't support training for optical flow model for now * Fix _prepare_for_class for tests * Make more tests pass, add some docs * Add multimodal model to tests * Minor fixes * Fix tests * Improve conversion script * Make fixup * Remove pos_dim argument * Fix device issue * Potential fix for OOM * Revert previous commit * Fix test_initialization * Add print statements for debugging * Fix print statement * Add print statement * Add print statement * Add print statement * Add print statement * Add print statement * Add print statement * Remove need for output_shape * Comment out output_shape * Remove unnecessary code * Improve docs * Fix make fixup * Remove PerceiverTextProcessor from init * Improve docs * Small improvement * Apply first batch of suggestions from code review * Apply more suggestions from code review * Update docstrings * Define dicts beforehand for readability * Rename task to architecture in conversion script, include PerceiverModel in tests * Add print statements for debugging * Fix tests on GPU * Remove preprocessors, postprocessors and decoders from main init * Add integration test * Fix docs * Replace einops by torch * Update for new docs frontend * Rename PerceiverForImageClassification * Improve docs * Improve docs * Improve docs of PerceiverModel * Fix some more tests * Improve center_crop * Add PerceiverForSequenceClassification * Small improvements * Fix tests * Add integration test for optical flow model * Clean up * Add tests for tokenizer * Fix tokenizer by adding special tokens properly * Fix CI		2021-12-08 14:20:34 +01:00
..
albert.rst	albert flax (#13294 )	2021-08-30 17:29:27 +02:00
auto.rst	Tapas tf (#13393 )	2021-11-30 11:07:55 +01:00
bart.rst	[Bart] Fix docs (#14434 )	2021-11-17 19:02:33 +01:00
barthez.rst
bartpho.rst	Add BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese (#13788 )	2021-10-18 10:16:46 -04:00
beit.rst	Added Beit model output class (#14133 )	2021-11-02 18:29:14 +01:00
bert.rst	[Flax] Correct flax docs (#12782 )	2021-08-04 16:31:23 +02:00
bert_japanese.rst
bertgeneration.rst
bertweet.rst	Add BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese (#13788 )	2021-10-18 10:16:46 -04:00
bigbird.rst	Flax Big Bird (#11967 )	2021-06-14 20:01:03 +01:00
bigbird_pegasus.rst
blenderbot.rst	Doc new front (#14590 )	2021-12-01 14:13:02 -05:00
blenderbot_small.rst	[Flax] Add FlaxBlenderbotSmall (#14576 )	2021-12-02 14:21:48 +05:30
bort.rst
byt5.rst	Improve T5 docs (#13240 )	2021-09-01 15:05:40 +02:00
camembert.rst
canine.rst	Wrong model is used in example, should be character instead of subword model (#12676 )	2021-07-13 08:40:27 -04:00
clip.rst	add and fix examples (#12810 )	2021-07-20 09:28:50 -04:00
convbert.rst
cpm.rst
ctrl.rst
deberta.rst	Deberta tf (#12972 )	2021-08-12 05:01:26 -04:00
deberta_v2.rst	Deberta_v2 tf (#13120 )	2021-08-31 06:32:47 -04:00
deit.rst	Doc new front (#14590 )	2021-12-01 14:13:02 -05:00
detr.mdx	Doc new front (#14590 )	2021-12-01 14:13:02 -05:00
dialogpt.rst
distilbert.rst	Fix typo distilbert doc (#13643 )	2021-09-20 15:10:33 -04:00
dpr.rst	Update dpr.rst (#14300 )	2021-11-06 09:41:02 -04:00
electra.rst
encoderdecoder.rst	Add TFEncoderDecoderModel + Add cross-attention to some TF models (#13222 )	2021-10-13 00:10:34 +02:00
flaubert.rst
fnet.rst	Add FNet (#13045 )	2021-09-20 13:24:30 +02:00
fsmt.rst
funnel.rst
gpt.rst
gpt2.rst	Add Mistral GPT-2 Stability Tweaks (#13573 )	2021-10-04 07:37:09 -04:00
gpt_neo.rst	FlaxGPTNeo (#12493 )	2021-07-06 18:55:18 +05:30
gptj.rst	Add GPTJForQuestionAnswering (#14503 )	2021-12-06 11:44:10 -05:00
herbert.rst	Fixed typo: herBERT -> HerBERT (#13936 )	2021-10-08 10:27:32 -04:00
hubert.rst	Add Wav2Vec2 & Hubert ForSequenceClassification (#13153 )	2021-08-27 20:52:51 +03:00
ibert.rst
imagegpt.rst	Rename ImageGPT (#14526 )	2021-11-29 10:19:11 +01:00
layoutlm.rst
layoutlmv2.rst	Doc new front (#14590 )	2021-12-01 14:13:02 -05:00
layoutxlm.rst	Add LayoutXLMProcessor (and LayoutXLMTokenizer, LayoutXLMTokenizerFast) (#14115 )	2021-11-03 08:59:44 +01:00
led.rst	Make gradient_checkpointing a training argument (#13657 )	2021-09-22 07:51:38 -04:00
longformer.rst
luke.rst	Add mLUKE (#14640 )	2021-12-07 00:25:28 -05:00
lxmert.rst
m2m_100.rst	replace tgt_lang by tgt_text (#13061 )	2021-08-09 22:47:05 +05:30
marian.rst	Update Transformers to huggingface_hub >= 0.1.0 (#14251 )	2021-11-02 18:58:42 -04:00
mbart.rst	fix example (#13387 )	2021-09-02 11:32:18 +02:00
megatron_bert.rst
megatron_gpt2.rst
mluke.rst	Add mLUKE (#14640 )	2021-12-07 00:25:28 -05:00
mobilebert.rst
mpnet.rst
mt5.rst	Fix mT5 documentation (#13639 )	2021-09-20 07:53:31 -04:00
pegasus.rst	[Flax] Addition of FlaxPegasus (#13420 )	2021-09-14 17:15:19 +02:00
perceiver.rst	Add Perceiver IO (#14487 )	2021-12-08 14:20:34 +01:00
phobert.rst	Add BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese (#13788 )	2021-10-18 10:16:46 -04:00
prophetnet.rst
qdqbert.rst	Add QDQBert model and quantization examples of SQUAD task (#14066 )	2021-11-19 13:33:39 -05:00
rag.rst
reformer.rst
rembert.rst	Add RemBERT model code to huggingface (#10692 )	2021-07-24 11:31:42 -04:00
retribert.rst
roberta.rst	Add TFEncoderDecoderModel + Add cross-attention to some TF models (#13222 )	2021-10-13 00:10:34 +02:00
roformer.rst	[RoFormer] Fix some issues (#12397 )	2021-07-06 03:31:57 -04:00
segformer.rst	Improve semantic segmentation models (#14355 )	2021-11-17 15:29:58 +01:00
sew.rst	[Speech] Refactor Examples (#14040 )	2021-10-18 17:43:35 +02:00
sew_d.rst	[Speech] Refactor Examples (#14040 )	2021-10-18 17:43:35 +02:00
speech_to_text.rst	up (#14008 )	2021-10-14 15:46:22 +02:00
speech_to_text_2.rst	[Speech2Text2] Enable tokenizers (#14390 )	2021-11-15 16:34:11 +01:00
speechencoderdecoder.rst	Add SpeechEncoderDecoder & Speech2Text2 (#13186 )	2021-09-01 13:33:31 +02:00
splinter.rst	Add splinter (#12955 )	2021-08-17 08:29:01 -04:00
squeezebert.rst
t5.rst	Improve T5 docs (#13240 )	2021-09-01 15:05:40 +02:00
t5v1.1.rst	Improve T5 docs (#13240 )	2021-09-01 15:05:40 +02:00
tapas.rst	Tapas tf (#13393 )	2021-11-30 11:07:55 +01:00
transformerxl.rst
trocr.rst	Add TrOCR + VisionEncoderDecoderModel (#13874 )	2021-10-13 10:28:56 +02:00
unispeech.rst	Add Unispeech & Unispeech-SAT (#13963 )	2021-10-26 18:59:58 +02:00
unispeech_sat.rst	Add Unispeech & Unispeech-SAT (#13963 )	2021-10-26 18:59:58 +02:00
vision_text_dual_encoder.rst	VisionTextDualEncoder (#13511 )	2021-11-30 22:21:48 +05:30
visionencoderdecoder.rst	Add FlaxVisionEncoderDecoderModel (#13359 )	2021-11-09 15:14:28 +05:30
visual_bert.rst	Fix VisualBERT docs (#13106 )	2021-08-13 11:44:04 +05:30
vit.rst	Add TFViTModel (#13778 )	2021-11-09 07:54:37 -05:00
wav2vec2.rst	[Wav2Vec2] PyCTCDecode Integration to support language model boosted decoding (#14339 )	2021-12-08 12:07:54 +01:00
xlm.rst
xlmprophetnet.rst
xlmroberta.rst
xlnet.rst
xlsr_wav2vec2.rst