transformers/tests
JB (Don) f1a385b1de
[RoBERTa-based] Add support for sdpa (#30510)
* Adding SDPA support for RoBERTa-based models

* add not is_cross_attention

* fix copies

* fix test

* add minimal test for camembert and xlm_roberta as their test class does not inherit from ModelTesterMixin

* address some review comments

* use copied from

* style

* consistency

* fix lists

---------

Co-authored-by: fxmarty <9808326+fxmarty@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
2024-08-28 10:26:00 +02:00
..
agents Agents use grammar (#31735) 2024-08-07 11:42:52 +02:00
benchmark
bettertransformer
deepspeed Revert PR 32299, flag users when Zero-3 was missed (#32851) 2024-08-16 12:35:41 -04:00
extended
fixtures
fsdp 🚨🚨🚨 Update min version of accelerate to 0.26.0 (#32627) 2024-08-20 11:42:36 +02:00
generation Llama: make slow tests green 🟢 (#33138) 2024-08-27 14:44:42 +01:00
models [RoBERTa-based] Add support for sdpa (#30510) 2024-08-28 10:26:00 +02:00
optimization fix: Fixed the 1st argument name in classmethods (#31907) 2024-07-11 12:11:50 +01:00
peft_integration
pipelines 🚨 Add Blip2ForImageTextRetrieval (#29261) 2024-08-27 18:50:27 +01:00
quantization Cache: use batch_size instead of max_batch_size (#32657) 2024-08-16 11:48:45 +01:00
repo_utils
sagemaker Fixed log messages that are resulting in TypeError due to too many arguments (#32017) 2024-07-17 10:56:44 +01:00
tokenization #32184 save total_vocab_size (#32240) 2024-08-05 09:22:48 +02:00
trainer Integrate Liger (Linkedin GPU Efficient Runtime) Kernel to Trainer (#32860) 2024-08-23 13:20:49 +02:00
utils CI: fix efficientnet pipeline timeout and prevent future similar issues due to large image size (#33123) 2024-08-27 11:58:27 +01:00
__init__.py
test_backbone_common.py
test_configuration_common.py Refactor: Removed un-necessary object base class (#32230) 2024-07-26 10:33:02 +02:00
test_feature_extraction_common.py
test_image_processing_common.py Update kwargs validation for preprocess with decorator (#32024) 2024-08-06 11:33:05 +01:00
test_image_transforms.py
test_modeling_common.py Test: add higher atol in test_forward_with_num_logits_to_keep (#33093) 2024-08-26 15:23:30 +01:00
test_modeling_flax_common.py
test_modeling_tf_common.py
test_pipeline_mixin.py fix: Fixed raising TypeError instead of ValueError for invalid type (#32111) 2024-07-22 17:46:17 +01:00
test_processing_common.py Modify ProcessorTesterMixin for better generalization (#32637) 2024-08-13 11:48:53 -04:00
test_sequence_feature_extraction_common.py
test_tokenization_common.py Enable some Jinja extensions and add datetime capabilities (#32684) 2024-08-23 14:26:12 +01:00