transformers/tests
Arthur 8a8a0a4ae0
[Llama ROPE] Fix torch export but also slow downs in forward (#29198)
* remove control flow

* update gptneox

* update ....

* nits

* Actually let's just break. Otherwise we are silently failing which imo is not optimal

* version BC

* fix tests

* fix eager causal

* nit

* add a test

* style

* nits

* nits

* more nits for the test

* update and fix

* make sure cuda graphs are not skipped

* read token is needed for meta llama

* update!

* fiixup

* compile test should be slow

* fix thet fix copies

* stle 🫠
2024-02-28 10:45:53 +01:00
..
benchmark
bettertransformer
deepspeed fix failing trainer ds tests (#29057) 2024-02-16 17:18:45 +05:30
extended
fixtures
fsdp Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
generation GenerationConfig validate both constraints and force_words_ids (#29163) 2024-02-27 01:43:52 +01:00
models [Llama ROPE] Fix torch export but also slow downs in forward (#29198) 2024-02-28 10:45:53 +01:00
optimization
peft_integration
pipelines Token level timestamps for long-form generation in Whisper (#29148) 2024-02-27 18:15:26 +00:00
quantization Cleaner Cache dtype and device extraction for CUDA graph generation for quantizers compatibility (#29079) 2024-02-27 09:32:39 +01:00
repo_utils
sagemaker Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
tokenization Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
tools
trainer FIX [PEFT / Trainer ] Handle better peft + quantized compiled models (#29055) 2024-02-20 12:45:08 +01:00
utils Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
__init__.py
test_backbone_common.py
test_cache_utils.py Llama: fix batched generation (#29109) 2024-02-20 10:23:17 +00:00
test_configuration_common.py
test_configuration_utils.py Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
test_feature_extraction_common.py
test_feature_extraction_utils.py
test_image_processing_common.py Raise unused kwargs image processor (#29063) 2024-02-20 16:20:20 +01:00
test_image_processing_utils.py
test_image_transforms.py
test_modeling_common.py Adding SegGPT (#27735) 2024-02-26 18:17:19 +00:00
test_modeling_flax_common.py [Flax] Update no init test for Flax v0.7.1 (#28735) 2024-01-26 18:20:39 +00:00
test_modeling_flax_utils.py Enable safetensors conversion from PyTorch to other frameworks without the torch requirement (#27599) 2024-01-23 10:28:23 +01:00
test_modeling_tf_common.py Add tf_keras imports to prepare for Keras 3 (#28588) 2024-01-30 17:26:36 +00:00
test_modeling_tf_utils.py Add tf_keras imports to prepare for Keras 3 (#28588) 2024-01-30 17:26:36 +00:00
test_modeling_utils.py Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
test_pipeline_mixin.py Image Feature Extraction pipeline (#28216) 2024-02-05 14:50:07 +00:00
test_processing_common.py Don't save processor_config.json if a processor has no extra attribute (#28584) 2024-01-19 09:59:14 +00:00
test_sequence_feature_extraction_common.py
test_tokenization_common.py Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
test_tokenization_utils.py Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00