transformers

mirror of https://github.com/saymrwulf/transformers.git synced 2026-05-14 20:58:08 +00:00

History

Arthur 8a8a0a4ae0 [`Llama ROPE`] Fix torch export but also slow downs in forward (#29198 ) * remove control flow * update gptneox * update .... * nits * Actually let's just break. Otherwise we are silently failing which imo is not optimal * version BC * fix tests * fix eager causal * nit * add a test * style * nits * nits * more nits for the test * update and fix * make sure cuda graphs are not skipped * read token is needed for meta llama * update! * fiixup * compile test should be slow * fix thet fix copies * stle 🫠		2024-02-28 10:45:53 +01:00
..
benchmark
bettertransformer
deepspeed	fix failing trainer ds tests (#29057 )	2024-02-16 17:18:45 +05:30
extended
fixtures
fsdp	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
generation	GenerationConfig validate both constraints and force_words_ids (#29163 )	2024-02-27 01:43:52 +01:00
models	[`Llama ROPE`] Fix torch export but also slow downs in forward (#29198 )	2024-02-28 10:45:53 +01:00
optimization
peft_integration
pipelines	Token level timestamps for long-form generation in Whisper (#29148 )	2024-02-27 18:15:26 +00:00
quantization	Cleaner Cache `dtype` and `device` extraction for CUDA graph generation for quantizers compatibility (#29079 )	2024-02-27 09:32:39 +01:00
repo_utils
sagemaker	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
tokenization	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
tools
trainer	FIX [`PEFT` / `Trainer` ] Handle better peft + quantized compiled models (#29055 )	2024-02-20 12:45:08 +01:00
utils	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
__init__.py
test_backbone_common.py
test_cache_utils.py	Llama: fix batched generation (#29109 )	2024-02-20 10:23:17 +00:00
test_configuration_common.py
test_configuration_utils.py	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
test_feature_extraction_common.py
test_feature_extraction_utils.py
test_image_processing_common.py	Raise unused kwargs image processor (#29063 )	2024-02-20 16:20:20 +01:00
test_image_processing_utils.py
test_image_transforms.py
test_modeling_common.py	Adding SegGPT (#27735 )	2024-02-26 18:17:19 +00:00
test_modeling_flax_common.py	[Flax] Update no init test for Flax v0.7.1 (#28735 )	2024-01-26 18:20:39 +00:00
test_modeling_flax_utils.py	Enable safetensors conversion from PyTorch to other frameworks without the torch requirement (#27599 )	2024-01-23 10:28:23 +01:00
test_modeling_tf_common.py	Add tf_keras imports to prepare for Keras 3 (#28588 )	2024-01-30 17:26:36 +00:00
test_modeling_tf_utils.py	Add tf_keras imports to prepare for Keras 3 (#28588 )	2024-01-30 17:26:36 +00:00
test_modeling_utils.py	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
test_pipeline_mixin.py	Image Feature Extraction pipeline (#28216 )	2024-02-05 14:50:07 +00:00
test_processing_common.py	Don't save `processor_config.json` if a processor has no extra attribute (#28584 )	2024-01-19 09:59:14 +00:00
test_sequence_feature_extraction_common.py
test_tokenization_common.py	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
test_tokenization_utils.py	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00