transformers/tests
Zach Mueller 3d6e55c7e7 Fix model kwargs (#35875)
* Save state

* Make a failing test

* Better test

* mpt -> done, many more to go

* Rm extraneous

* Bamba

* Bert

* big_bird

* biogpt

* bloom

* codegen

* ctrl

* data2vec

* dbrx

* Through up to Dbrx

* electra

* ernie

* falcon

* Fuyu/persimmon

* Include noop kwargs to base models

* Rebase

* Skip musicgen

* Refactor/skip mllama

* Revert makefile

* Rm file

* Fix PT failing, need to modify rest of loss funcs to not resize

* Propagate some

* Continue

* More

* More options

* Mostly fixed

* Proved that it's the same

* Bloom is good

* Make ability to override loss func possible

* Fixup

* Clean

* Fix xglm

* Quality tests

* Skip OCR2

* Make specific loss for xglm

* Make order the same/line up 1:1

* xglm

* Skip fx output loss bloom model

* Didn't pass in pad_token_id

* Fix quality
2025-02-06 21:09:51 +01:00
agents Change is_soundfile_availble to is_soundfile_available (#35030) 2025-01-03 14:37:42 +01:00
benchmark
bettertransformer
deepspeed Trainer - deprecate tokenizer for processing_class (#32385) 2024-10-02 14:08:46 +01:00
extended [tests] skip tests for xpu (#33553) 2024-09-19 19:28:04 +01:00
fixtures
fsdp FSDP grad accum fix (#34645) 2024-11-15 22:28:06 +01:00
generation [WIP] Emu3: add model (#33770) 2025-01-10 12:30:23 +01:00
models Paligemma: fix generation with Gemma2 (#36044) 2025-02-06 14:37:47 +01:00
optimization
peft_integration added logic for deleting adapters once loaded (#34650) 2025-01-06 18:36:40 +00:00
pipelines Pipeline: simple API for assisted generation (#34504) 2025-01-08 17:08:02 +00:00
quantization Add Gemma2 GGUF support (#34002) 2025-01-03 14:50:07 +01:00
repo_utils Fix modular edge case + modular sorting order (#35562) 2025-01-09 17:17:52 +01:00
sagemaker Trainer - deprecate tokenizer for processing_class (#32385) 2024-10-02 14:08:46 +01:00
tokenization tokenizer train from iterator without pre_tokenizers (#35396) 2025-01-09 15:34:43 +01:00
tp Simplify Tensor Parallel implementation with PyTorch TP (#34184) 2024-11-18 19:51:49 +01:00
trainer Fix condition when GA loss bug fix is not performed (#35651) 2025-01-20 16:12:49 +01:00
utils Fix: loading DBRX back from saved path (#35728) 2025-01-30 09:24:51 +01:00
__init__.py
test_backbone_common.py
test_configuration_common.py Load sub-configs from composite configs (#34410) 2024-11-05 11:34:01 +01:00
test_feature_extraction_common.py
test_image_processing_common.py Fix Qwen2VL processor to handle odd number of frames (#35431) 2025-01-08 13:49:00 +01:00
test_image_transforms.py
test_modeling_common.py Fix model kwargs (#35875) 2025-02-06 21:09:51 +01:00
test_modeling_flax_common.py 🚨All attention refactor🚨 (#35235) 2024-12-18 16:53:39 +01:00
test_modeling_tf_common.py 🚨All attention refactor🚨 (#35235) 2024-12-18 16:53:39 +01:00
test_pipeline_mixin.py Add image text to text pipeline (#34170) 2024-10-31 15:48:11 -04:00
test_processing_common.py VLMs: major clean up 🧼 (#34502) 2025-01-08 10:35:23 +01:00
test_sequence_feature_extraction_common.py
test_tokenization_common.py [tokenizers] Ensure that add_prefix_space is propagated to backend_tokenizer.pre_tokenizer (#35593) 2025-01-09 17:46:50 +01:00