transformers/tests
Arthur c23a1c1932
Add-helium (#35669)
* Add the helium model.

* Add a missing helium.

* And add another missing helium.

* Use float for the rmsnorm mul.

* Add the Helium tokenizer converter.

* Add the pad token as suggested by Arthur.

* Update the RMSNorm + some other tweaks.

* Fix more rebase issues.

* fix copies and style

* fixes and add helium.md

* add missing tests

* udpate the backlink

* oups

* style

* update init, and expected results

* small fixes

* match test outputs

* style fixup, fix doc builder

* add dummies and we should be good to go!z

* update sdpa and fa2 documentation

---------

Co-authored-by: laurent <laurent.mazare@gmail.com>
2025-01-13 18:41:15 +01:00
..
agents Change is_soundfile_availble to is_soundfile_available (#35030) 2025-01-03 14:37:42 +01:00
benchmark
bettertransformer
deepspeed Use inherit tempdir makers for tests + fix failing DS tests (#35600) 2025-01-10 10:01:58 -05:00
extended
fixtures
fsdp [tests] make cuda-only tests device-agnostic (#35607) 2025-01-13 14:48:39 +01:00
generation [tests] make cuda-only tests device-agnostic (#35607) 2025-01-13 14:48:39 +01:00
models Add-helium (#35669) 2025-01-13 18:41:15 +01:00
optimization
peft_integration added logic for deleting adapters once loaded (#34650) 2025-01-06 18:36:40 +00:00
pipelines [tests] make cuda-only tests device-agnostic (#35607) 2025-01-13 14:48:39 +01:00
quantization [tests] make cuda-only tests device-agnostic (#35607) 2025-01-13 14:48:39 +01:00
repo_utils Fix modular edge case + modular sorting order (#35562) 2025-01-09 17:17:52 +01:00
sagemaker
tokenization tokenizer train from iterator without pre_tokenizers (#35396) 2025-01-09 15:34:43 +01:00
tp
trainer [tests] make cuda-only tests device-agnostic (#35607) 2025-01-13 14:48:39 +01:00
utils Enable different torch dtype in sub models (#34873) 2025-01-13 13:42:08 +01:00
__init__.py
test_backbone_common.py
test_configuration_common.py
test_feature_extraction_common.py
test_image_processing_common.py Fix Qwen2VL processor to handle odd number of frames (#35431) 2025-01-08 13:49:00 +01:00
test_image_transforms.py
test_modeling_common.py [tests] make cuda-only tests device-agnostic (#35607) 2025-01-13 14:48:39 +01:00
test_modeling_flax_common.py 🚨All attention refactor🚨 (#35235) 2024-12-18 16:53:39 +01:00
test_modeling_tf_common.py 🚨All attention refactor🚨 (#35235) 2024-12-18 16:53:39 +01:00
test_pipeline_mixin.py
test_processing_common.py VLMs: major clean up 🧼 (#34502) 2025-01-08 10:35:23 +01:00
test_sequence_feature_extraction_common.py
test_tokenization_common.py [tokenizers] Ensure that add_prefix_space is propagated to backend_tokenizer.pre_tokenizer (#35593) 2025-01-09 17:46:50 +01:00