transformers/tests
Ita Zaporozhets deba7655e6
Add split special tokens (#30772)
* seems like `split_special_tokens` is used here

* split special token

* add new line at end of file

* moving split special token test to common tests

* added assertions

* test

* fixup

* add co-author

* passing rest of args to gptsan_japanese, fixing tests

* removing direct comparison of fast and slow models

* adding test support for UDOP and LayoutXLM

* ruff fix

* readd check if slow tokenizer

* modify test to handle bos tokens

* removing commented function

* trigger build

* applying review feedback - updated docstrings, var names, and simplified tests

* ruff fixes

* Update tests/test_tokenization_common.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* applying feedback, comments

* shutil temp directory fix

---------

Co-authored-by: Arthur Zucker <arthur.zucker@gmail.com>
Co-authored-by: Ita Zaporozhets <itazaporozhets@Itas-MBP.localdomain>
Co-authored-by: itazap <itazap@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Ita Zaporozhets <itazaporozhets@Itas-MacBook-Pro.local>
2024-05-24 08:38:58 -07:00
..
agents Reboot Agents (#30387) 2024-05-07 12:59:49 +02:00
benchmark
bettertransformer
deepspeed Update ds_config_zero3.json (#30829) 2024-05-15 10:02:31 -04:00
extended CI: update to ROCm 6.0.2 and test MI300 (#30266) 2024-05-13 18:14:36 +02:00
fixtures
fsdp Add FSDP config for CPU RAM efficient loading through accelerate (#30002) 2024-04-22 13:15:28 +01:00
generation Quantized KV Cache (#30483) 2024-05-23 17:25:20 +05:00
models Add split special tokens (#30772) 2024-05-24 08:38:58 -07:00
optimization Add WSD scheduler (#30231) 2024-04-25 12:07:21 +01:00
peft_integration
pipelines Using assistant in AutomaticSpeechRecognitionPipeline with different encoder size (#30637) 2024-05-23 09:59:38 +01:00
quantization Quantization / TST: Fix remaining quantization tests (#31000) 2024-05-24 14:35:59 +02:00
repo_utils
sagemaker update ruff version (#30932) 2024-05-22 06:40:15 +02:00
tokenization update ruff version (#30932) 2024-05-22 06:40:15 +02:00
trainer Enforce saving at end of training if saving option chosen (#30160) 2024-05-21 07:50:11 -04:00
utils 🚨 out_indices always a list (#30941) 2024-05-22 15:23:04 +01:00
__init__.py
test_backbone_common.py
test_cache_utils.py
test_configuration_common.py
test_configuration_utils.py Fix resume_download future warning (#31007) 2024-05-24 14:35:40 +02:00
test_feature_extraction_common.py
test_feature_extraction_utils.py
test_image_processing_common.py
test_image_processing_utils.py
test_image_transforms.py fix: center_crop occasionally outputs off-by-one dimension matrix (#30934) 2024-05-21 13:56:52 +01:00
test_modeling_common.py [tests] make test_model_parallelism device-agnostic (#30844) 2024-05-24 11:51:51 +01:00
test_modeling_flax_common.py add sdpa to ViT [follow up of #29325] (#30555) 2024-05-16 10:56:11 +01:00
test_modeling_flax_utils.py
test_modeling_tf_common.py Port IDEFICS to tensorflow (#26870) 2024-05-13 15:59:46 +01:00
test_modeling_tf_utils.py
test_modeling_utils.py Llama: fix custom 4D masks, v2 (#30348) 2024-05-13 13:46:06 +02:00
test_pipeline_mixin.py
test_processing_common.py
test_sequence_feature_extraction_common.py
test_tokenization_common.py Add split special tokens (#30772) 2024-05-24 08:38:58 -07:00
test_tokenization_utils.py