transformers/tests/generation
Arthur 4bff54f921
Gemma capping (#34282)
* softcapping

* soft cap before the mask

* style

* ...

* super nit

* update

* fixes

* update

* small issue with modular

* fix modular imports

* update

* fixup

* simplify a hell lot

* simplify cleaning imports

* finish fixing

* update our design

* nits

* use a deprecation cycle

* updates

* Fix modular (recursive deps need to always be computed after merges!)

* push

* fix

* update

* fix modular order

* make fix-copies

* updates

* update

* ?

* don't compile for now

* ?

* fix some stuff

* donc!

* fix copies

* update

* fixup

* ?

* fix two tests

* fix?

* for now, don't use head info

* eager when output attentoin and sdpa or flash as it's the simplest behaviour (for our tests as well :))

* fix-copies

* revert sdpa check

* Apply suggestions from code review

Co-authored-by: Cyril Vallez <cyril.vallez@huggingface.co>

* rebase, fix-copies and push

* add a slow integration test

* update the test

* fix left padding issue

* fix test

* remove duplicate scaling

* quality

* add a small test and make sure it works

* 2b

---------

Co-authored-by: Cyril Vallez <cyril.vallez@gmail.com>
Co-authored-by: Cyril Vallez <cyril.vallez@huggingface.co>
2024-11-19 13:52:38 +01:00
..
__init__.py
test_beam_constraints.py
test_beam_search.py
test_configuration_utils.py Fix PushToHubMixin when pusing to a PR revision (#34090) 2024-10-11 15:06:15 +02:00
test_flax_logits_process.py Adding FlaxNoRepeatNGramLogitsProcessor (#29677) 2024-04-02 11:39:33 +02:00
test_flax_utils.py Fix CI (#34458) 2024-10-29 08:26:04 +01:00
test_framework_agnostic.py Generation: fix handling of special tokens (#31254) 2024-06-06 15:21:32 +05:00
test_fsdp.py Default synced_gpus to True when using FullyShardedDataParallel (#33483) 2024-10-10 14:09:04 -04:00
test_logits_process.py Add SynthID (watermerking by Google DeepMind) (#34350) 2024-10-23 21:18:52 +01:00
test_stopping_criteria.py Dynamic number of speculative tokens in order to accelerate speculative decoding (#33258) 2024-09-11 14:22:28 +02:00
test_streamers.py
test_tf_logits_process.py fix: multilingual midel convert to tflite get wrong token (#32079) 2024-08-27 11:44:09 +02:00
test_tf_utils.py Revert workaround for TF safetensors loading (#30128) 2024-04-09 11:04:18 +01:00
test_utils.py Gemma capping (#34282) 2024-11-19 13:52:38 +01:00