transformers/utils
Pavel Iakubovskii 5fcf6286bf
Add TimmWrapper (#34564)
* Add files

* Init

* Add TimmWrapperModel

* Fix up

* Some fixes

* Fix up

* Remove old file

* Sort out import orders

* Fix some model loading

* Compatible with pipeline and trainer

* Fix up

* Delete test_timm_model_1/config.json

* Remove accidentally commited files

* Delete src/transformers/models/modeling_timm_wrapper.py

* Remove empty imports; fix transformations applied

* Tidy up

* Add image classifcation model to special cases

* Create pretrained model; enable device_map='auto'

* Enable most tests; fix init order

* Sort imports

* [run-slow] timm_wrapper

* Pass num_classes into timm.create_model

* Remove train transforms from image processor

* Update timm creation with pretrained=False

* Fix gamma/beta issue for timm models

* Fixing gamma and beta renaming for timm models

* Simplify config and model creation

* Remove attn_implementation diff

* Fixup

* Docstrings

* Fix warning msg text according to test case

* Fix device_map auto

* Set dtype and device for pixel_values in forward

* Enable output hidden states

* Enable tests for hidden_states and model parallel

* Remove default scriptable arg

* Refactor inner model

* Update timm version

* Fix _find_mismatched_keys function

* Change inheritance for Classification model (fix weights loading with device_map)

* Minor bugfix

* Disable save pretrained for image processor

* Rename hook method for loaded keys correction

* Rename state dict keys on save, remove `timm_model` prefix, make checkpoint compatible with `timm`

* Managing num_labels <-> num_classes attributes

* Enable loading checkpoints in Trainer to resume training

* Update error message for output_hidden_states

* Add output hidden states test

* Decouple base and classification models

* Add more test cases

* Add save-load-to-timm test

* Fix test name

* Fixup

* Add do_pooling

* Add test for do_pooling

* Fix doc

* Add tests for TimmWrapperModel

* Add validation for `num_classes=0` in timm config + test for DINO checkpoint

* Adjust atol for test

* Fix docs

* dev-ci

* dev-ci

* Add tests for image processor

* Update docs

* Update init to new format

* Update docs in configuration

* Fix some docs in image processor

* Improve docs for modeling

* fix for is_timm_checkpoint

* Update code examples

* Fix header

* Fix typehint

* Increase tolerance a bit

* Fix Path

* Fixing model parallel tests

* Disable "parallel" tests

* Add comment for metadata

* Refactor AutoImageProcessor for timm wrapper loading

* Remove custom test_model_outputs_equivalence

* Add require_timm decorator

* Fix comment

* Make image processor work with older timm versions and tensor input

* Save config instead of whole model in image processor tests

* Add docstring for `image_processor_filename`

* Sanitize kwargs for timm image processor

* Fix doc style

* Update check for tensor input

* Update normalize

* Remove _load_timm_model function

---------

Co-authored-by: Amy Roberts <22614925+amyeroberts@users.noreply.github.com>
2024-12-11 12:40:30 +00:00
..
test_module
tf_ops
add_pipeline_model_mapping_to_test.py
check_bad_commit.py Fix utils/check_bad_commit.py (for auto ping in CI) (#34943) 2024-11-28 15:34:38 +01:00
check_build.py Fix import of FalconMambaForCausalLM (#33381) 2024-09-10 09:14:54 +02:00
check_config_attributes.py Add Aria (#34157) 2024-12-06 12:17:34 +01:00
check_config_docstrings.py Add TimmWrapper (#34564) 2024-12-11 12:40:30 +00:00
check_copies.py Generate: visit non-llm prepare_inputs_for_generation (#34199) 2024-10-17 16:53:48 +01:00
check_doc_toc.py
check_docstrings.py Add Aria (#34157) 2024-12-06 12:17:34 +01:00
check_doctest_list.py
check_dummies.py
check_inits.py Fix import of FalconMambaForCausalLM (#33381) 2024-09-10 09:14:54 +02:00
check_model_tester.py
check_modular_conversion.py Large modular logic refactoring (#34487) 2024-11-01 10:13:51 +01:00
check_repo.py Add Aria (#34157) 2024-12-06 12:17:34 +01:00
check_self_hosted_runner.py
check_support_list.py [RoBERTa-based] Add support for sdpa (#30510) 2024-08-28 10:26:00 +02:00
check_table.py Add OmDet-Turbo (#31843) 2024-09-25 13:26:28 -04:00
check_tf_ops.py
create_dependency_mapping.py Modular transformers: modularity and inheritance for new model additions (#33248) 2024-09-24 15:54:07 +02:00
create_dummy_models.py CI: fix efficientnet pipeline timeout and prevent future similar issues due to large image size (#33123) 2024-08-27 11:58:27 +01:00
custom_init_isort.py Import structure & first three model refactors (#31329) 2024-09-10 11:10:53 +02:00
deprecate_models.py
download_glue_data.py
extract_warnings.py
get_ci_error_statistics.py
get_github_job_time.py
get_modified_files.py
get_previous_daily_ci.py Ping team members for new failed tests in daily CI (#34171) 2024-10-17 16:11:52 +02:00
get_test_info.py CI: fix efficientnet pipeline timeout and prevent future similar issues due to large image size (#33123) 2024-08-27 11:58:27 +01:00
important_models.txt
models_to_deprecate.py
modular_model_converter.py Add Aria (#34157) 2024-12-06 12:17:34 +01:00
not_doctested.txt Add support for __all__ and potentilly deleting functions (#33859) 2024-10-08 10:19:17 +02:00
notification_service.py Fix CI slack reporting issue (#34833) 2024-11-20 21:36:13 +01:00
notification_service_doc_tests.py
notification_service_quantization.py
past_ci_versions.py
patch_helper.py [Patch helper] update to not have to checkout main (#34006) 2024-10-09 09:21:46 +02:00
pr_slow_ci_models.py
print_env.py
process_bad_commit_report.py Tiny update after #34383 (#34404) 2024-10-28 12:01:05 +01:00
process_test_artifacts.py fix the parallel number of CI nodes when it is smaller than number of tests (#33276) 2024-09-03 16:53:21 +02:00
release.py
set_cuda_devices_for_ci.py
slow_documentation_tests.txt
sort_auto_mappings.py
split_doctest_jobs.py
split_model_tests.py
tests_fetcher.py no filter (#34391) 2024-10-25 12:32:39 +02:00
update_metadata.py Add image text to text pipeline (#34170) 2024-10-31 15:48:11 -04:00
update_tiny_models.py Mention model_info.id instead of model_info.modelId (#32106) 2024-07-22 14:14:47 +01:00