transformers

mirror of https://github.com/saymrwulf/transformers.git synced 2026-05-14 20:58:08 +00:00

Author	SHA1	Message	Date
Matt	5a2ff5dfb0	Set to pull_request_target, testing works!	2025-01-24 15:42:53 +00:00
Matt	fdaacaaf03	Set back to pull_request for testing	2025-01-24 15:40:34 +00:00
Matt	feafbe087e	Update the script	2025-01-24 15:40:15 +00:00
Matt	f124ec012b	Use pull-request-target instead	2025-01-24 14:57:26 +00:00
Matt	016ae273a2	Add TODO	2025-01-23 17:50:00 +00:00
Matt	3a12f71ab9	Request reviews instead of assigning	2025-01-22 20:10:49 +00:00
Matt	580aa713cf	Request reviews instead of assigning	2025-01-22 20:05:56 +00:00
Matt	8b20315634	Remove prefix	2025-01-22 19:53:16 +00:00
Matt	adad02848a	Strip inline comments	2025-01-22 19:39:14 +00:00
Matt	27d2961545	Update debug logs	2025-01-22 19:35:34 +00:00
Matt	3d6105a8d8	Update workflow permissions	2025-01-22 19:29:17 +00:00
Matt	8dc084682c	Update workflow permissions	2025-01-22 19:23:27 +00:00
Matt	6b0f5b9b24	Correct path for codeowners file	2025-01-22 19:15:51 +00:00
Matt	ef3df762f3	Temporarily comment out the opened line so we can test the script	2025-01-22 19:13:07 +00:00
Matt	e96ba83ad4	Don't reassign reviewers if we already have them	2025-01-22 19:08:37 +00:00
Matt	4333c61971	fix missing import	2025-01-22 19:06:54 +00:00
Matt	e17ab9831e	First draft of github action on PR opening for auto-assigning reviewers	2025-01-22 19:04:36 +00:00
Marc Sun	2c3a44f9a7	Fix NoneType type as it requires py>=3.10 (#35843 ) fix type	2025-01-22 15:56:53 +00:00
Mohit Sharma	fdcc62c855	Add PyTorch version check for FA backend on AMD GPUs (#35813 ) Disable FA backend for SDPA on AMD GPUs (PyTorch < 2.4.1)	2025-01-22 16:09:23 +01:00
LRL-ModelCloud	3b9770581e	Fix compatibility issues when using auto_gptq with these older versions (#35830 ) convert_model method of optimum only accepts a single nn.Module type model parameter for versions less than 1.23.99.	2025-01-22 15:46:47 +01:00
Joao Gante	62bd83947a	[chat] docs fix (#35840 ) docs fix	2025-01-22 14:32:27 +00:00
Isotr0py	487e2f63bd	Fix `head_dim` in config extracted from Gemma2 GGUF model (#35818 ) fix gemma2 head dim Signed-off-by: Isotr0py <2037008807@qq.com> Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>	2025-01-22 15:22:04 +01:00
Joao Gante	b3d6722469	[Chat] Add Chat from TRL 🐈 (#35714 ) * tmp commit * add working chat * add docts * docs 2 * use auto dtype by default	2025-01-22 13:30:12 +00:00
Mohamed Mekkouri	a7738f5a89	Fix : Nemotron tokenizer for GGUF format (#35836 ) fix nemotron gguf	2025-01-22 12:28:40 +01:00
Joao Gante	ec28957f94	[pipeline] missing import regarding assisted generation (#35752 ) missing import	2025-01-22 10:34:28 +00:00
Joao Gante	36c9181f5c	[gpt2] fix generation tests (#35822 ) fix gpt2 generation tests	2025-01-22 09:41:04 +00:00
Yih-Dar	f439e28d32	Hotfix: missing `working-directory` in `self-comment-ci.yml` (#35833 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-01-22 10:25:50 +01:00
Raushan Turganbay	373e50e970	Init cache on meta device (#35164 ) * init cache on meta device * offloaded static + enable tests * tests weren't running before :( * update * fix mamba * fix copies * update * address comments and fix tests * fix copies * Update src/transformers/cache_utils.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * update * mamba fix --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2025-01-22 09:49:17 +01:00
Yih-Dar	870e2c8ea0	Another security patch for `self-comment-ci.yml` (#35816 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-01-22 09:29:54 +01:00
CalOmnie	f4f33a20a2	Remove pyav pin to allow python 3.11 to be used (#35823 ) * Remove pyav pin to allow python 3.11 to be used * Run make fixup --------- Co-authored-by: Louis Groux <louis.cal.groux@gmail.com>	2025-01-21 20:16:18 +00:00
Joao Gante	90b46e983f	Remove old `benchmark` code (#35730 ) * remove traces of the old deprecated benchmarks * also remove old tf benchmark example, which uses deleted code * run doc builder	2025-01-21 17:56:43 +00:00
eustlb	870eb7b41b	[Mimi] update test expected values for t4 runners (#35696 ) update values for t4	2025-01-21 18:23:36 +01:00
Cyril Vallez	8ac851b0b3	Improve modular documentation (#35737 ) * start a nice doc * keep improving the doc * Finalize doc * Update modular_transformers.md * apply suggestion	2025-01-21 17:53:30 +01:00
Yoni Gozlan	107f9f5127	add Qwen2-VL image processor fast (#35733 ) * add qwen2_vl image processor fast * add device to ImagesKwargs * remove automatic fix copies * fix fast_is_faster_than_slow * remove unnecessary import	2025-01-21 11:49:05 -05:00
eustlb	3df90103b8	move fastspeech to audio models (#35788 )	2025-01-21 08:32:09 -08:00
Ahmed Almaghz	741d55237a	[i18n-ar] Translated file: `docs/source/ar/tasks/masked_language_modeling.md` into Arabic (#35198 ) * Add the Arabic translation: masked_language_modeling.md * Update docs/source/ar/tasks/masked_language_modeling.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/tasks/masked_language_modeling.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/tasks/masked_language_modeling.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/tasks/masked_language_modeling.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/tasks/masked_language_modeling.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/tasks/masked_language_modeling.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/tasks/masked_language_modeling.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/tasks/masked_language_modeling.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/tasks/masked_language_modeling.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/tasks/masked_language_modeling.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/tasks/masked_language_modeling.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/tasks/masked_language_modeling.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/tasks/masked_language_modeling.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update _toctree.yml * Update _toctree.yml * Add language_modeling.md * Add Sequence_classifiation.md * Update _toctree.yml --------- Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>	2025-01-21 08:29:58 -08:00
v2ray	568941bf11	Optimized set_initialized_submodules. (#35493 )	2025-01-21 17:01:28 +01:00
Lucain	7051c5fcc8	Remove deprecated `get_cached_models` (#35809 ) * Remove deprecated get_cached_models * imports	2025-01-21 16:08:31 +01:00
InfroLab	97fbaf0861	Fixed typo in autoawq version number in an error message for IPEX backend requirements. (#35815 ) Fixed typo in version number for IPEX backend required minimal autoawq version	2025-01-21 14:42:44 +00:00
Mohamed Mekkouri	dbd8474125	Fix : BLOOM tie_word_embeddings in GGUF (#35812 ) * fix bloom ggml * fix falcon output * make style	2025-01-21 15:35:54 +01:00
Pedro Cuenca	678bd7f1ce	Auto-add `timm` tag to timm-wrapper models. (#35794 ) Works for fine-tuned or exported models: ```py from transformers import AutoModelForImageClassification checkpoint = "timm/vit_base_patch16_224.augreg2_in21k_ft_in1k" model = AutoModelForImageClassification.from_pretrained(checkpoint) model.push_to_hub("pcuenq/tw1") ``` The uploaded model will now show snippets for both the timm and the transformers libraries.	2025-01-21 14:34:45 +01:00
fzyzcjy	dc10f7906a	Support adamw_torch_8bit (#34993 ) * var * more * test	2025-01-21 14:17:49 +01:00
Louie Tsai	f82b19cb6f	add a new flax example for Bert model inference (#34794 ) * add a new example for flax inference cases * Update examples/flax/language-modeling/README.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update examples/flax/language-modeling/README.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update examples/flax/language-modeling/README.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update examples/flax/language-modeling/README.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update examples/flax/language-modeling/README.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update examples/flax/language-modeling/README.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * fix for "make fixup" --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-01-21 14:09:29 +01:00
Aritra Roy Gosthipaty	edbabf6b82	[Doc] Adding blog post to model doc for `TimmWrapper` (#35744 ) * adding blog post to model doc * Update docs/source/en/model_doc/timm_wrapper.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * review suggestions * review suggestions --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-01-21 12:32:39 +00:00
Yih-Dar	fd8d61fdb2	Byebye `test_batching_equivalence`'s flakiness (#35729 ) * fix * fix * skip * better error message --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-01-21 13:11:33 +01:00
NielsRogge	78f5ee0217	Add LlavaImageProcessor (#33191 ) * First draft * Add equivalence test * Update docstrings * Add tests * Use numpy * Fix tests * Improve variable names * Improve docstring * Add link * Remove script * Add copied from * Address comment * Add note in docs * Add docstring, data format * Improve test * Add test * update * Update src/transformers/models/llava/image_processing_llava.py Co-authored-by: Pavel Iakubovskii <qubvel@gmail.com> * Update src/transformers/models/llava/image_processing_llava.py Co-authored-by: Pavel Iakubovskii <qubvel@gmail.com> * loop once only --------- Co-authored-by: raushan <raushan@huggingface.co> Co-authored-by: Raushan Turganbay <raushan.turganbay@alumni.nu.edu.kz> Co-authored-by: Pavel Iakubovskii <qubvel@gmail.com>	2025-01-21 12:47:04 +01:00
ivarflakstad	8e4cedd9ca	Update AMD Docker image (#35804 )	2025-01-21 12:11:23 +01:00
Raushan Turganbay	705aeaaa12	Fix "test_chat_template_dict" in video LLMs (#35660 ) * fix "test_chat_template_dict" in llava_onevision * Update src/transformers/models/llava_next_video/processing_llava_next_video.py Co-authored-by: Pavel Iakubovskii <qubvel@gmail.com> * get one video calles once --------- Co-authored-by: Pavel Iakubovskii <qubvel@gmail.com>	2025-01-21 10:23:40 +01:00
Cyril Vallez	e867b97443	Deterministic sorting in modular converter when adding new functions (#35795 ) deterministic sort	2025-01-21 09:38:48 +01:00
Nikos Antoniou	920f34a772	modular_model_converter bugfix on assignments (#35642 ) * added bugfix in modular converter to keep modular assignments for docstrings, expected outputs etc. * revert stracoder2 docstring copying, add forward in EMU3 to enable docstring assingment, remove verbatim assignments in modular converter * added _FOR_DOC in assignments to keep, corrected wrong checkpoint name in ijepa's configuration	2025-01-21 08:06:44 +01:00

17879 commits