transformers/docs/source/zh
Yoni Gozlan 203e27059b
Add image text to text pipeline (#34170)
* Standardize image-text-to-text-models-output

add post_process_image_text_to_text to chameleon and cleanup

Fix legacy kwarg behavior and deprecation warning

add post_process_image_text_to_text to qwen2_vl and llava_onevision

Add post_process_image_text_to_text to idefics3, mllama, pixtral processor

* nit var name post_process_image_text_to_text udop

* nit fix deprecation warnings

* Add image-text-to-text pipeline

* add support for image url in chat template for pipeline

* Reformat to be fully compatible with chat templates

* Add tests chat template

* Fix imports and tests

* Add pipeline tag

* change logic handling of single prompt ans multiple images

* add pipeline mapping to models

* fix batched inference

* fix tests

* Add manual batching for preprocessing

* Fix outputs with nested images

* Add support for all common processing kwargs

* Add default padding when multiple text inputs (batch size>1)

* nit change version deprecation warning

* Add support for text only inference

* add chat_template warnings

* Add pipeline tests and add copied from post process function

* Fix batched pipeline tests

* nit

* Fix pipeline tests blip2

* remove unnecessary max_new_tokens

* revert processing kosmos2 and remove unnecessary max_new_tokens

* fix pipeline tests idefics

* Force try loading processor if pipeline supports it

* revert load_processor change

* hardcode loading only processor

* remove unnecessary try except

* skip imagetexttotext tests for kosmos2 as tiny model causes problems

* Make code clearer

* Address review comments

* remove preprocessing logic from pipeline

* fix fuyu

* add BC resize fuyu

* Move post_process_image_text_to_text to ProcessorMixin

* add guard in post_process

* fix zero shot object detection pipeline

* add support for generator input in pipeline

* nit

* change default image-text-to-text model to llava onevision

* fix owlv2 size dict

* Change legacy deprecation warning to only show when True
2024-10-31 15:48:11 -04:00
..
internal Forbid PretrainedConfig from saving generate parameters; Update deprecations in generate-related code 🧹 (#32659) 2024-08-23 11:12:53 +01:00
main_classes Add image text to text pipeline (#34170) 2024-10-31 15:48:11 -04:00
tasks Trainer - deprecate tokenizer for processing_class (#32385) 2024-10-02 14:08:46 +01:00
_toctree.yml translated gguf.md into chinese (#34163) 2024-10-24 11:47:58 +02:00
accelerate.md
add_new_pipeline.md
autoclass_tutorial.md
big_models.md
chat_templating.md 🚨 No more default chat templates (#31733) 2024-07-24 17:36:32 +01:00
contributing.md Drop support for Python 3.8 (#34314) 2024-10-24 11:16:55 +02:00
create_a_model.md
custom_models.md Updated the custom_models.md changed cross_entropy code (#33118) 2024-08-26 13:15:43 +02:00
debugging.md
fast_tokenizers.md
fsdp.md
gguf.md translated gguf.md into chinese (#34163) 2024-10-24 11:47:58 +02:00
hpo_train.md Trainer - deprecate tokenizer for processing_class (#32385) 2024-10-02 14:08:46 +01:00
index.md
installation.md Use HF_HUB_OFFLINE + fix has_file in offline mode (#31016) 2024-05-29 11:55:43 +01:00
llm_tutorial.md Fix issue #32518: Update llm_tutorial.md (#32523) 2024-08-08 10:54:02 +01:00
model_sharing.md
multilingual.md
peft.md Docs / Quantization: Replace all occurences of load_in_8bit with bnb config (#31136) 2024-05-30 16:47:35 +02:00
perf_hardware.md
perf_torch_compile.md docs: fix style (#31340) 2024-06-10 09:53:25 +01:00
performance.md
philosophy.md translate philosophy.md to chinese (#32177) 2024-07-25 09:01:06 -07:00
pipeline_tutorial.md
preprocessing.md
quicktour.md Trainer - deprecate tokenizer for processing_class (#32385) 2024-10-02 14:08:46 +01:00
run_scripts.md Fix broken link to Transformers notebooks (#30512) 2024-04-29 10:57:51 +01:00
serialization.md
task_summary.md docs/zh: fix style (#31334) 2024-06-10 11:40:40 +01:00
tf_xla.md
tflite.md
tokenizer_summary.md
torchscript.md
training.md
transformers_agents.md