transformers/docs/source/en
Billy Cao ac26260436
Allow FP16 or other precision inference for Pipelines (#31342)
* cast image features to model.dtype where needed to support FP16 or other precision in pipelines

* Update src/transformers/pipelines/image_feature_extraction.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Use .to instead

* Add FP16 pipeline support for zeroshot audio classification

* Remove unused torch imports

* Add docs on FP16 pipeline

* Remove unused import

* Add FP16 tests to pipeline mixin

* Add fp16 placeholder for mask_generation pipeline test

* Add FP16 tests for all pipelines

* Fix formatting

* Remove torch_dtype arg from is_pipeline_test_to_skip*

* Fix format

* trigger ci

---------

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2024-07-05 17:21:50 +01:00
internal [whisper] static kv cache (#31166) 2024-07-02 13:24:15 +01:00
main_classes Allow FP16 or other precision inference for Pipelines (#31342) 2024-07-05 17:21:50 +01:00
model_doc Add training support for SigLIP (#31495) 2024-07-05 14:50:39 +01:00
quantization Docs / AQLM: Clarify torch.compile support for AQLM (#31473) 2024-06-19 11:26:25 +02:00
tasks Update mask_generation.md (#31543) 2024-06-23 20:27:21 +01:00
_config.py
_redirects.yml Docs / Quantization: Redirect deleted page (#31063) 2024-05-28 18:29:22 +02:00
_toctree.yml Fix post gemma merge (#31660) 2024-06-27 17:51:42 +02:00
accelerate.md
add_new_model.md
add_new_pipeline.md
agents.md Adds final answer tool for all agents (#31703) 2024-07-03 11:36:09 +02:00
attention.md
autoclass_tutorial.md
benchmarks.md
bertology.md
big_models.md
chat_templating.md Repeating an important warning in the chat template docs (#31796) 2024-07-05 15:30:24 +01:00
community.md
contributing.md
conversations.md Add sidebar tutorial for chat models (#30401) 2024-04-25 19:38:48 +01:00
create_a_model.md Enable HF pretrained backbones (#31145) 2024-06-06 22:02:38 +01:00
custom_models.md
debugging.md
deepspeed.md
fast_tokenizers.md
fsdp.md
generation_strategies.md Quantized KV Cache (#30483) 2024-05-23 17:25:20 +05:00
gguf.md Add Qwen2 GGUF loading support (#31175) 2024-06-03 14:55:10 +01:00
glossary.md
hpo_train.md
index.md Add gemma 2 (#31659) 2024-06-27 17:36:19 +02:00
installation.md Use HF_HUB_OFFLINE + fix has_file in offline mode (#31016) 2024-05-29 11:55:43 +01:00
llm_optims.md Add torch.compile for Mistral (#30642) 2024-05-20 16:27:24 +02:00
llm_tutorial.md Generate: update links on LLM tutorial doc (#30550) 2024-04-30 18:14:12 +01:00
llm_tutorial_optimization.md
model_memory_anatomy.md
model_sharing.md
model_summary.md
multilingual.md
notebooks.md
pad_truncation.md
peft.md Docs / Quantization: Replace all occurences of load_in_8bit with bnb config (#31136) 2024-05-30 16:47:35 +02:00
perf_hardware.md
perf_infer_cpu.md
perf_infer_gpu_one.md Fix post gemma merge (#31660) 2024-06-27 17:51:42 +02:00
perf_torch_compile.md docs: fix style (#31340) 2024-06-10 09:53:25 +01:00
perf_train_cpu.md
perf_train_cpu_many.md
perf_train_gpu_many.md Update perf_train_gpu_many.md (#31451) 2024-06-18 11:00:26 -07:00
perf_train_gpu_one.md Add torch_empty_cache_steps to TrainingArguments (#31546) 2024-07-04 13:20:49 -04:00
perf_train_special.md
perf_train_tpu_tf.md
performance.md
perplexity.md
philosophy.md
pipeline_tutorial.md Allow FP16 or other precision inference for Pipelines (#31342) 2024-07-05 17:21:50 +01:00
pipeline_webserver.md
pr_checks.md
preprocessing.md
quicktour.md docs: fix broken link (#31370) 2024-06-12 11:33:00 +01:00
run_scripts.md Fix broken link to Transformers notebooks (#30512) 2024-04-29 10:57:51 +01:00
sagemaker.md
serialization.md
task_summary.md
tasks_explained.md
testing.md Skip tests properly (#31308) 2024-06-26 21:59:08 +01:00
tf_xla.md
tflite.md
tokenizer_summary.md [docs] Spanish translation of tokenizer_summary.md (#31154) 2024-06-03 16:52:23 -07:00
torchscript.md
trainer.md FEAT / Trainer: LOMO optimizer support (#30178) 2024-05-21 10:16:37 +02:00
training.md Added the necessay import of module (#30804) 2024-05-14 18:45:06 +01:00
troubleshooting.md