transformers/docs/source/en
Aymeric Roucher 1ca9ff5c91
Add duckduckgo search tool (#32882)
* Add duckduckgo search tool
2024-09-02 09:56:20 +02:00
..
internal Add a static cache that offloads to the CPU or other device (#32161) 2024-08-29 11:51:09 +02:00
main_classes Create local Transformers Engine (#33218) 2024-08-30 18:22:27 +02:00
model_doc Granite language models (#31502) 2024-08-27 21:27:21 +02:00
quantization Add TorchAOHfQuantizer (#32306) 2024-08-14 16:14:24 +02:00
tasks Add video text to text docs (#33164) 2024-09-01 12:06:31 +03:00
_config.py
_redirects.yml Docs / Quantization: Redirect deleted page (#31063) 2024-05-28 18:29:22 +02:00
_toctree.yml Add video text to text docs (#33164) 2024-09-01 12:06:31 +03:00
accelerate.md
add_new_model.md
add_new_pipeline.md
agents.md Add duckduckgo search tool (#32882) 2024-09-02 09:56:20 +02:00
attention.md
autoclass_tutorial.md
benchmarks.md
bertology.md
big_models.md
chat_templating.md Update Jinja docs with new functions and general cleanup (#33097) 2024-08-23 17:40:06 +01:00
community.md
contributing.md
conversations.md [docs] change temperature to a positive value (#32077) 2024-07-23 17:47:51 +01:00
create_a_model.md Enable HF pretrained backbones (#31145) 2024-06-06 22:02:38 +01:00
custom_models.md Updated the custom_models.md changed cross_entropy code (#33118) 2024-08-26 13:15:43 +02:00
debugging.md
deepspeed.md Fix typos (#31819) 2024-07-08 11:52:47 +01:00
fast_tokenizers.md
fsdp.md
generation_strategies.md Docs: alert for the possibility of manipulating logits (#32467) 2024-08-07 16:34:46 +01:00
gguf.md Add Qwen2 GGUF loading support (#31175) 2024-06-03 14:55:10 +01:00
glossary.md Fix typos (#31819) 2024-07-08 11:52:47 +01:00
hpo_train.md
index.md Granite language models (#31502) 2024-08-27 21:27:21 +02:00
installation.md Use HF_HUB_OFFLINE + fix has_file in offline mode (#31016) 2024-05-29 11:55:43 +01:00
kv_cache.md Add a static cache that offloads to the CPU or other device (#32161) 2024-08-29 11:51:09 +02:00
llm_optims.md Cache: use batch_size instead of max_batch_size (#32657) 2024-08-16 11:48:45 +01:00
llm_tutorial.md Add SynCode to llm_tutorial (#32884) 2024-08-22 15:30:22 +02:00
llm_tutorial_optimization.md Fix typos (#31819) 2024-07-08 11:52:47 +01:00
model_memory_anatomy.md Very small change to one of the function parameters (#32548) 2024-08-27 09:29:05 -07:00
model_sharing.md Docs: formatting nits (#32247) 2024-07-30 15:49:14 +01:00
model_summary.md
multilingual.md
notebooks.md
pad_truncation.md
peft.md Docs / Quantization: Replace all occurences of load_in_8bit with bnb config (#31136) 2024-05-30 16:47:35 +02:00
perf_hardware.md Fix typos (#31819) 2024-07-08 11:52:47 +01:00
perf_infer_cpu.md
perf_infer_gpu_one.md [RoBERTa-based] Add support for sdpa (#30510) 2024-08-28 10:26:00 +02:00
perf_torch_compile.md fix(docs): Fixed a link in docs (#32274) 2024-07-29 10:50:43 +01:00
perf_train_cpu.md
perf_train_cpu_many.md Update the distributed CPU training on Kubernetes documentation (#32669) 2024-08-14 09:36:43 -07:00
perf_train_gpu_many.md Update perf_train_gpu_many.md (#31451) 2024-06-18 11:00:26 -07:00
perf_train_gpu_one.md Add torch_empty_cache_steps to TrainingArguments (#31546) 2024-07-04 13:20:49 -04:00
perf_train_special.md
perf_train_tpu_tf.md
performance.md
perplexity.md
philosophy.md
pipeline_tutorial.md Docs: Fixed whisper-large-v2 model link in docs (#32871) 2024-08-19 09:50:35 -07:00
pipeline_webserver.md
pr_checks.md
preprocessing.md chore: remove duplicate words (#31853) 2024-07-09 10:38:29 +01:00
quicktour.md docs: fix broken link (#31370) 2024-06-12 11:33:00 +01:00
run_scripts.md
sagemaker.md
serialization.md
task_summary.md
tasks_explained.md
testing.md Docs: Fixed WhisperModel.forward’s docstring link (#32498) 2024-08-07 11:01:33 -07:00
tf_xla.md fix(docs): Fixed a link in docs (#32274) 2024-07-29 10:50:43 +01:00
tflite.md
tokenizer_summary.md [docs] Spanish translation of tokenizer_summary.md (#31154) 2024-06-03 16:52:23 -07:00
torchscript.md
trainer.md Integrate Liger (Linkedin GPU Efficient Runtime) Kernel to Trainer (#32860) 2024-08-23 13:20:49 +02:00
training.md
troubleshooting.md