| .. |
|
internal
|
[whisper] static kv cache (#31166)
|
2024-07-02 13:24:15 +01:00 |
|
main_classes
|
Allow FP16 or other precision inference for Pipelines (#31342)
|
2024-07-05 17:21:50 +01:00 |
|
model_doc
|
Add FA2 and sdpa support for SigLIP (#31499)
|
2024-07-08 11:10:02 +01:00 |
|
quantization
|
Docs / AQLM: Clarify torch.compile support for AQLM (#31473)
|
2024-06-19 11:26:25 +02:00 |
|
tasks
|
Update mask_generation.md (#31543)
|
2024-06-23 20:27:21 +01:00 |
|
_config.py
|
|
|
|
_redirects.yml
|
Docs / Quantization: Redirect deleted page (#31063)
|
2024-05-28 18:29:22 +02:00 |
|
_toctree.yml
|
Add ZoeDepth (#30136)
|
2024-07-08 11:43:33 +02:00 |
|
accelerate.md
|
|
|
|
add_new_model.md
|
|
|
|
add_new_pipeline.md
|
|
|
|
agents.md
|
Adds final answer tool for all agents (#31703)
|
2024-07-03 11:36:09 +02:00 |
|
attention.md
|
|
|
|
autoclass_tutorial.md
|
|
|
|
benchmarks.md
|
|
|
|
bertology.md
|
|
|
|
big_models.md
|
|
|
|
chat_templating.md
|
Repeating an important warning in the chat template docs (#31796)
|
2024-07-05 15:30:24 +01:00 |
|
community.md
|
|
|
|
contributing.md
|
|
|
|
conversations.md
|
|
|
|
create_a_model.md
|
Enable HF pretrained backbones (#31145)
|
2024-06-06 22:02:38 +01:00 |
|
custom_models.md
|
|
|
|
debugging.md
|
|
|
|
deepspeed.md
|
Fix typos (#31819)
|
2024-07-08 11:52:47 +01:00 |
|
fast_tokenizers.md
|
|
|
|
fsdp.md
|
|
|
|
generation_strategies.md
|
Quantized KV Cache (#30483)
|
2024-05-23 17:25:20 +05:00 |
|
gguf.md
|
Add Qwen2 GGUF loading support (#31175)
|
2024-06-03 14:55:10 +01:00 |
|
glossary.md
|
Fix typos (#31819)
|
2024-07-08 11:52:47 +01:00 |
|
hpo_train.md
|
|
|
|
index.md
|
Add ZoeDepth (#30136)
|
2024-07-08 11:43:33 +02:00 |
|
installation.md
|
Use HF_HUB_OFFLINE + fix has_file in offline mode (#31016)
|
2024-05-29 11:55:43 +01:00 |
|
llm_optims.md
|
Add torch.compile for Mistral (#30642)
|
2024-05-20 16:27:24 +02:00 |
|
llm_tutorial.md
|
|
|
|
llm_tutorial_optimization.md
|
Fix typos (#31819)
|
2024-07-08 11:52:47 +01:00 |
|
model_memory_anatomy.md
|
|
|
|
model_sharing.md
|
|
|
|
model_summary.md
|
|
|
|
multilingual.md
|
|
|
|
notebooks.md
|
|
|
|
pad_truncation.md
|
|
|
|
peft.md
|
Docs / Quantization: Replace all occurences of load_in_8bit with bnb config (#31136)
|
2024-05-30 16:47:35 +02:00 |
|
perf_hardware.md
|
Fix typos (#31819)
|
2024-07-08 11:52:47 +01:00 |
|
perf_infer_cpu.md
|
|
|
|
perf_infer_gpu_one.md
|
Add FA2 and sdpa support for SigLIP (#31499)
|
2024-07-08 11:10:02 +01:00 |
|
perf_torch_compile.md
|
docs: fix style (#31340)
|
2024-06-10 09:53:25 +01:00 |
|
perf_train_cpu.md
|
|
|
|
perf_train_cpu_many.md
|
|
|
|
perf_train_gpu_many.md
|
Update perf_train_gpu_many.md (#31451)
|
2024-06-18 11:00:26 -07:00 |
|
perf_train_gpu_one.md
|
Add torch_empty_cache_steps to TrainingArguments (#31546)
|
2024-07-04 13:20:49 -04:00 |
|
perf_train_special.md
|
|
|
|
perf_train_tpu_tf.md
|
|
|
|
performance.md
|
|
|
|
perplexity.md
|
|
|
|
philosophy.md
|
|
|
|
pipeline_tutorial.md
|
Allow FP16 or other precision inference for Pipelines (#31342)
|
2024-07-05 17:21:50 +01:00 |
|
pipeline_webserver.md
|
|
|
|
pr_checks.md
|
|
|
|
preprocessing.md
|
|
|
|
quicktour.md
|
docs: fix broken link (#31370)
|
2024-06-12 11:33:00 +01:00 |
|
run_scripts.md
|
|
|
|
sagemaker.md
|
|
|
|
serialization.md
|
|
|
|
task_summary.md
|
|
|
|
tasks_explained.md
|
|
|
|
testing.md
|
Skip tests properly (#31308)
|
2024-06-26 21:59:08 +01:00 |
|
tf_xla.md
|
|
|
|
tflite.md
|
|
|
|
tokenizer_summary.md
|
[docs] Spanish translation of tokenizer_summary.md (#31154)
|
2024-06-03 16:52:23 -07:00 |
|
torchscript.md
|
|
|
|
trainer.md
|
FEAT / Trainer: LOMO optimizer support (#30178)
|
2024-05-21 10:16:37 +02:00 |
|
training.md
|
Added the necessay import of module (#30804)
|
2024-05-14 18:45:06 +01:00 |
|
troubleshooting.md
|
|
|