| .. |
|
internal
|
Quantized KV Cache (#30483)
|
2024-05-23 17:25:20 +05:00 |
|
main_classes
|
Fast image processor (#28847)
|
2024-06-11 15:47:38 +01:00 |
|
model_doc
|
[GPT2] Add SDPA support (#31172)
|
2024-06-19 09:40:57 +02:00 |
|
quantization
|
Docs / AQLM: Clarify torch.compile support for AQLM (#31473)
|
2024-06-19 11:26:25 +02:00 |
|
tasks
|
Decorators for deprecation and named arguments validation (#30799)
|
2024-06-10 12:35:10 +01:00 |
|
_config.py
|
|
|
|
_redirects.yml
|
Docs / Quantization: Redirect deleted page (#31063)
|
2024-05-28 18:29:22 +02:00 |
|
_toctree.yml
|
Docs / Quantization: refactor quantization documentation (#30942)
|
2024-05-23 14:31:52 +02:00 |
|
accelerate.md
|
|
|
|
add_new_model.md
|
Remove add-new-model in favor of add-new-model-like (#30424)
|
2024-04-24 09:38:18 +02:00 |
|
add_new_pipeline.md
|
|
|
|
agents.md
|
Add streaming, various fixes (#30838)
|
2024-05-31 14:16:23 +02:00 |
|
attention.md
|
|
|
|
autoclass_tutorial.md
|
|
|
|
benchmarks.md
|
|
|
|
bertology.md
|
|
|
|
big_models.md
|
|
|
|
chat_templating.md
|
Update chat template docs and bump Jinja version (#31455)
|
2024-06-18 14:16:30 +01:00 |
|
community.md
|
|
|
|
contributing.md
|
|
|
|
conversations.md
|
Add sidebar tutorial for chat models (#30401)
|
2024-04-25 19:38:48 +01:00 |
|
create_a_model.md
|
Enable HF pretrained backbones (#31145)
|
2024-06-06 22:02:38 +01:00 |
|
custom_models.md
|
|
|
|
debugging.md
|
|
|
|
deepspeed.md
|
Rename torch.run to torchrun (#30405)
|
2024-04-23 09:04:17 -07:00 |
|
fast_tokenizers.md
|
|
|
|
fsdp.md
|
|
|
|
generation_strategies.md
|
Quantized KV Cache (#30483)
|
2024-05-23 17:25:20 +05:00 |
|
gguf.md
|
Add Qwen2 GGUF loading support (#31175)
|
2024-06-03 14:55:10 +01:00 |
|
glossary.md
|
|
|
|
hpo_train.md
|
|
|
|
index.md
|
[Port] TensorFlow implementation of Mistral (#29708)
|
2024-05-23 17:48:49 +01:00 |
|
installation.md
|
Use HF_HUB_OFFLINE + fix has_file in offline mode (#31016)
|
2024-05-29 11:55:43 +01:00 |
|
llm_optims.md
|
Add torch.compile for Mistral (#30642)
|
2024-05-20 16:27:24 +02:00 |
|
llm_tutorial.md
|
Generate: update links on LLM tutorial doc (#30550)
|
2024-04-30 18:14:12 +01:00 |
|
llm_tutorial_optimization.md
|
|
|
|
model_memory_anatomy.md
|
|
|
|
model_sharing.md
|
|
|
|
model_summary.md
|
|
|
|
multilingual.md
|
|
|
|
notebooks.md
|
|
|
|
pad_truncation.md
|
|
|
|
peft.md
|
Docs / Quantization: Replace all occurences of load_in_8bit with bnb config (#31136)
|
2024-05-30 16:47:35 +02:00 |
|
perf_hardware.md
|
|
|
|
perf_infer_cpu.md
|
|
|
|
perf_infer_gpu_one.md
|
[GPT2] Add SDPA support (#31172)
|
2024-06-19 09:40:57 +02:00 |
|
perf_torch_compile.md
|
docs: fix style (#31340)
|
2024-06-10 09:53:25 +01:00 |
|
perf_train_cpu.md
|
|
|
|
perf_train_cpu_many.md
|
|
|
|
perf_train_gpu_many.md
|
Update perf_train_gpu_many.md (#31451)
|
2024-06-18 11:00:26 -07:00 |
|
perf_train_gpu_one.md
|
|
|
|
perf_train_special.md
|
|
|
|
perf_train_tpu_tf.md
|
|
|
|
performance.md
|
|
|
|
perplexity.md
|
|
|
|
philosophy.md
|
|
|
|
pipeline_tutorial.md
|
|
|
|
pipeline_webserver.md
|
|
|
|
pr_checks.md
|
|
|
|
preprocessing.md
|
|
|
|
quicktour.md
|
docs: fix broken link (#31370)
|
2024-06-12 11:33:00 +01:00 |
|
run_scripts.md
|
Fix broken link to Transformers notebooks (#30512)
|
2024-04-29 10:57:51 +01:00 |
|
sagemaker.md
|
|
|
|
serialization.md
|
|
|
|
task_summary.md
|
|
|
|
tasks_explained.md
|
|
|
|
testing.md
|
|
|
|
tf_xla.md
|
|
|
|
tflite.md
|
|
|
|
tokenizer_summary.md
|
[docs] Spanish translation of tokenizer_summary.md (#31154)
|
2024-06-03 16:52:23 -07:00 |
|
torchscript.md
|
|
|
|
trainer.md
|
FEAT / Trainer: LOMO optimizer support (#30178)
|
2024-05-21 10:16:37 +02:00 |
|
training.md
|
Added the necessay import of module (#30804)
|
2024-05-14 18:45:06 +01:00 |
|
troubleshooting.md
|
|
|