| .. |
|
internal
|
Add a static cache that offloads to the CPU or other device (#32161)
|
2024-08-29 11:51:09 +02:00 |
|
main_classes
|
Create local Transformers Engine (#33218)
|
2024-08-30 18:22:27 +02:00 |
|
model_doc
|
Granite language models (#31502)
|
2024-08-27 21:27:21 +02:00 |
|
quantization
|
Add TorchAOHfQuantizer (#32306)
|
2024-08-14 16:14:24 +02:00 |
|
tasks
|
Add video text to text docs (#33164)
|
2024-09-01 12:06:31 +03:00 |
|
_config.py
|
|
|
|
_redirects.yml
|
Docs / Quantization: Redirect deleted page (#31063)
|
2024-05-28 18:29:22 +02:00 |
|
_toctree.yml
|
Add video text to text docs (#33164)
|
2024-09-01 12:06:31 +03:00 |
|
accelerate.md
|
|
|
|
add_new_model.md
|
|
|
|
add_new_pipeline.md
|
|
|
|
agents.md
|
Add duckduckgo search tool (#32882)
|
2024-09-02 09:56:20 +02:00 |
|
attention.md
|
|
|
|
autoclass_tutorial.md
|
|
|
|
benchmarks.md
|
|
|
|
bertology.md
|
|
|
|
big_models.md
|
|
|
|
chat_templating.md
|
Update Jinja docs with new functions and general cleanup (#33097)
|
2024-08-23 17:40:06 +01:00 |
|
community.md
|
|
|
|
contributing.md
|
|
|
|
conversations.md
|
[docs] change temperature to a positive value (#32077)
|
2024-07-23 17:47:51 +01:00 |
|
create_a_model.md
|
Enable HF pretrained backbones (#31145)
|
2024-06-06 22:02:38 +01:00 |
|
custom_models.md
|
Updated the custom_models.md changed cross_entropy code (#33118)
|
2024-08-26 13:15:43 +02:00 |
|
debugging.md
|
|
|
|
deepspeed.md
|
Fix typos (#31819)
|
2024-07-08 11:52:47 +01:00 |
|
fast_tokenizers.md
|
|
|
|
fsdp.md
|
|
|
|
generation_strategies.md
|
Docs: alert for the possibility of manipulating logits (#32467)
|
2024-08-07 16:34:46 +01:00 |
|
gguf.md
|
Add Qwen2 GGUF loading support (#31175)
|
2024-06-03 14:55:10 +01:00 |
|
glossary.md
|
Fix typos (#31819)
|
2024-07-08 11:52:47 +01:00 |
|
hpo_train.md
|
|
|
|
index.md
|
Granite language models (#31502)
|
2024-08-27 21:27:21 +02:00 |
|
installation.md
|
Use HF_HUB_OFFLINE + fix has_file in offline mode (#31016)
|
2024-05-29 11:55:43 +01:00 |
|
kv_cache.md
|
Add a static cache that offloads to the CPU or other device (#32161)
|
2024-08-29 11:51:09 +02:00 |
|
llm_optims.md
|
Cache: use batch_size instead of max_batch_size (#32657)
|
2024-08-16 11:48:45 +01:00 |
|
llm_tutorial.md
|
Add SynCode to llm_tutorial (#32884)
|
2024-08-22 15:30:22 +02:00 |
|
llm_tutorial_optimization.md
|
Fix typos (#31819)
|
2024-07-08 11:52:47 +01:00 |
|
model_memory_anatomy.md
|
Very small change to one of the function parameters (#32548)
|
2024-08-27 09:29:05 -07:00 |
|
model_sharing.md
|
Docs: formatting nits (#32247)
|
2024-07-30 15:49:14 +01:00 |
|
model_summary.md
|
|
|
|
multilingual.md
|
|
|
|
notebooks.md
|
|
|
|
pad_truncation.md
|
|
|
|
peft.md
|
Docs / Quantization: Replace all occurences of load_in_8bit with bnb config (#31136)
|
2024-05-30 16:47:35 +02:00 |
|
perf_hardware.md
|
Fix typos (#31819)
|
2024-07-08 11:52:47 +01:00 |
|
perf_infer_cpu.md
|
|
|
|
perf_infer_gpu_one.md
|
[RoBERTa-based] Add support for sdpa (#30510)
|
2024-08-28 10:26:00 +02:00 |
|
perf_torch_compile.md
|
fix(docs): Fixed a link in docs (#32274)
|
2024-07-29 10:50:43 +01:00 |
|
perf_train_cpu.md
|
|
|
|
perf_train_cpu_many.md
|
Update the distributed CPU training on Kubernetes documentation (#32669)
|
2024-08-14 09:36:43 -07:00 |
|
perf_train_gpu_many.md
|
Update perf_train_gpu_many.md (#31451)
|
2024-06-18 11:00:26 -07:00 |
|
perf_train_gpu_one.md
|
Add torch_empty_cache_steps to TrainingArguments (#31546)
|
2024-07-04 13:20:49 -04:00 |
|
perf_train_special.md
|
|
|
|
perf_train_tpu_tf.md
|
|
|
|
performance.md
|
|
|
|
perplexity.md
|
|
|
|
philosophy.md
|
|
|
|
pipeline_tutorial.md
|
Docs: Fixed whisper-large-v2 model link in docs (#32871)
|
2024-08-19 09:50:35 -07:00 |
|
pipeline_webserver.md
|
|
|
|
pr_checks.md
|
|
|
|
preprocessing.md
|
chore: remove duplicate words (#31853)
|
2024-07-09 10:38:29 +01:00 |
|
quicktour.md
|
docs: fix broken link (#31370)
|
2024-06-12 11:33:00 +01:00 |
|
run_scripts.md
|
|
|
|
sagemaker.md
|
|
|
|
serialization.md
|
|
|
|
task_summary.md
|
|
|
|
tasks_explained.md
|
|
|
|
testing.md
|
Docs: Fixed WhisperModel.forward’s docstring link (#32498)
|
2024-08-07 11:01:33 -07:00 |
|
tf_xla.md
|
fix(docs): Fixed a link in docs (#32274)
|
2024-07-29 10:50:43 +01:00 |
|
tflite.md
|
|
|
|
tokenizer_summary.md
|
[docs] Spanish translation of tokenizer_summary.md (#31154)
|
2024-06-03 16:52:23 -07:00 |
|
torchscript.md
|
|
|
|
trainer.md
|
Integrate Liger (Linkedin GPU Efficient Runtime) Kernel to Trainer (#32860)
|
2024-08-23 13:20:49 +02:00 |
|
training.md
|
|
|
|
troubleshooting.md
|
|
|