| .. |
|
internal
|
Implement AsyncTextIteratorStreamer for asynchronous streaming (#34931)
|
2024-12-20 12:08:12 +01:00 |
|
main_classes
|
HIGGS Quantization Support (#34997)
|
2024-12-23 16:54:49 +01:00 |
|
model_doc
|
[doctest] Fixes (#35863)
|
2025-01-26 15:26:38 -08:00 |
|
quantization
|
Enable gptqmodel (#35012)
|
2025-01-15 14:22:49 +01:00 |
|
tasks
|
[doctest] Fixes (#35863)
|
2025-01-26 15:26:38 -08:00 |
|
_config.py
|
Add optimized PixtralImageProcessorFast (#34836)
|
2024-11-28 16:04:05 +01:00 |
|
_redirects.yml
|
|
|
|
_toctree.yml
|
Granite Vision Support (#35579)
|
2025-01-23 17:15:52 +01:00 |
|
accelerate.md
|
|
|
|
add_new_model.md
|
|
|
|
add_new_pipeline.md
|
[docs] Follow up register_pipeline (#35310)
|
2024-12-20 09:22:44 -08:00 |
|
agents.md
|
Multiple typo fixes in Tutorials docs (#35035)
|
2024-12-02 15:26:34 +00:00 |
|
agents_advanced.md
|
[doctest] Fixes (#35863)
|
2025-01-26 15:26:38 -08:00 |
|
attention.md
|
|
|
|
autoclass_tutorial.md
|
[docs] Increase visibility of torch_dtype="auto" (#35067)
|
2024-12-04 09:18:44 -08:00 |
|
bertology.md
|
|
|
|
big_models.md
|
|
|
|
chat_templating.md
|
[doctest] Fixes (#35863)
|
2025-01-26 15:26:38 -08:00 |
|
community.md
|
|
|
|
contributing.md
|
|
|
|
conversations.md
|
|
|
|
create_a_model.md
|
|
|
|
custom_models.md
|
|
|
|
debugging.md
|
|
|
|
deepspeed.md
|
[doc] deepspeed universal checkpoint (#35015)
|
2025-01-09 09:50:51 -08:00 |
|
fast_tokenizers.md
|
|
|
|
fsdp.md
|
Fix docs typos. (#35465)
|
2025-01-02 11:29:46 +01:00 |
|
generation_strategies.md
|
[doctest] Fixes (#35863)
|
2025-01-26 15:26:38 -08:00 |
|
gguf.md
|
Add Gemma2 GGUF support (#34002)
|
2025-01-03 14:50:07 +01:00 |
|
glossary.md
|
|
|
|
how_to_hack_models.md
|
|
|
|
hpo_train.md
|
|
|
|
index.md
|
add qwen2.5vl (#35569)
|
2025-01-23 11:23:00 +01:00 |
|
installation.md
|
Enhanced Installation Section in README.md (#35094)
|
2025-01-14 08:05:08 -08:00 |
|
kv_cache.md
|
[doctest] Fixes (#35863)
|
2025-01-26 15:26:38 -08:00 |
|
llm_optims.md
|
Update llm_optims docs for sdpa_kernel (#35481)
|
2025-01-06 08:54:31 -08:00 |
|
llm_tutorial.md
|
[chat] docs fix (#35840)
|
2025-01-22 14:32:27 +00:00 |
|
llm_tutorial_optimization.md
|
[docs] add explanation to release_memory() (#34911)
|
2024-11-27 07:47:28 -08:00 |
|
model_memory_anatomy.md
|
|
|
|
model_sharing.md
|
[docs] update not-working model revision (#34682)
|
2024-11-11 07:09:31 -08:00 |
|
model_summary.md
|
|
|
|
modular_transformers.md
|
Improve modular documentation (#35737)
|
2025-01-21 17:53:30 +01:00 |
|
multilingual.md
|
|
|
|
notebooks.md
|
|
|
|
pad_truncation.md
|
|
|
|
peft.md
|
|
|
|
perf_hardware.md
|
|
|
|
perf_infer_cpu.md
|
[docs] Increase visibility of torch_dtype="auto" (#35067)
|
2024-12-04 09:18:44 -08:00 |
|
perf_infer_gpu_multi.md
|
Fix image preview in multi-GPU inference docs (#35303)
|
2024-12-17 09:33:50 -08:00 |
|
perf_infer_gpu_one.md
|
add qwen2.5vl (#35569)
|
2025-01-23 11:23:00 +01:00 |
|
perf_torch_compile.md
|
[docs] use device-agnostic instead of cuda (#35047)
|
2024-12-03 10:53:45 -08:00 |
|
perf_train_cpu.md
|
[doc] use full path for run_qa.py (#34914)
|
2024-11-26 09:23:44 -08:00 |
|
perf_train_cpu_many.md
|
[doc] use full path for run_qa.py (#34914)
|
2024-11-26 09:23:44 -08:00 |
|
perf_train_gpu_many.md
|
Multiple typo fixes in Tutorials docs (#35035)
|
2024-12-02 15:26:34 +00:00 |
|
perf_train_gpu_one.md
|
|
|
|
perf_train_special.md
|
|
|
|
perf_train_tpu_tf.md
|
|
|
|
performance.md
|
Simplify Tensor Parallel implementation with PyTorch TP (#34184)
|
2024-11-18 19:51:49 +01:00 |
|
perplexity.md
|
[docs] use device-agnostic API instead of cuda (#34913)
|
2024-11-26 09:23:34 -08:00 |
|
philosophy.md
|
|
|
|
pipeline_tutorial.md
|
[docs] Increase visibility of torch_dtype="auto" (#35067)
|
2024-12-04 09:18:44 -08:00 |
|
pipeline_webserver.md
|
|
|
|
pr_checks.md
|
|
|
|
preprocessing.md
|
|
|
|
quicktour.md
|
[chat] docs fix (#35840)
|
2025-01-22 14:32:27 +00:00 |
|
run_scripts.md
|
|
|
|
sagemaker.md
|
|
|
|
serialization.md
|
|
|
|
task_summary.md
|
[doctest] Fixes (#35863)
|
2025-01-26 15:26:38 -08:00 |
|
tasks_explained.md
|
fix: Wrong task mentioned in docs (#34757)
|
2024-11-18 18:42:28 +00:00 |
|
testing.md
|
[tests] add XPU part to testing (#34778)
|
2024-11-18 09:59:11 -08:00 |
|
tf_xla.md
|
|
|
|
tflite.md
|
|
|
|
tiktoken.md
|
Updated documentation and added conversion utility (#34319)
|
2024-11-25 18:44:09 +01:00 |
|
tokenizer_summary.md
|
|
|
|
torchscript.md
|
|
|
|
trainer.md
|
Fix callback key name (#34762)
|
2024-11-18 18:41:12 +00:00 |
|
training.md
|
[docs] Increase visibility of torch_dtype="auto" (#35067)
|
2024-12-04 09:18:44 -08:00 |
|
troubleshooting.md
|
|
|