| .. |
|
agents
|
Agents use grammar (#31735)
|
2024-08-07 11:42:52 +02:00 |
|
benchmark
|
|
|
|
bettertransformer
|
|
|
|
deepspeed
|
Yell at the user if zero-3 init wasn't performed, but expected to have been done (#32299)
|
2024-08-01 15:18:43 -04:00 |
|
extended
|
Skip tests properly (#31308)
|
2024-06-26 21:59:08 +01:00 |
|
fixtures
|
|
|
|
fsdp
|
Llama et al. / FSDP : Fix breaking change in 4.40 for FSDP (#31161)
|
2024-06-26 14:50:08 +01:00 |
|
generation
|
Cache: new Cache format in decoder-only models (#31421)
|
2024-08-07 10:02:16 +05:00 |
|
models
|
Add codestral mamba2 (#32080)
|
2024-08-06 16:39:52 +02:00 |
|
optimization
|
fix: Fixed the 1st argument name in classmethods (#31907)
|
2024-07-11 12:11:50 +01:00 |
|
peft_integration
|
|
|
|
pipelines
|
[pipeline] fix padding for 1-d tensors (#31776)
|
2024-07-29 21:24:42 +08:00 |
|
quantization
|
Support dequantizing GGUF FP16 format (#31783)
|
2024-07-24 17:59:59 +02:00 |
|
repo_utils
|
|
|
|
sagemaker
|
Fixed log messages that are resulting in TypeError due to too many arguments (#32017)
|
2024-07-17 10:56:44 +01:00 |
|
tokenization
|
#32184 save total_vocab_size (#32240)
|
2024-08-05 09:22:48 +02:00 |
|
trainer
|
Enhancing SFT Training Efficiency Using Packing and FlashAttention2 with Position IDs (#31629)
|
2024-07-23 15:56:41 +02:00 |
|
utils
|
Respect the config's attn_implementation if set (#32383)
|
2024-08-05 16:33:19 +01:00 |
|
__init__.py
|
|
|
|
test_backbone_common.py
|
|
|
|
test_configuration_common.py
|
Refactor: Removed un-necessary object base class (#32230)
|
2024-07-26 10:33:02 +02:00 |
|
test_feature_extraction_common.py
|
|
|
|
test_image_processing_common.py
|
Update kwargs validation for preprocess with decorator (#32024)
|
2024-08-06 11:33:05 +01:00 |
|
test_image_transforms.py
|
fix: center_crop occasionally outputs off-by-one dimension matrix (#30934)
|
2024-05-21 13:56:52 +01:00 |
|
test_modeling_common.py
|
Cache: new Cache format in decoder-only models (#31421)
|
2024-08-07 10:02:16 +05:00 |
|
test_modeling_flax_common.py
|
add sdpa to ViT [follow up of #29325] (#30555)
|
2024-05-16 10:56:11 +01:00 |
|
test_modeling_tf_common.py
|
Port IDEFICS to tensorflow (#26870)
|
2024-05-13 15:59:46 +01:00 |
|
test_pipeline_mixin.py
|
fix: Fixed raising TypeError instead of ValueError for invalid type (#32111)
|
2024-07-22 17:46:17 +01:00 |
|
test_processing_common.py
|
add initial design for uniform processors + align model (#31197)
|
2024-06-13 16:27:16 +02:00 |
|
test_sequence_feature_extraction_common.py
|
|
|
|
test_tokenization_common.py
|
Fix conflicting key in init kwargs in PreTrainedTokenizerBase (#31233)
|
2024-08-01 14:32:13 +02:00 |