|
deepspeed
|
DeepSpeed github repo move sync (#36021)
|
2025-02-05 08:19:31 -08:00 |
|
optimization
|
Support constant lr with cooldown (#35453)
|
2025-02-10 13:21:55 +01:00 |
|
quantization
|
Fix words typos in ggml test. (#36060)
|
2025-02-06 15:32:40 +00:00 |
|
tp
|
Update-tp test (#35844)
|
2025-02-03 09:37:02 +01:00 |
|
trainer
|
layernorm_decay_fix (#35927)
|
2025-02-04 11:01:49 +01:00 |
|
test_modeling_common.py
|
Fix model kwargs (#35875)
|
2025-02-06 11:35:25 -05:00 |