Mirror of https://github.com/saymrwulf/transformers.git (synced 2026-05-14 20:58:08 +00:00)
* fix GA bugs and add unit test
* narrow down model loss unit test diff gap
* format code to make ruff happy
* send num_items_in_batch argument to decoder
* fix GA loss bug in BertLMHeadModel
* use TinyStories-33M to narrow down diff gap
* format code
* missing .config
* avoid adding extra args

---------

Co-authored-by: kangsheng <kangsheng@meituan.com>

| File | ||
|---|---|---|
| __init__.py | ||
| test_data_collator.py | ||
| test_trainer.py | ||
| test_trainer_callback.py | ||
| test_trainer_distributed.py | ||
| test_trainer_fsdp.py | ||
| test_trainer_seq2seq.py | ||
| test_trainer_tpu.py | ||
| test_trainer_utils.py | ||