mirror of
https://github.com/saymrwulf/transformers.git
synced 2026-05-15 21:01:19 +00:00
* add gradient accumulation steps tests for fsdp * invert no_sync context to fix training for fsdp |
||
|---|---|---|
| .. | ||
| test_fsdp.py | ||
* add gradient accumulation steps tests for fsdp * invert no_sync context to fix training for fsdp |
||
|---|---|---|
| .. | ||
| test_fsdp.py | ||