pytorch/test/distributed
Rohan Varma c866446d6c [FSDP] Check module.training for _root_cast_forward_inputs (#104223)
We might erroneously cast forward inputs for the root if it doesn't
manage any handles (FSDP parameters). As a fix, pass in the module and check
its training attribute to ensure we don't cast inputs in eval mode.

Differential Revision: [D47041673](https://our.internmc.facebook.com/intern/diff/D47041673/)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/104223
Approved by: https://github.com/fegin
2023-06-28 16:38:01 +00:00
..
_composable [dynamo x fsdp] Simplify stream logic handling (#103902) 2023-06-21 01:34:19 +00:00
_shard
_spmd
_tensor [DTensor][Random] Introduce CudaRNGStateTracker to maintain parallel RNG state for DTensor (#103235) 2023-06-27 19:00:25 +00:00
_tools
algorithms
bin
checkpoint [DCP] Rewrite read slicing to use a wrapper. (#99167) 2023-06-08 13:52:13 +00:00
elastic
fsdp [FSDP] Check module.training for _root_cast_forward_inputs (#104223) 2023-06-28 16:38:01 +00:00
launcher
nn/jit
optim
pipeline/sync
rpc
tensor/parallel [TP] Add an input resharding wrapper for TP and unit test for 2D + AC (#103334) 2023-06-23 04:05:01 +00:00
argparse_util_test.py
test_c10d_common.py Back out "Revert "[DDP] multiple forward support for static graph (#103487)" (#103873)" (#103938) 2023-06-22 21:55:58 +00:00
test_c10d_gloo.py Enable test sparse allreduce basics Windows (#103317) 2023-06-14 07:37:50 +00:00
test_c10d_logger.py
test_c10d_nccl.py DDP + C10D sparse all_reduce changes (#103916) (#104256) 2023-06-28 00:37:52 +00:00
test_c10d_object_collectives.py [c10d] Remove test for init barrier (#103223) 2023-06-08 16:56:40 +00:00
test_c10d_pypg.py
test_c10d_spawn.py
test_c10d_spawn_gloo.py
test_c10d_spawn_nccl.py
test_c10d_spawn_ucc.py
test_c10d_ucc.py [CI] Enable UCC in CI (#100395) 2023-06-08 19:01:22 +00:00
test_collective_utils.py Initial commit of collective_utils (#101037) 2023-06-27 02:15:16 +00:00
test_data_parallel.py
test_distributed_spawn.py Back out "Revert "[DDP] multiple forward support for static graph (#103487)" (#103873)" (#103938) 2023-06-22 21:55:58 +00:00
test_dynamo_distributed.py [dynamo] FSDP + AC + torch.compile (#103953) 2023-06-24 01:40:56 +00:00
test_fake_pg.py
test_functional_api.py [C10D] Add functional collective reduce_scatter_into_tensor_coalesced. (#101023) 2023-06-23 19:24:11 +00:00
test_inductor_collectives.py [C10D] Add functional collective reduce_scatter_into_tensor_coalesced. (#101023) 2023-06-23 19:24:11 +00:00
test_launcher.py
test_multi_threaded_pg.py [Oncall][MTPG] Fix flaky test multi_threaded - test_broadcast_object_list (#103568) 2023-06-18 07:05:28 +00:00
test_nccl.py
test_pg_wrapper.py
test_store.py [gloo] Make it possible for gloo TCPStore to take over an existing socket fd (#103478) 2023-06-16 17:15:56 +00:00