| .. |
|
_composable
|
[ca] expose option to collect sizes as dynamic (#141153)
|
2024-11-22 19:26:27 +00:00 |
|
_shard
|
Flip default on weights_only (#137602)
|
2024-11-04 18:30:29 +00:00 |
|
_tensor
|
Fix condition for weights_only unpickler for DTensor (#140740)
|
2024-11-19 02:44:53 +00:00 |
|
_tools
|
ILP for auto FSDP wrapping (#140298)
|
2024-11-11 22:02:39 +00:00 |
|
algorithms
|
|
|
|
bin
|
|
|
|
checkpoint
|
[DCP] Unit Test to validate the stateful and non-stateful loads (#139251)
|
2024-10-31 01:12:51 +00:00 |
|
elastic
|
Fix rendezvous error due to EtcdStore get method not waiting in some cases (#137056)
|
2024-10-02 01:45:00 +00:00 |
|
flight_recorder
|
[FR] Polish the log message for dtype mismatch and don't exit when too many mismatch (#140451)
|
2024-11-13 07:24:53 +00:00 |
|
fsdp
|
[DTensor][unpickler] Add DTensor related classes to allowed globals so we can still torch.load(DTensor) with weights_only=True (#139949)
|
2024-11-08 05:06:11 +00:00 |
|
launcher
|
|
|
|
nn/jit
|
|
|
|
optim
|
|
|
|
pipelining
|
[pipelining] allow multiple backward grads (#140981)
|
2024-11-23 00:35:08 +00:00 |
|
rpc
|
|
|
|
tensor/parallel
|
[Inductor] improve the stride preservation logic of user-visible outputs (#136732)
|
2024-10-26 18:49:14 +00:00 |
|
argparse_util_test.py
|
|
|
|
test_backends.py
|
API to retrieve default distributed backend from device (#140536)
|
2024-11-22 11:01:53 +00:00 |
|
test_c10d_common.py
|
PyProcessGroup: support rank, world size, group name/desc overrides (#141529)
|
2024-11-26 20:56:57 +00:00 |
|
test_c10d_functional_native.py
|
[c10d][Partial-Graph Overlap] Support calling .wait_tensor() on output tensor of eager async_op=True collective if under allow_inflight_collective_as_graph_input_ctx() context manager (#137763)
|
2024-10-29 03:31:19 +00:00 |
|
test_c10d_gloo.py
|
|
|
|
test_c10d_logger.py
|
[c10d] Switch all timer logging in c10d to wait_counter (#141154)
|
2024-11-21 01:10:11 +00:00 |
|
test_c10d_nccl.py
|
[c10d] Test needs abort; otherwise will hang (#141509)
|
2024-11-27 05:47:17 +00:00 |
|
test_c10d_object_collectives.py
|
[c10d][CI] Improve world size setting in some tests (#138846)
|
2024-10-25 23:02:17 +00:00 |
|
test_c10d_ops_nccl.py
|
[c10d][CI] Improve world size setting in some tests (#138846)
|
2024-10-25 23:02:17 +00:00 |
|
test_c10d_pypg.py
|
PyProcessGroup: support rank, world size, group name/desc overrides (#141529)
|
2024-11-26 20:56:57 +00:00 |
|
test_c10d_spawn.py
|
|
|
|
test_c10d_spawn_gloo.py
|
|
|
|
test_c10d_spawn_nccl.py
|
|
|
|
test_c10d_spawn_ucc.py
|
|
|
|
test_c10d_ucc.py
|
|
|
|
test_collective_utils.py
|
|
|
|
test_compute_comm_reordering.py
|
[CI] Add Compiled DDP / Compiled FSDP2 / compute-comm reordering tests to test_inductor_distributed (#138178)
|
2024-10-20 19:38:18 +00:00 |
|
test_control_collectives.py
|
|
|
|
test_data_parallel.py
|
Replace clone.detach with detach.clone (#140264)
|
2024-11-13 07:01:02 +00:00 |
|
test_device_mesh.py
|
[DeviceMesh] fix sub mesh size calculation in create_sub_mesh() (#138945)
|
2024-10-29 17:56:56 +00:00 |
|
test_distributed_spawn.py
|
|
|
|
test_dynamo_distributed.py
|
[Dynamo][Distributed] Fix ProcessGroup getattr (#141638)
|
2024-11-27 21:42:33 +00:00 |
|
test_fake_pg.py
|
|
|
|
test_functional_api.py
|
Generalization of distributed test cases for non-CUDA devices (#138216)
|
2024-11-18 09:38:00 +00:00 |
|
test_inductor_collectives.py
|
Move Sympy printers to torch/utils/_sympy/printers.py (#140597)
|
2024-11-26 18:11:00 +00:00 |
|
test_launcher.py
|
|
|
|
test_multi_threaded_pg.py
|
|
|
|
test_nccl.py
|
[Pytorch][ATEN] Enable FP8 NCCL in Pytorch ATEN (#138776)
|
2024-10-25 21:56:47 +00:00 |
|
test_pg_wrapper.py
|
|
|
|
test_store.py
|
[TCPStore] Throw value error if passing world_size=0 to TCPStore (#137792)
|
2024-10-11 23:42:57 +00:00 |
|
test_symmetric_memory.py
|
[torch/distributed] Make _SymmetricMemory.has_multicast_support() ret… (#141598)
|
2024-11-26 23:36:32 +00:00 |