pytorch/torch/csrc/distributed/c10d
2024-12-27 05:46:57 +00:00
..
control_collectives
control_plane Re-enable some C++ warnings (#142332) 2024-12-12 04:02:12 +00:00
cuda [experimental] async-tp impl with cutlass-based, progress aware kernel (#139227) 2024-11-08 23:28:25 +00:00
quantization [Distributed] [19/N] Fix clang-tidy warnings in torch/csrc/distributed/ (#138903) 2024-10-28 05:29:25 +00:00
Backend.cpp
Backend.hpp Revert "[PGNCCL] Add an API to get the status/error code of each PG (#140087)" 2024-11-19 22:53:46 +00:00
Backoff.cpp [Distributed] [18/N] Fix clang-tidy warnings in torch/csrc/distributed/ (#138692) 2024-10-25 05:32:38 +00:00
Backoff.hpp
c10d.h
comm.cpp
comm.hpp
CudaDMAConnectivity.cpp [Distributed] [18/N] Fix clang-tidy warnings in torch/csrc/distributed/ (#138692) 2024-10-25 05:32:38 +00:00
CUDASymmetricMemory-inl.h [torch] Address -Wreturn-type warning when compiling for AMD (#138951) 2024-10-28 18:26:40 +00:00
CUDASymmetricMemory.cu Revert "[Environment Variable][7/N] Use thread-safe getenv functions (#140211)" 2024-11-21 21:05:22 +00:00
CUDASymmetricMemory.hpp [SymmetricMemory] support specifying group_name at rendezvous time (#139529) 2024-11-17 09:31:17 +00:00
CUDASymmetricMemoryOps.cu [SymmetricMemory] introduce multimem_all_gather (#142810) 2024-12-17 01:07:27 +00:00
debug.cpp
debug.h
default_comm_hooks.cpp
default_comm_hooks.hpp
DMAConnectivity.cpp Use Wextra-semi (#140236) 2024-11-13 02:15:16 +00:00
DMAConnectivity.hpp [Distributed] [19/N] Fix clang-tidy warnings in torch/csrc/distributed/ (#138903) 2024-10-28 05:29:25 +00:00
error.h
exception.h
FakeProcessGroup.hpp
FileStore.cpp Add and use thread-safe strerror (#140472) 2024-11-19 04:24:17 +00:00
FileStore.hpp
FlightRecorder.cpp [c10d][fr] flight recorder improvements (#143446) 2024-12-20 20:41:30 +00:00
FlightRecorder.hpp [2/N] Rename NCCLTraceBuffer to FlightRecorder (#141712) 2024-11-29 21:15:31 +00:00
Functional.cpp [c10d][Partial-Graph Overlap] Support calling .wait_tensor() on output tensor of eager async_op=True collective if under allow_inflight_collective_as_graph_input_ctx() context manager (#137763) 2024-10-29 03:31:19 +00:00
Functional.hpp [c10d][Partial-Graph Overlap] Support calling .wait_tensor() on output tensor of eager async_op=True collective if under allow_inflight_collective_as_graph_input_ctx() context manager (#137763) 2024-10-29 03:31:19 +00:00
GlooDeviceFactory.cpp Enable more C++ warnings (#143355) 2024-12-27 05:46:57 +00:00
GlooDeviceFactory.hpp
GroupRegistry.cpp [Distributed] [19/N] Fix clang-tidy warnings in torch/csrc/distributed/ (#138903) 2024-10-28 05:29:25 +00:00
GroupRegistry.hpp [Distributed] [18/N] Fix clang-tidy warnings in torch/csrc/distributed/ (#138692) 2024-10-25 05:32:38 +00:00
HashStore.cpp
HashStore.hpp
init.cpp c10d: no call_guard in init (#143598) 2024-12-20 22:23:36 +00:00
intra_node_comm.cpp [IntraNodeComm] fix a recent breakage (#141200) 2024-11-26 00:46:38 +00:00
intra_node_comm.cu [IntraNodeComm] fix a recent breakage (#141200) 2024-11-26 00:46:38 +00:00
intra_node_comm.hpp [Distributed] [18/N] Fix clang-tidy warnings in torch/csrc/distributed/ (#138692) 2024-10-25 05:32:38 +00:00
logger.cpp [2/N] Apply bugprone-unchecked-optional-access (#141091) 2024-12-09 19:30:19 +00:00
logger.hpp [fr][c10d] log trace capture enabled or not in flight recorder (#143865) 2024-12-27 03:07:55 +00:00
logging.cpp
logging.h
NanCheck.cu catch tensor.numel() == 0 in nan detector (#140741) 2024-11-15 05:03:20 +00:00
NanCheck.hpp
NCCLUtils.cpp [PGNCCL] Move NCCLComm impl to cpp (#142826) 2024-12-12 02:45:52 +00:00
NCCLUtils.hpp [PGNCCL] Move NCCLComm impl to cpp (#142826) 2024-12-12 02:45:52 +00:00
Ops.cpp [Distributed] [17/N] Fix clang-tidy warnings in torch/csrc/distributed/ (#138465) 2024-10-24 04:58:49 +00:00
ParamCommsUtils.cpp
ParamCommsUtils.hpp
PrefixStore.cpp
PrefixStore.hpp
ProcessGroup.cpp [c10d][Partial-Graph Overlap] Support calling .wait_tensor() on output tensor of eager async_op=True collective if under allow_inflight_collective_as_graph_input_ctx() context manager (#137763) 2024-10-29 03:31:19 +00:00
ProcessGroup.hpp Register Intel distributed Backend (XCCL) in PyTorch distributed package (#141856) 2024-12-10 01:58:06 +00:00
ProcessGroupGloo.cpp Use task submitter TLS in gloo working threads (#142184) 2024-12-06 17:03:17 +00:00
ProcessGroupGloo.hpp Use task submitter TLS in gloo working threads (#142184) 2024-12-06 17:03:17 +00:00
ProcessGroupMPI.cpp [c10d][Partial-Graph Overlap] Support calling .wait_tensor() on output tensor of eager async_op=True collective if under allow_inflight_collective_as_graph_input_ctx() context manager (#137763) 2024-10-29 03:31:19 +00:00
ProcessGroupMPI.hpp
ProcessGroupNCCL.cpp [fr][c10d] log trace capture enabled or not in flight recorder (#143865) 2024-12-27 03:07:55 +00:00
ProcessGroupNCCL.hpp [fr][c10d] log trace capture enabled or not in flight recorder (#143865) 2024-12-27 03:07:55 +00:00
ProcessGroupUCC.cpp [c10d][UCC] Add _reduce_scatter_base to c10d::ProcessGroupUCC (#138021) 2024-12-09 16:02:24 +00:00
ProcessGroupUCC.hpp [c10d][UCC] Add _reduce_scatter_base to c10d::ProcessGroupUCC (#138021) 2024-12-09 16:02:24 +00:00
ProcessGroupWrapper.cpp
ProcessGroupWrapper.hpp
PyProcessGroup.hpp PyProcessGroup: support rank, world size, group name/desc overrides (#141529) 2024-11-26 20:56:57 +00:00
python_comm_hook.cpp [Distributed] [18/N] Fix clang-tidy warnings in torch/csrc/distributed/ (#138692) 2024-10-25 05:32:38 +00:00
python_comm_hook.h
RankLocal.hpp [Distributed] [19/N] Fix clang-tidy warnings in torch/csrc/distributed/ (#138903) 2024-10-28 05:29:25 +00:00
reducer.cpp [3/N] Apply bugprone-unchecked-optional-access (#142442) 2024-12-11 01:39:10 +00:00
reducer.hpp [3/N] Apply bugprone-unchecked-optional-access (#142442) 2024-12-11 01:39:10 +00:00
reducer_cuda.cpp
reducer_timer.hpp
sequence_num.cpp [4/N] Apply bugprone-unchecked-optional-access (#142832) 2024-12-12 04:33:32 +00:00
sequence_num.hpp [Distributed] [19/N] Fix clang-tidy warnings in torch/csrc/distributed/ (#138903) 2024-10-28 05:29:25 +00:00
socket.cpp Enable more C++ warnings (#143099) 2024-12-17 02:03:39 +00:00
socket.h
socket_fmt.h
Store.cpp
Store.hpp [Distributed] [19/N] Fix clang-tidy warnings in torch/csrc/distributed/ (#138903) 2024-10-28 05:29:25 +00:00
SymmetricMemory.cpp [SymmetricMemory] introduce multimem_all_gather (#142810) 2024-12-17 01:07:27 +00:00
SymmetricMemory.hpp [torch/distributed] Make _SymmetricMemory.has_multicast_support() ret… (#141598) 2024-11-26 23:36:32 +00:00
TCPStore.cpp [2/N] Apply bugprone-unchecked-optional-access (#141091) 2024-12-09 19:30:19 +00:00
TCPStore.hpp
TCPStoreBackend.cpp Enable clang-tidy on torch/csrc/distributed (#139043) 2024-10-28 13:56:54 +00:00
TCPStoreBackend.hpp
TCPStoreLibUvBackend.cpp [2/N] Apply bugprone-unchecked-optional-access (#141091) 2024-12-09 19:30:19 +00:00
TraceUtils.h [pgnccl][simple] log started work numel (#139773) 2024-11-05 23:11:19 +00:00
Types.hpp
UCCTracing.cpp
UCCTracing.hpp
UCCUtils.cpp
UCCUtils.hpp [3/N] Replace c10::sv with std::sv (#139861) 2024-11-07 20:03:57 +00:00
UnixSockUtils.hpp
Utils.cpp
Utils.hpp Add and use thread-safe strerror (#140472) 2024-11-19 04:24:17 +00:00
WinSockUtils.hpp
Work.cpp [c10d][Partial-Graph Overlap] Support calling .wait_tensor() on output tensor of eager async_op=True collective if under allow_inflight_collective_as_graph_input_ctx() context manager (#137763) 2024-10-29 03:31:19 +00:00
Work.hpp