pytorch/c10
Aaron Enye Shi aa4ad711ef [CCA][Memory Snapshot] Create TraceEntryRingBuffer class for alloc_trace logic (#130741)
Summary:
Move the alloc_trace logic into a separate class to reduce the risk of deadlocks when mixing with CCA's lock. Switch to a std::mutex instead of a std::recursive_mutex.

Lets us reuse the logic in the TraceEntryRingBuffer class in later diffs.
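
The idea in the summary can be illustrated with a minimal sketch. This is not the actual TraceEntryRingBuffer from the PR; `RingBufferSketch`, `Entry`, `record`, and `snapshot` are hypothetical names chosen for illustration. It shows a fixed-capacity ring buffer that owns its own lock, so callers never need to hold an outer allocator lock while touching the trace. Because no public method calls another locking method, a plain std::mutex is sufficient here and a std::recursive_mutex is not needed.

```cpp
#include <cstddef>
#include <mutex>
#include <vector>

// Hypothetical stand-in for a trace record; the real entry type is not shown in the source.
struct Entry {
  int id;
};

// Illustrative ring buffer guarded by its own non-recursive mutex (assumed design).
template <typename T>
class RingBufferSketch {
 public:
  explicit RingBufferSketch(std::size_t capacity) : capacity_(capacity) {}

  // Append an entry, overwriting the oldest one once the buffer is full.
  void record(const T& entry) {
    std::lock_guard<std::mutex> guard(mutex_);
    if (capacity_ == 0) {
      return;  // nothing can be stored
    }
    if (entries_.size() < capacity_) {
      entries_.push_back(entry);
    } else {
      entries_[next_] = entry;
      next_ = (next_ + 1) % capacity_;
    }
  }

  // Copy out the entries in oldest-to-newest order.
  std::vector<T> snapshot() const {
    std::lock_guard<std::mutex> guard(mutex_);
    const auto split = entries_.begin() + static_cast<std::ptrdiff_t>(next_);
    std::vector<T> out;
    out.reserve(entries_.size());
    out.insert(out.end(), split, entries_.end());
    out.insert(out.end(), entries_.begin(), split);
    return out;
  }

 private:
  const std::size_t capacity_;
  mutable std::mutex mutex_;  // private lock, never held while calling out
  std::vector<T> entries_;
  std::size_t next_ = 0;  // index of the oldest entry once the buffer is full
};
```

With a capacity of 3, recording entries 1 through 5 and then calling `snapshot()` returns the three most recent entries in insertion order.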

Test Plan: CI, resnet run, and FBR model.

Differential Revision: D59690408

Pulled By: aaronenyeshi

Pull Request resolved: https://github.com/pytorch/pytorch/pull/130741
Approved by: https://github.com/davidberard98
2024-07-16 15:01:48 +00:00
benchmark [Split Build] Add option to create libtorch wheel and use it to build pytorch as a separate wheel (#126328) 2024-05-29 04:33:56 +00:00
core Invalidate StorageImpl instances when tensor is overwritten with cudagraphs (#125264) 2024-07-16 14:29:29 +00:00
cuda [CCA][Memory Snapshot] Create TraceEntryRingBuffer class for alloc_trace logic (#130741) 2024-07-16 15:01:48 +00:00
hip [Split Build] Add option to create libtorch wheel and use it to build pytorch as a separate wheel (#126328) 2024-05-29 04:33:56 +00:00
macros [torch/c10] Add C10_UBSAN_ENABLED macro and use it to disable SymInt_… (#127967) 2024-06-14 16:01:12 +00:00
mobile Fix broken lint after #116876 (#122253) 2024-03-20 04:09:00 +00:00
test [9/N] Replace c10::optional with std::optional (#130674) 2024-07-15 00:48:43 +00:00
util [9/N] Replace c10::optional with std::optional (#130674) 2024-07-15 00:48:43 +00:00
xpu [1/N] Change #include <c10/util/Optional.h> to #include <optional> (#128301) 2024-07-08 07:03:53 +00:00
BUCK.oss [1/4] Intel GPU Runtime Upstreaming for Device (#116019) 2024-01-12 07:36:25 +00:00
BUILD.bazel
build.bzl Revert "Move at::{Refcounted,}MapAllocator to c10 (#109881)" 2023-10-13 17:57:53 +00:00
CMakeLists.txt Expose nlohmann json to torch (#129570) 2024-06-26 21:59:26 +00:00
ovrsource_defs.bzl [caffe2] Add build configuration for linux-arm64 (#124618) 2024-04-25 21:55:26 +00:00