pytorch/c10
Nikita Shulga 2328dcccb9 [MPSInductor] Implement Welford reduction (#146703)
Still a work in progress: the fallback works as expected, but the custom shader does not.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/146703
Approved by: https://github.com/jansel, https://github.com/dcci
2025-02-08 05:00:00 +00:00
benchmark
core Use std::string_view (#145906) 2025-01-30 03:14:27 +00:00
cuda [Windows][ROCm] Fix c10 hip tests (#146599) 2025-02-06 23:41:25 +00:00
hip
macros [ROCm][Windows] Fix export macros (#144098) 2025-01-04 17:12:46 +00:00
metal [MPSInductor] Implement Welford reduction (#146703) 2025-02-08 05:00:00 +00:00
mobile
test Fix cppcoreguidelines-init-variables ignorance (#141795) 2025-01-28 17:11:37 +00:00
util [ROCm][Windows] Fix unrecognized _BitScanReverse intrinsic (#146606) 2025-02-06 23:47:18 +00:00
xpu Filter out iGPU if dGPU is found on XPU (#144378) 2025-01-29 15:53:16 +00:00
BUCK.oss
BUILD.bazel
build.bzl
CMakeLists.txt
ovrsource_defs.bzl