pytorch/torch/_C
Eddie Yan 9ee506bd93 [CUDA][cuBLAS] Add fp16 accumulate option to cuBLAS/cuBLASLt (#144441)
Test for `cublasGemmEx` added, still need to figure out the best way to exercise the other APIs...

Pull Request resolved: https://github.com/pytorch/pytorch/pull/144441
Approved by: https://github.com/Chillee, https://github.com/malfet
2025-02-06 19:04:50 +00:00
..
_dynamo [dynamo][guards] Turn on profiling of guard manager (#145420) 2025-01-23 18:17:43 +00:00
__init__.pyi.in [CUDA][cuBLAS] Add fp16 accumulate option to cuBLAS/cuBLASLt (#144441) 2025-02-06 19:04:50 +00:00
_aoti.pyi
_autograd.pyi update _unsafe_set_version_counter to accept lists of tensors (#137921) 2025-02-04 04:51:11 +00:00
_cpu.pyi [CPUInductor] Fix SVE256 detection (#146207) 2025-02-01 18:51:34 +00:00
_cudnn.pyi Improve typing in torch/types.py (#145237) 2025-01-28 05:29:12 +00:00
_cusparselt.pyi
_distributed_autograd.pyi
_distributed_c10d.pyi [c10d] Add NCCL memory allocator (#145675) 2025-01-30 18:19:00 +00:00
_distributed_rpc.pyi
_distributed_rpc_testing.pyi
_export.pyi
_functions.pyi
_functorch.pyi
_instruction_counter.pyi
_itt.pyi
_lazy.pyi
_lazy_ts_backend.pyi
_monitor.pyi add WaitCounter type interface and get rid of type errors (#146175) 2025-02-01 23:24:52 +00:00
_nn.pyi.in
_nvtx.pyi
_onnx.pyi
_profiler.pyi
_VariableFunctions.pyi.in
_verbose.pyi
build.bzl
return_types.pyi.in