pytorch/torch/_C
Jeff Daily 0e7bd7fedd [ROCm] TunableOp improvements (#124362)
- use less memory; smaller default hipblaslt workspace size
- options to avoid cache effects
  - icache flush option
  - rotating buffers during tuning
- python APIs
- unit tests

Pull Request resolved: https://github.com/pytorch/pytorch/pull/124362
Approved by: https://github.com/xw285cornell
2024-06-03 22:30:11 +00:00
_dynamo [compiled autograd] log in cpp using python logger (#126483) 2024-05-19 23:49:52 +00:00
__init__.pyi.in [ROCm] TunableOp improvements (#124362) 2024-06-03 22:30:11 +00:00
_aoti.pyi
_autograd.pyi
_cpu.pyi
_cudnn.pyi
_distributed_autograd.pyi
_distributed_c10d.pyi Revert "distributed debug handlers (#126601)" 2024-05-31 01:21:24 +00:00
_distributed_rpc.pyi
_distributed_rpc_testing.pyi
_functions.pyi
_functorch.pyi Fix perf regression caused by #122074 (#126996) 2024-05-24 04:27:22 +00:00
_itt.pyi
_lazy.pyi
_lazy_ts_backend.pyi
_monitor.pyi
_nn.pyi.in
_nvtx.pyi
_onnx.pyi
_profiler.pyi [1/N][Easy] fix typo for usort config in pyproject.toml (kown -> known): sort stdlib (#127122) 2024-05-25 08:25:50 +00:00
_VariableFunctions.pyi.in
_verbose.pyi
build.bzl
return_types.pyi.in