pytorch/torch
Luca Wehrstedt f31452268b Add flop formula for _scaled_mm (#144872)
This will make it work correctly with the partitioner's AutoAC
Pull Request resolved: https://github.com/pytorch/pytorch/pull/144872
Approved by: https://github.com/vkuzo
2025-01-16 13:57:54 +00:00
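(For context: the standard flop count for multiplying an M×K matrix by a K×N matrix is 2 * M * K * N — one multiply and one add per element of the K-long contraction, for each of the M * N outputs. The new `_scaled_mm` formula presumably mirrors this plain-matmul count, which is what lets the partitioner's AutoAC cost the op correctly.)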
_awaits
_C [CUDA][cuBLAS] Add fp16 accumulate option to cuBLAS/cuBLASLt (#144441) 2025-01-15 18:37:55 +00:00
_C_flatbuffer
_custom_op
_decomp Revert "Migrate from Tuple -> tuple in torch/_decomp (#144260)" 2025-01-10 01:47:29 +00:00
_dispatch
_dynamo symbolic_convert: Don't fail when we hit an undefined name (#144784) 2025-01-16 01:47:48 +00:00
_export fix as_bool serde (#144791) 2025-01-15 20:22:26 +00:00
_functorch Add non_c_binding torch functions to allowlist for AOTAutogradCache, confirm no special handlers for them (#144802) 2025-01-15 05:41:36 +00:00
_higher_order_ops [BE] typing for decorators - library (#138969) 2025-01-15 17:08:55 +00:00
_inductor [Quant][Inductor][X86] Separate binary post op fusion and lowering for qconv (#144318) 2025-01-16 03:30:36 +00:00
_lazy
_library [reland][export] don't decompose custom triton op when exporting (#144284) 2025-01-11 01:34:35 +00:00
_logging Implement increment and add_to_set for CompileEventLogger (#143427) 2025-01-14 02:42:49 +00:00
_numpy [BE] fix ruff rule E226: add missing whitespace around operator in f-strings (#144415) 2025-01-08 21:55:00 +00:00
_prims Remove extra copy torch/_prims (#144407) 2025-01-08 20:14:48 +00:00
_prims_common
_refs Fix torch._refs.tensor error with empty list (#143461) 2025-01-08 01:29:00 +00:00
_strobelight
_subclasses Add generator parameter to rand*_like functions (#136780) 2025-01-15 21:16:52 +00:00
_vendor
accelerator
amp
ao [BE] typing for decorators - library (#138969) 2025-01-15 17:08:55 +00:00
autograd [5/N] Apply Ruff fixes and pyupgrade to Python 3.9 (#144205) 2025-01-15 04:00:47 +00:00
backends [CUDA][cuBLAS] Add fp16 accumulate option to cuBLAS/cuBLASLt (#144441) 2025-01-15 18:37:55 +00:00
compiler Add AOTAutogradCache support for cache hot loading APIs (#144499) 2025-01-13 07:07:18 +00:00
contrib
cpu
csrc Remove unnecessary once flag usage (#143255) 2025-01-16 02:36:11 +00:00
cuda Support with statement on torch.Stream (#140138) 2025-01-10 02:05:19 +00:00
distributed [Pipelining] Improve shape inference debug logging (#144929) 2025-01-16 07:30:11 +00:00
distributions ReshapeTransform: added missing argument in docstring (#144401) 2025-01-13 17:59:59 +00:00
export [export] handle buffer/input mutations for joint-graph (#144806) 2025-01-16 00:22:16 +00:00
fft
func
futures
fx Register nonzero for meta device for FBLSim (#144727) 2025-01-15 19:40:42 +00:00
jit Apply Ruff fixes and pyupgrade to torch/jit (#144208) 2025-01-16 00:28:50 +00:00
legacy
lib
linalg
masked Update torch.masked.mean to upcast dtype for bool tensors (#139999) 2025-01-08 10:35:19 +00:00
monitor
mps Stop ignoring mypy errors in torch/testing/_internal/common_utils.py (#144483) 2025-01-14 22:32:51 +00:00
mtia
multiprocessing
nested
nn Add strict kwarg to nn.Module.set_submodule and fix bug for non dot delineated strings (#143455) 2025-01-16 05:06:33 +00:00
onnx [ONNX] Use python_dispatcher in type promotion (#144801) 2025-01-15 23:25:19 +00:00
optim Revert "Removed unused _RequiredParameter (#144771)" 2025-01-15 15:51:33 +00:00
package Revert "Use absolute path path.resolve() -> path.absolute() (#129409)" 2025-01-04 14:17:20 +00:00
profiler [Profiler] Fix device setting error of other backends in torch.profiler (#144237) 2025-01-10 10:41:11 +00:00
quantization
signal [BE] typing for decorators (#144161) 2025-01-04 16:40:09 +00:00
sparse
special
testing dynamic shape support for interpolate(antialias=True) backward (#141198) 2025-01-16 00:08:25 +00:00
utils Add flop formula for _scaled_mm (#144872) 2025-01-16 13:57:54 +00:00
xpu Refine torch.xpu.get_device_properties API error message (#144379) 2025-01-10 06:27:51 +00:00
__config__.py
__future__.py
__init__.py Revert "Fix torch.normal ignores default_device (#144070)" 2025-01-14 17:41:58 +00:00
_appdirs.py
_classes.py
_compile.py [BE] typing for decorators (#144161) 2025-01-04 16:40:09 +00:00
_custom_ops.py
_deploy.py
_environment.py
_guards.py
_jit_internal.py
_linalg_utils.py
_lobpcg.py
_lowrank.py
_meta_registrations.py [inductor] fix index.Tensor fallback (#144736) 2025-01-16 09:38:29 +00:00
_namedtensor_internals.py
_ops.py Propagate callable parameter types using ParamSpec (#142306) (#144047) 2025-01-06 16:16:18 +00:00
_python_dispatcher.py
_size_docs.py
_sources.py
_storage_docs.py
_streambase.py
_tensor.py
_tensor_docs.py Update pin memory related APIs to not pass 'device' argument (#131858) 2025-01-15 17:23:35 +00:00
_tensor_str.py
_thread_safe_fork.py
_torch_docs.py Add generator parameter to rand*_like functions (#136780) 2025-01-15 21:16:52 +00:00
_utils.py
_utils_internal.py
_VF.py
_vmap_internals.py
_weights_only_unpickler.py
abi-check.cpp
CMakeLists.txt Revert "export AOTI_TORCH_EXPORT on Windows. (#140030)" 2025-01-06 18:15:52 +00:00
custom_class.h
custom_class_detail.h
extension.h
functional.py
hub.py
library.h
library.py [BE] typing for decorators - library (#138969) 2025-01-15 17:08:55 +00:00
overrides.py Add generator parameter to rand*_like functions (#136780) 2025-01-15 21:16:52 +00:00
py.typed
quasirandom.py
random.py
README.txt
return_types.py
script.h
serialization.py
storage.py Update pin memory related APIs to not pass 'device' argument (#131858) 2025-01-15 17:23:35 +00:00
torch_version.py
types.py
version.py.tpl

Note [TH abstraction violation]
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

TH/THC provide some `.hpp` headers, which are proper C++ headers rather than
C headers.  These headers do double duty: they are installed alongside the
public headers, but their contents are *internal implementation details*
that should largely not be used by external clients.

Ideally, we would not install these headers at all; instead, you should
use public functions (in headers like `THTensor.h`, NOT `THTensor.hpp`)
to manipulate these structs.  However, there are a few places
in torch/csrc where we violate this abstraction.  They are marked with
a pointer to this note.  Each of those sites will have to be refactored
when we refactor the guts of THTensor and related structures.
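
To make the boundary concrete, here is a minimal sketch of the two access
styles, assuming the historical TH C API (exact names and signatures varied
across releases); it is illustrative, not an excerpt from torch/csrc:

    // Illustrative only: assumes the historical TH C API from THTensor.h.
    #include <TH/THTensor.h>

    // Sums the first column of a 2-D float tensor.
    float sum_first_column(THFloatTensor* t) {
      // OK: query sizes, strides, and data through the public C functions
      // declared in THTensor.h.
      int64_t rows = THFloatTensor_size(t, 0);
      int64_t stride0 = THFloatTensor_stride(t, 0);
      float* data = THFloatTensor_data(t);

      float acc = 0.f;
      for (int64_t i = 0; i < rows; ++i) {
        acc += data[i * stride0];
      }
      return acc;

      // NOT OK (the abstraction violation this note describes): reading
      // fields of the struct that THTensor.hpp happens to expose, e.g.
      // poking at t's storage pointer or size array directly.  That layout
      // is an internal implementation detail and can change whenever the
      // guts of THTensor are refactored.
    }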