pytorch/torch
Wang, Eikan 9895c19a7a To vectorize long datatype as mask index (#91076)
This PR records the current fx node being executed so that additional information can be cached, which simplifies the vectorization checker. It also adds support for `masked` by simplifying it to `mask_load`, in order to support `max_pool2d`.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/91076
Approved by: https://github.com/jgong5, https://github.com/desertfire, https://github.com/jansel
2023-02-05 03:36:22 +00:00
_awaits [jit] Support Awaitable type (#90863) 2023-01-30 17:38:59 +00:00
_C [jit] Support Awaitable type (#90863) 2023-01-30 17:38:59 +00:00
_C_flatbuffer
_decomp [primTorch] Rewrite nan_to_num ref in terms of aten functions (#93952) 2023-02-03 13:51:37 +00:00
_dispatch
_dynamo [dynamo 3.11] update instruction sizes (#93984) 2023-02-04 04:09:24 +00:00
_export [Export] Introduce as_none in ex.Argument union type (#93210) 2023-01-30 21:32:49 +00:00
_functorch Enable Python dispatcher on inference-only aot_dispatch_base (#94118) 2023-02-04 06:10:21 +00:00
_inductor To vectorize long datatype as mask index (#91076) 2023-02-05 03:36:22 +00:00
_lazy
_prims [pt2] Fix arange to match ATen behavior (#93353) 2023-02-03 00:44:32 +00:00
_prims_common [pt2] Fix arange to match ATen behavior (#93353) 2023-02-03 00:44:32 +00:00
_refs Removes stray print (#94079) 2023-02-03 21:56:45 +00:00
_subclasses Add count to FakeTensorMode.__torch_dispatch__ (#93936) 2023-02-03 14:21:11 +00:00
amp
ao ns: add compare_weights API with a single model (#92058) 2023-02-03 01:17:19 +00:00
autograd [Py3.11] Remove skip logic from vmap and forward_ad (#91825) 2023-01-25 22:40:56 +00:00
backends
contrib
cpu
csrc Some performance fixes (#94034) 2023-02-04 02:17:48 +00:00
cuda FusedAdam(W) should take OptState into account before unscaling grads (#94060) 2023-02-04 05:20:13 +00:00
distributed [c10d] Update get_backend() in exception_handler (#94063) 2023-02-04 19:39:36 +00:00
distributions [Dynamo] Fix calling UserDefinedObject.func should pass self object (#92050) 2023-01-21 05:47:01 +00:00
fft
func
futures
fx Add lowerings for all symbolic shape operators (#94121) 2023-02-04 12:57:22 +00:00
jit [jit] jit._drop fun modifier to allow in jit class non-jit decl funs (#93012) 2023-02-01 09:02:05 +00:00
legacy
lib More fixes and improved clang-tidy checkers (#93213) 2023-02-01 14:44:17 +00:00
linalg
masked
monitor
multiprocessing Set min supported Python version to 3.8 (#93155) 2023-01-29 18:28:46 +00:00
nested
nn MHA optimizations (#93234) 2023-02-03 15:18:35 +00:00
onnx [ONNX] Export 'aten::index_put(self, mask, v)' when rank(mask) < rank(self) (#92862) 2023-01-27 02:00:56 +00:00
optim [mta][foreach] Implement fused adamw (#88015) 2023-02-01 19:32:29 +00:00
package Set min supported Python version to 3.8 (#93155) 2023-01-29 18:28:46 +00:00
profiler Silence profiler error (#94013) 2023-02-03 17:33:47 +00:00
quantization
signal
sparse Revert "Improve bsr @ strided performance in baddmm for bfloat16/half with Triton kernels. (#88078)" 2023-01-26 16:22:29 +00:00
special
testing [small] multithreaded-pg guard attr (#93883) 2023-02-03 23:01:02 +00:00
utils collect CPU info with collect_env.py for new issues reporting (#93899) 2023-02-03 04:58:53 +00:00
__config__.py
__future__.py
__init__.py Hack SymInt.__iadd__ to be working. (#94136) 2023-02-04 21:17:36 +00:00
_appdirs.py
_classes.py
_deploy.py
_guards.py
_jit_internal.py [jit] jit._drop fun modifier to allow in jit class non-jit decl funs (#93012) 2023-02-01 09:02:05 +00:00
_linalg_utils.py Remove deprecated torch.symeig (#70988) 2023-01-31 11:59:11 +00:00
_lobpcg.py
_lowrank.py
_meta_registrations.py [pt2] Fix arange to match ATen behavior (#93353) 2023-02-03 00:44:32 +00:00
_namedtensor_internals.py
_ops.py Fix checking of current mode in PyOperator dispatch (#92357) 2023-01-18 23:08:36 +00:00
_python_dispatcher.py
_six.py
_sources.py
_storage_docs.py
_tensor.py Remove deprecated torch.symeig (#70988) 2023-01-31 11:59:11 +00:00
_tensor_docs.py Remove deprecated torch.symeig (#70988) 2023-01-31 11:59:11 +00:00
_tensor_str.py
_torch_docs.py Remove deprecated torch.symeig (#70988) 2023-01-31 11:59:11 +00:00
_utils.py
_utils_internal.py
_VF.py
_vmap_internals.py
_weights_only_unpickler.py
abi-check.cpp
CMakeLists.txt
custom_class.h More fixes and improved clang-tidy checkers (#93213) 2023-02-01 14:44:17 +00:00
custom_class_detail.h More tidy fixes (#93069) 2023-01-27 06:40:50 +00:00
extension.h
functional.py
hub.py
library.h
library.py
overrides.py Remove deprecated torch.symeig (#70988) 2023-01-31 11:59:11 +00:00
py.typed
quasirandom.py
random.py
README.txt
return_types.py
script.h
serialization.py
storage.py
torch_version.py
types.py

Note [TH abstraction violation]
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

TH/THC provide some hpp headers, which are proper C++ headers rather than
C headers.  These headers serve double duty as *internal implementation
detail* headers, whose contents should largely not be used by external
clients.

Ideally, we would not install these headers at all; instead, you should
use public functions (in headers like `THTensor.h`, NOT `THTensor.hpp`)
to manipulate these structs.  However, there are a few places
in torch/csrc where we violate this abstraction.  They are marked with
a pointer to this note.  Each of those sites will have to be refactored
when we refactor the guts of THTensor and related structures.
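
As a rough illustration of the distinction this note draws, the sketch below
contrasts client code that goes through accessor functions with client code
that reads a struct's fields directly.  Every name in it (`ExampleTensor`,
`example_tensor_ndim`, `example_tensor_size`, and the two `numel_*` helpers)
is hypothetical and invented for this example; none of it is the actual
TH/THC API, and it only assumes the general shape of the problem described
above.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a struct whose definition lives in an
// internal ".hpp" header. Its layout is an implementation detail.
struct ExampleTensor {
  std::vector<int64_t> sizes_;  // may be renamed or restructured in a refactor
};

// Hypothetical stand-ins for the public accessor functions that a ".h"
// header would expose. Clients are expected to use only these.
inline int64_t example_tensor_ndim(const ExampleTensor* t) {
  return static_cast<int64_t>(t->sizes_.size());
}

inline int64_t example_tensor_size(const ExampleTensor* t, int64_t dim) {
  return t->sizes_.at(static_cast<std::size_t>(dim));
}

// Preferred: depends only on the accessor functions, so it keeps working
// if the struct's internals change.
inline int64_t numel_via_public_api(const ExampleTensor* t) {
  int64_t n = 1;
  for (int64_t d = 0; d < example_tensor_ndim(t); ++d) {
    n *= example_tensor_size(t, d);
  }
  return n;
}

// Abstraction violation: reads the field directly, so it must be
// rewritten whenever the struct's internals are refactored.
inline int64_t numel_via_internals(const ExampleTensor* t) {
  int64_t n = 1;
  for (int64_t s : t->sizes_) {
    n *= s;
  }
  return n;
}
```

Both helpers compute the same value today; the difference is that only the
second one breaks when the struct layout changes, which is why such sites
need to be tracked and refactored together.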