pytorch

mirror of https://github.com/saymrwulf/pytorch.git synced 2026-05-14 20:57:59 +00:00

Author	SHA1	Message	Date
Huy Do	aa7d01ea22	Use sccache 0.9.0 on ROCm build job (#144125 ) TSIA, sccache 0.9.0 seems to work fine with ROCm build job Pull Request resolved: https://github.com/pytorch/pytorch/pull/144125 Approved by: https://github.com/jithunnair-amd, https://github.com/wdvr, https://github.com/jeffdaily	2025-01-04 08:56:48 +00:00
Valentine233	636a2c7e0f	[Inductor][lowering] support out_dtype for dequant lowering (#143845 ) In lowering, support the parameter `out_dtype` for `dequant_per_tensor` and `dequant_per_channel`. Fix the following runtime error issue found in https://github.com/pytorch/ao/pull/1372: ``` File "/home/liaoxuan/pytorch_ao/torch/_inductor/lowering.py", line 452, in wrapped out = decomp_fn(args, *kwargs) torch._dynamo.exc.BackendCompilerFailed: backend='compile_fx_wrapper' raised: LoweringException: TypeError: quantized_decomposed_dequantize_per_tensor_default() got an unexpected keyword argument 'out_dtype' target: quantized_decomposed.dequantize_per_tensor.default args[0]: TensorBox(StorageBox( InputBuffer(name='arg0_1', layout=FixedLayout('cpu', torch.uint8, size=[1, 7, 7, 9], stride=[441, 63, 9, 1])) )) args[1]: 0.01 args[2]: 100 args[3]: 0 args[4]: 255 args[5]: torch.uint8 kwargs: {'out_dtype': torch.bfloat16} ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/143845 Approved by: https://github.com/jgong5, https://github.com/leslie-fang-intel, https://github.com/jansel	2025-01-04 08:48:41 +00:00
Xinran / Allan Rui	417d9c3522	[Inductor/Triton] Upcast FP16/BF16 math reductions to FP32 (#141052 ) Summary: Triton compiler does not automatically promote fp16/bf16 reductions to fp32 accumulation. This will result in significant accuracy issue. This diff will upcast the input to FP32 for all math reductions `["welford_reduce", "welford_combine", "prod", "sum", "xor_sum"]` Test Plan: CI ``` python test/inductor/test_torchinductor.py TritonCodeGenTests.test_low_precision_reduction ``` Differential Revision: D65965032 Pull Request resolved: https://github.com/pytorch/pytorch/pull/141052 Approved by: https://github.com/blaine-rister	2025-01-04 07:57:10 +00:00
Animesh Jain	816328fa51	[dynamo][lazy] LazyVT utils to get original value/source and is_hashable (#144160 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/144160 Approved by: https://github.com/williamwen42, https://github.com/jansel ghstack dependencies: #144129, #144130, #144141, #144158, #144163	2025-01-04 06:23:05 +00:00
Nikita Shulga	b5b1e9456a	[MPSInductor] Add `masked` implementation (#144084 ) More or less borrowed from `22580f160e/torch/_inductor/codegen/halide.py (L549-L563)` `pytest test/inductor/test_torchinductor.py -k _mps` score is 408 failed, 347 passed, 32 skipped Pull Request resolved: https://github.com/pytorch/pytorch/pull/144084 Approved by: https://github.com/Skylion007, https://github.com/jansel ghstack dependencies: #144167, #144162, #144083	2025-01-04 04:30:07 +00:00
Shangdi Yu	f15af077fb	Fix get_source_partitions when weights are tied (#142446 ) Summary: Fix https://github.com/pytorch/pytorch/issues/142035 and https://github.com/pytorch/pytorch/issues/143621 When Linear module params are tied to another parameter, like this: ``` class SimpleLinearModel(nn.Module): def __init__(self, input_size, output_size): super(SimpleLinearModel, self).__init__() # Define a linear layer self.linear = nn.Linear(input_size, output_size) self.tied_weight = self.linear.weight def forward(self, x): # Forward pass through the linear layer b = self.tied_weight + 1 return self.linear(x), b ``` We get a graph like below: ``` graph(): %p_tied_weight : [num_users=0] = placeholder[target=p_tied_weight] %p_linear_weight : [num_users=2] = placeholder[target=p_linear_weight] %p_linear_bias : [num_users=1] = placeholder[target=p_linear_bias] %x : [num_users=1] = placeholder[target=x] %add : [num_users=1] = call_function[target=torch.ops.aten.add.Tensor](args = (%p_linear_weight, 1), kwargs = {}) %linear : [num_users=1] = call_function[target=torch.ops.aten.linear.default](args = (%x, %p_linear_weight, %p_linear_bias), kwargs = {}) return (linear, add) ``` Notice that ` %p_linear_weight : [num_users=2]`. When we get source partitions, we should exclude attributes nodes like `p_linear_weight` from outputs. A real world example where people do something like this is in https://github.com/pytorch/pytorch/issues/142035. Test Plan: ``` buck2 run 'fbcode//mode/dev-nosan' fbcode//caffe2/test:fx -- -r test_module_partitioner_weight_tied ``` Differential Revision: D66998592 Pull Request resolved: https://github.com/pytorch/pytorch/pull/142446 Approved by: https://github.com/angelayi	2025-01-04 04:28:20 +00:00
cyy	f9bf9057ef	Fix ruff warnings in caffe2 and functorch (#144182 ) In preparation for upgrading ruff config to py3.9. Pull Request resolved: https://github.com/pytorch/pytorch/pull/144182 Approved by: https://github.com/malfet	2025-01-04 04:15:01 +00:00
Sam Ginzburg	ec1f56fdcf	[user triton] add support for prune_configs_by in @triton.autotune (#142207 ) This PR adds support for prune_configs_by in the @triton.autotune decorator [docs](https://triton-lang.org/main/python-api/generated/triton.autotune.html#triton.autotune). Supporting this lets users reduce autotuning time by running user-supplied code (early_config_prune, perf_model) to prune the provided list of configs. We implement this by realizing args/kwargs in call_triton_kernel(...), and then calling kernel.prune_configs(...). Pull Request resolved: https://github.com/pytorch/pytorch/pull/142207 Approved by: https://github.com/zou3519, https://github.com/aakhundov	2025-01-04 03:50:28 +00:00
Davide Italiano	479d6f2199	[mps/inductor] Add support for log(). (#144169 ) Tested via: ``` % pytest test/inductor/test_mps_basic.py ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/144169 Approved by: https://github.com/jansel, https://github.com/malfet	2025-01-04 03:07:56 +00:00
Animesh Jain	087c625261	[dynamo] Trace torch.typename (#144163 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/144163 Approved by: https://github.com/yanboliang, https://github.com/williamwen42, https://github.com/jansel ghstack dependencies: #144129, #144130, #144141, #144158	2025-01-04 02:52:58 +00:00
Animesh Jain	3292220c43	[dynamo][easy] Move symnode helpers to utils (#144158 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/144158 Approved by: https://github.com/williamwen42, https://github.com/jansel ghstack dependencies: #144129, #144130, #144141	2025-01-04 02:52:58 +00:00
PHLens	98949df7a4	Fix torch.distributed._functional_collectives.AsyncCollectiveTensor for aten.to. (#134661 ) Fixes #133421 Pull Request resolved: https://github.com/pytorch/pytorch/pull/134661 Approved by: https://github.com/bdhirsh	2025-01-04 02:33:38 +00:00
eqy	7e3cd0e488	[CUDA] Check `size` calculation in `ilpReduce` for `softmax` (#144009 ) For #143644 Pull Request resolved: https://github.com/pytorch/pytorch/pull/144009 Approved by: https://github.com/Skylion007	2025-01-04 02:31:15 +00:00
eqy	dbdda654af	[64-bit][CUDA] Upsample2D 64-bit indexing fix attempt 2 (#141923 ) #141831 Block/thread math requires a cast... Pull Request resolved: https://github.com/pytorch/pytorch/pull/141923 Approved by: https://github.com/ngimel	2025-01-04 02:30:38 +00:00
xinan.lin	1d091e47d6	[Inductor UT] Generalize device-bias code in test_torchinductor.py introduced by #143884 . (#144057 ) Fix #144056 Pull Request resolved: https://github.com/pytorch/pytorch/pull/144057 Approved by: https://github.com/EikanWang, https://github.com/jansel	2025-01-04 02:24:33 +00:00
isalia20	22580f160e	Multinomial sampling fix on mps for non contiguous tensors (#141515 ) Fixes #141457 As for the tests. I looked in `test/test_mps.py` but I saw that `test_multinomial` function is disabled. Glad to add test where needed if there is some place where multinomial function is tested on metal. Pull Request resolved: https://github.com/pytorch/pytorch/pull/141515 Approved by: https://github.com/malfet Co-authored-by: Nikita Shulga <2453524+malfet@users.noreply.github.com>	2025-01-04 01:21:37 +00:00
Nikita Shulga	464b50dbd7	[MPSInductor] Add `floor_div` and `index_expr` implementation (#144083 ) Simply copy-n-pasted from CPPInductor `pytest test/inductor/test_torchinductor.py -k _mps` score is 418 failed, 337 passed, 32 skipped Pull Request resolved: https://github.com/pytorch/pytorch/pull/144083 Approved by: https://github.com/jansel ghstack dependencies: #144167, #144162	2025-01-04 01:10:01 +00:00
Nikita Shulga	6d25938540	[MPSInductor] Add `remainder` op (#144162 ) For it to return correct result for half precision type it must be upcast to float Pull Request resolved: https://github.com/pytorch/pytorch/pull/144162 Approved by: https://github.com/jansel ghstack dependencies: #144167	2025-01-04 00:47:40 +00:00
Nikita Shulga	f8e1eacf2f	[MPSInductor] Extend `constant` to bool type (#144167 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/144167 Approved by: https://github.com/jansel	2025-01-04 00:47:40 +00:00
Yuanhao Ji	d41134f7e5	[Inductor] Fix `torch.polygamma()` when n == 0 (#144058 ) Fixes #143648 aten: `dec1a6d0f0/aten/src/ATen/native/cpu/UnaryOpsKernel.cpp (L436-L447)` compiled kernel code: ``` cpp_fused_polygamma_0 = async_compile.cpp_pybinding(['const float', 'float'], ''' #include "/tmp/torchinductor_devuser/tmpi1d9ksww/db/cdb7hyptwxpzukwd42x4ajfjlgrpum4a4htdd6lhb65apclsmno4.h" extern "C" void kernel(const float* in_ptr0, float* out_ptr0) { { { { auto tmp0 = in_ptr0[static_cast<int64_t>(0L)]; auto tmp1 = static_cast<float>(0.0); auto tmp2 = tmp1 == 0 ? calc_digamma(tmp0) : calc_polygamma(tmp0, tmp1); out_ptr0[static_cast<int64_t>(0L)] = tmp2; } } } } ''') ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/144058 Approved by: https://github.com/jansel	2025-01-04 00:22:10 +00:00
bobrenjc93	52742b07c5	remove allow-untyped-defs from nn/utils/_deprecation_utils.py (#144136 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/144136 Approved by: https://github.com/aorenste	2025-01-03 23:44:14 +00:00
Xiaodong Wang	0a94bb432e	[ROCm] CK Flash Attention Backend (#143695 ) Replace https://github.com/pytorch/pytorch/pull/138947 for re-import. Replaces https://github.com/ROCm/pytorch/pull/1592 This PR contains the initial implementation of SDPA with composable_kernel backend. The CK path can be forced by simply calling torch.backends.cuda.preferred_rocm_fa_library("ck"). Similarly, you can force the incumbent aotriton implementation by passing in "aotriton" or "default". As you'd expect, not setting this option will result in aotriton to be used as the backend. In the case of CK, if pytorch deems flash attention usable, then it will use the CK path in all the same places aotriton would have been used. This PR makes no changes to the heuristics which select which attention scheme to use (i.e. flash attention vs memory efficient attention vs math etc etc). It only gets called when flash attention is both enabled (via USE_FLASH_ATTENTION) and is selected at runtime by the existing heuristics. Files located in pytorch/aten/src/ATen/native/transformers/hip/flash_attn/ck/mha* have been pulled from https://github.com/Dao-AILab/flash-attention courtesy of @tridao's hard work who is the co-author NOTE: In order to use this backend, the user MUST set USE_CK_FLASH_ATTENTION=1 in their environment when they build PyTorch. Pull Request resolved: https://github.com/pytorch/pytorch/pull/143695 Approved by: https://github.com/malfet Co-authored-by: Andy Lugo <Andy.LugoReyes@amd.com> Co-authored-by: Jithun Nair <jithun.nair@amd.com>	2025-01-03 22:01:36 +00:00
Huy Do	3251171ae8	Make whl metadata public readable (#144164 ) After https://github.com/pytorch/pytorch/pull/143677/files#r1902138480 lands, the new nightly wheel metadata is not readable publicly causing pip install to fail, for example https://github.com/pytorch/pytorch/actions/runs/12603415308/job/35128414909. FBGEMM folks are also noticed this failure on their end (cc @q10) Pull Request resolved: https://github.com/pytorch/pytorch/pull/144164 Approved by: https://github.com/clee2000	2025-01-03 21:08:49 +00:00
drisspg	9bf2a9a616	[ScaledMM] Fix NaNs in test for garbage input data (#144042 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/144042 Approved by: https://github.com/janeyx99	2025-01-03 21:02:25 +00:00
Jay Zhang	b75f32b848	Update TorchDynamo-based ONNX Exporter memory usage example code. (#144139 ) Address related comments earlier. Pull Request resolved: https://github.com/pytorch/pytorch/pull/144139 Approved by: https://github.com/justinchuby Co-authored-by: Justin Chu <justinchuby@users.noreply.github.com>	2025-01-03 20:41:36 +00:00
bobrenjc93	64bffb3124	remove allow-untyped-defs onnx/_internal/exporter/_fx_passes.py (#144134 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/144134 Approved by: https://github.com/Skylion007	2025-01-03 20:18:40 +00:00
bobrenjc93	64b197b603	remove allow-untyped-defs from export/_remove_auto_functionalized_pass.py (#144135 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/144135 Approved by: https://github.com/Skylion007	2025-01-03 20:08:11 +00:00
bobrenjc93	9b8a4e7141	remove allow-untyped-defs from torch/onnx/operators.py (#144133 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/144133 Approved by: https://github.com/Skylion007	2025-01-03 20:07:56 +00:00
bobrenjc93	6e09d32c00	remove allow-untyped-defs from torch/jit/_passes/_property_propagation.py (#144132 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/144132 Approved by: https://github.com/Skylion007	2025-01-03 20:07:37 +00:00
Wanchao Liang	eb7a303d21	[dtensor] expose the __create_chunk_list__ in the doc (#144100 ) as titled, this PR expose this dunder method as a public API in the doc, so that different checkpoint implementations can leverage this protocol, instead of exposing a separate API Pull Request resolved: https://github.com/pytorch/pytorch/pull/144100 Approved by: https://github.com/awgu ghstack dependencies: #144099	2025-01-03 20:06:23 +00:00
Xuehai Pan	45411d1fc9	Use absolute path `path.resolve()` -> `path.absolute()` (#129409 ) Changes: 1. Always explicit `.absolute()`: `Path(__file__)` -> `Path(__file__).absolute()` 2. Replace `path.resolve()` with `path.absolute()` if the code is resolving the PyTorch repo root directory. Pull Request resolved: https://github.com/pytorch/pytorch/pull/129409 Approved by: https://github.com/albanD	2025-01-03 20:03:40 +00:00
bobrenjc93	e9e18a9617	remove allow-untyped-defs from _export/db/logging.py (#144093 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/144093 Approved by: https://github.com/Skylion007	2025-01-03 19:36:14 +00:00
Nikita Shulga	ad09395674	[MPSInductor] Fix multi rangevar kernel invocation (#144050 ) By changing `thread_position_in_grid` type to uint{n} and passing dimentions during the kernel call `pytest test/inductor/test_torchinductor.py -k _mps` score is 445 failed, 309 passed, 32 skipped Pull Request resolved: https://github.com/pytorch/pytorch/pull/144050 Approved by: https://github.com/jansel ghstack dependencies: #144055, #144051, #144122, #144105, #144156	2025-01-03 19:32:43 +00:00
Nikita Shulga	52e107a7ca	[MPSInductor] Add `constant`, `isinf` and `isnan` ops (#144156 ) Per Table 6.5 of [Metal Language Specification](https://developer.apple.com/metal/Metal-Shading-Language-Specification.pdf) infinity is `HUGE_VALF` Pull Request resolved: https://github.com/pytorch/pytorch/pull/144156 Approved by: https://github.com/Skylion007, https://github.com/jansel ghstack dependencies: #144055, #144051, #144122, #144105	2025-01-03 19:32:43 +00:00
Catherine Lee	383ff4011c	[ez] Use strip for arg sanitization in upload_metadata_file to improve readability (#144155 ) Minor thing that improves readability. I didn't realize you could specify characters for strip when I wrote this Pull Request resolved: https://github.com/pytorch/pytorch/pull/144155 Approved by: https://github.com/huydhn, https://github.com/Skylion007	2025-01-03 19:25:30 +00:00
bobrenjc93	8b3479e361	remove allow-untyped-defs from torch/distributed/fsdp/_dynamo_utils.py (#144131 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/144131 Approved by: https://github.com/Skylion007	2025-01-03 19:07:21 +00:00
Jane Xu	7b69f7b449	Clarify what we mean by decoupled weight decay in the *AdamWs (#144101 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/144101 Approved by: https://github.com/albanD	2025-01-03 19:06:00 +00:00
Yidi Wu	c36f94b373	[while_loop][dynamo] auto-unspecialize int input and output to unbacked symints (#143106 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/143106 Approved by: https://github.com/zou3519 ghstack dependencies: #143105, #143545	2025-01-03 19:01:07 +00:00
Yidi Wu	5660709856	[hop][BE] unify meta checking with check_meta_consistency (#143545 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/143545 Approved by: https://github.com/zou3519 ghstack dependencies: #143105	2025-01-03 19:01:07 +00:00
Yidi Wu	6e8dca9ff3	[while_loop][aot] auto-unspecialize int input and output to unbacked symints (#143105 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/143105 Approved by: https://github.com/zou3519	2025-01-03 19:01:07 +00:00
Davide Italiano	56f6289f6a	[mps/inductor] Add support for atanh(). (#144121 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/144121 Approved by: https://github.com/jansel, https://github.com/malfet	2025-01-03 18:55:05 +00:00
Nikita Shulga	a7b61c5b49	[MPSInductor] Add signbit op support (#144105 ) By mapping it to `metal::signbit` Pull Request resolved: https://github.com/pytorch/pytorch/pull/144105 Approved by: https://github.com/jansel, https://github.com/Skylion007 ghstack dependencies: #144055, #144051, #144122	2025-01-03 18:34:46 +00:00
PyTorch MergeBot	8d63a4a409	Revert "Set `enable_trace_contextlib_contextmanager` flag to True (#140604 )" This reverts commit `1c817fe671`. Reverted https://github.com/pytorch/pytorch/pull/140604 on behalf of https://github.com/guilhermeleobas due to breaking one of the benchmarks (moco) ([comment](https://github.com/pytorch/pytorch/pull/140604#issuecomment-2569640837))	2025-01-03 18:23:53 +00:00
Animesh Jain	c5c897c3a1	[dynamo][easy] Miscellaneous fixes (#144141 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/144141 Approved by: https://github.com/williamwen42 ghstack dependencies: #144129, #144130	2025-01-03 18:22:56 +00:00
Animesh Jain	732359c633	[dynamo][easy] Minor fixes in guards.cpp (#144130 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/144130 Approved by: https://github.com/williamwen42 ghstack dependencies: #144129	2025-01-03 18:22:56 +00:00
Animesh Jain	a450e177fd	[dynamo] remove inline inbuilt tests as flag is enabled by default (#144129 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/144129 Approved by: https://github.com/williamwen42	2025-01-03 18:22:56 +00:00
PyTorch MergeBot	2409b49a33	Revert "Rewrite _reparametrize_module to use `contextmanager` (#138203 )" This reverts commit `7bf3b7cdc5`. Reverted https://github.com/pytorch/pytorch/pull/138203 on behalf of https://github.com/guilhermeleobas due to breaking one of the benchmarks (moco) ([comment](https://github.com/pytorch/pytorch/pull/138203#issuecomment-2569634001))	2025-01-03 18:17:32 +00:00
Blaine Burton Rister	60fe8a65af	[Inductor] Generalize tiling algorithm to handle fused reductions (#144041 ) # Issue This PR cleans up an edge case that wasn't handled by https://github.com/pytorch/pytorch/pull/137243. The existing tiling code assumes that `node.get_ranges()` is a reliable source of pointwise and reduction numels. This is true for pointwise kernels, but the situation is more complicated with reductions. Since reductions change the number of elements in a tensor, not all ops within a reduction kernel will have the same number of iterations. For example, `var_mean` fuses pointwise division with the output of reduction sum, and the division lacks the corresponding reduction ranges. # Fix Instead of getting numels from `node.get_ranges()`, explicitly pass the global pointwise and reduction numels to the relevant tiling functions. In `SIMDKernel.complete_partial_tiling`, we solve for the missing numel by diving the global numel by the partial tiling's numel. This ensures all tilings have the correct global numel. Also, in `SIMDKernel.is_compatible`, add the global reduction numel to node ranges that are missing it. For example, `{"x": 8, "r0_": 8}` is compatible with a node of ranges `([8], [])` when we have `reduction_numel=8`. Finally, this PR generalizes some of the existing codegen to handle multiple reduction dims. We already had code to ignore reduction splits for pointwise kernels, but it only worked for 1D reductions. Now it can handle ND. # Test plan This PR parametrizes the existing CI test for `var_mean` to also run with tiled reductions. It also adds a new test checking that `var_mean` generates 2D tilings (with tiled reduction enabled). These new tests would fail on the current main branch. Pull Request resolved: https://github.com/pytorch/pytorch/pull/144041 Approved by: https://github.com/jansel	2025-01-03 18:16:27 +00:00
Colin Peppler	e93f625d00	[AOTI] don't codegen autotune_at_compile_time for non-Triton kernels (#143990 ) `autotune_at_compile_time` is a separate codegen file specifically for autotuning Triton kernels. We can skip it for non-Triton kernels (like CUTLASS). This test (test_aoti_workspace_ptr) checks that `workspace_0.data_ptr()` is codegen-ed correctly in AOTI. ``` // in AOTI codegen kernels.cuda_fused_0( (const half)arg0_1.data_ptr(), (const half)arg1_1.data_ptr(), (half)buf0.data_ptr(), (int)200, (int)5216, (int)10432, (int)10432, (int)5216, (int)0, (int)5216, (size_t)nullptr, (uint8_t*)workspace_0.data_ptr(), stream); ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/143990 Approved by: https://github.com/henrylhtsang, https://github.com/chenyang78, https://github.com/desertfire	2025-01-03 18:01:12 +00:00
Huy Do	f3968373c1	Migrate the rest of CUDA 12.1 jobs to 12.4 (#144118 ) CUDA 12.4 is the default now and we don't build nightly 12.1 anymore, so it's time to move the rest of CI jobs to 12.4. I also clean up some redundant CI jobs on periodic and inductor-periodic. Pull Request resolved: https://github.com/pytorch/pytorch/pull/144118 Approved by: https://github.com/atalman	2025-01-03 17:45:41 +00:00

1 2 3 4 5 ...

82818 commits