pytorch/aten/src/ATen/native/cuda
Davide Italiano 2a55311773 [cuda] Simplify the sinc function a bit. (#146774)
`else` after `return` can be removed & the indentation can be reduced, for readability.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/146774
Approved by: https://github.com/malfet
2025-02-09 20:09:34 +00:00
cutlass_extensions
linalg c10::string_view -> std::string_view in aten (#141903) 2024-12-07 23:23:52 +00:00
AbsKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
Activation.cpp c10::string_view -> std::string_view in aten (#141903) 2024-12-07 23:23:52 +00:00
Activation.h Modernize C++ code in aten/src/ATen/ (#141424) 2024-11-24 02:15:19 +00:00
ActivationEluKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
ActivationGeluKernel.cu
ActivationGluKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
ActivationHardshrinkKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
ActivationHardsigmoidKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
ActivationHardswishKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
ActivationHardtanhKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
ActivationLeakyReluKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
ActivationLogSigmoidKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
ActivationMishKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
ActivationPreluKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
ActivationSiluKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
ActivationSoftplusKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
ActivationSoftshrinkKernel.cu softshrink nan fixes (#138421) 2024-11-21 23:06:08 +00:00
ActivationThresholdKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
AdaptiveAveragePooling.cu
AdaptiveAveragePooling3d.cu
AdaptiveMaxPooling2d.cu
AdaptiveMaxPooling3d.cu
airy_ai.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
AmpKernels.cu
AveragePool2d.cu [CUDA][B200] Update the number of threads in avg_pool2d backward for SM 10.0 (#145669) 2025-02-06 18:57:33 +00:00
AveragePool3d.cu
bessel_j0.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
bessel_j1.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
bessel_y0.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
bessel_y1.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
BinaryBitwiseOpsKernels.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
BinaryDivFloorKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
BinaryDivTrueKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
BinaryDivTruncKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
BinaryGeometricKernels.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
BinaryInternal.h
BinaryLogicalOpsKernels.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
BinaryMiscBackwardOpsKernels.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
BinaryMiscOpsKernels.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
BinaryMulKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
BinaryRemainderKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
BinaryShiftOpsKernels.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
Blas.cpp [ROCm] hipblaslt rowwise f8 gemm (#144432) 2025-01-15 18:23:44 +00:00
block_reduce.cuh [BE] Use nested namespace in ATen/native/cuda (#136570) 2024-09-24 22:19:10 +00:00
Bucketization.cu [CUDA][64-bit indexing] Fix some existing problematic int64_t _ = blockIdx.* * blockDim.* code (#142010) 2024-12-19 00:55:11 +00:00
chebyshev_polynomial_t.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
chebyshev_polynomial_u.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
chebyshev_polynomial_v.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
chebyshev_polynomial_w.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
Col2Im.cu
CompareEQKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
CompareKernels.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
ComplexKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
CompositeRandomAccessor.h
ConvolutionMM2d.cu
Copy.cu use copy2d in h2d/d2h copy when possible (#146256) 2025-02-03 23:07:54 +00:00
Copy.h
CopysignKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
CrossKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
CUDAJitLoops.cuh [ATen][CUDA] Implement 128 bit vectorization v2 (#145746) 2025-01-31 06:42:08 +00:00
CUDALoops.cuh [ATen][CUDA] Implement 128 bit vectorization v2 (#145746) 2025-01-31 06:42:08 +00:00
CUDAScalar.cu [redo] Fp8 support for item() with cuda, index_select, and fill_ cpu (#137341) 2024-10-07 00:58:51 +00:00
CuFFTPlanCache.h [BE] Use C++17 convetion methods in CUDA kernels (#136575) 2024-09-25 04:30:01 +00:00
CuFFTUtils.h Remove deprecated alias macro(1/3) (#137556) 2024-10-21 17:32:32 +00:00
CumminmaxKernel.cu
CumprodKernel.cu
CumsumKernel.cu
DepthwiseConv2d.cu Work around buggy use_const_ref_for_mutable_tensors (#145530) 2025-01-24 14:38:49 +00:00
DepthwiseConv3d.cu [6/N] Fix Wextra-semi warning (#139605) 2024-11-04 13:43:16 +00:00
DeviceSqrt.cuh [BE] Use nested namespace in ATen/native/cuda (#136570) 2024-09-24 22:19:10 +00:00
DilatedMaxPool2d.cu
DilatedMaxPool3d.cu
DistanceKernel.cu [CUDA][64-bit indexing] Fix some existing problematic int64_t _ = blockIdx.* * blockDim.* code (#142010) 2024-12-19 00:55:11 +00:00
DistributionBernoulli.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
DistributionCauchyKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
DistributionExponentialKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
DistributionGeometricKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
DistributionLogNormalKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
DistributionNormal.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
DistributionRandomKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
Distributions.cpp
Distributions.cu
Distributions.h
DistributionTemplates.h restore rng generation for fbcode (#144819) 2025-01-16 06:46:26 +00:00
DistributionUniform.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
Dropout.cu [ATen][CUDA] Implement 128 bit vectorization v2 (#145746) 2025-01-31 06:42:08 +00:00
Embedding.cu set CUB_VERSION to 200001 for USE_ROCM (#140861) 2024-12-10 02:28:48 +00:00
EmbeddingBackwardKernel.cu
EmbeddingBackwardKernel.cuh [BE] Use nested namespace in ATen/native/cuda (#136570) 2024-09-24 22:19:10 +00:00
EmbeddingBag.cu Add range check embedding_bag on input index >= 0 of cuda device (#140791) 2024-12-20 05:47:26 +00:00
Equal.cpp
FillKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
FlattenIndicesKernel.cu [6/N] Fix Wextra-semi warning (#139605) 2024-11-04 13:43:16 +00:00
ForeachBinaryOpList.cu [FSDP2] support torch._foreach_copy_(float8) for fully_shard(Float8Linear) (#135955) 2024-10-07 16:36:31 +00:00
ForeachBinaryOpScalar.cu
ForeachBinaryOpScalarList.cu
ForeachBinaryOpScalarTensor.cu
ForeachFunctors.cuh Add ScalarList overload to _foreach_lerp (#134482) 2024-11-12 19:03:41 +00:00
ForeachMinMaxFunctors.cuh
ForeachPointwiseOp.cu
ForeachReduceOp.cu
ForeachTernaryOp.cu Add ScalarList overload to _foreach_lerp (#134482) 2024-11-12 19:03:41 +00:00
ForeachUnaryOp.cu implement torch._foreach_rsqrt (#134574) 2024-11-12 15:34:35 +00:00
FractionalMaxPool2d.cu Revert "Remove C10_DEPRECATED (#138406)" 2024-10-22 18:00:41 +00:00
FractionalMaxPool3d.cu [CUDA][64-bit indexing] Fix some existing problematic int64_t _ = blockIdx.* * blockDim.* code (#142010) 2024-12-19 00:55:11 +00:00
FunctionOfAMatrixUtilsKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
fused_adam_amsgrad_impl.cu
fused_adam_amsgrad_impl.cuh [BE] Use nested namespace in ATen/native/cuda (#136570) 2024-09-24 22:19:10 +00:00
fused_adam_impl.cu
fused_adam_impl.cuh [BE] Use nested namespace in ATen/native/cuda (#136570) 2024-09-24 22:19:10 +00:00
fused_adam_utils.cuh [BE] Use nested namespace in ATen/native/cuda (#136570) 2024-09-24 22:19:10 +00:00
fused_adamw_amsgrad_impl.cu [BE] Use nested namespace in ATen/native/cuda (#136570) 2024-09-24 22:19:10 +00:00
fused_adamw_amsgrad_impl.cuh [BE] Use nested namespace in ATen/native/cuda (#136570) 2024-09-24 22:19:10 +00:00
fused_adamw_impl.cu [BE] Use nested namespace in ATen/native/cuda (#136570) 2024-09-24 22:19:10 +00:00
fused_adamw_impl.cuh [BE] Use nested namespace in ATen/native/cuda (#136570) 2024-09-24 22:19:10 +00:00
FusedAdamKernel.cu [5/N] Apply bugprone-unchecked-optional-access (#143111) 2024-12-15 01:07:28 +00:00
FusedAdamWKernel.cu [5/N] Apply bugprone-unchecked-optional-access (#143111) 2024-12-15 01:07:28 +00:00
FusedSgdKernel.cu [5/N] Apply bugprone-unchecked-optional-access (#143111) 2024-12-15 01:07:28 +00:00
GcdLcmKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
GridSampler.cpp
GridSampler.cu
GridSampler.cuh [BE] Use nested namespace in ATen/native/cuda (#136570) 2024-09-24 22:19:10 +00:00
GridSampler.h Modernize C++ code in aten/src/ATen/ (#141424) 2024-11-24 02:15:19 +00:00
group_norm_kernel.cu [CUDA][64-bit indexing] Fix some existing problematic int64_t _ = blockIdx.* * blockDim.* code (#142010) 2024-12-19 00:55:11 +00:00
hermite_polynomial_h.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
hermite_polynomial_he.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
IGammaKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
Im2Col.cu
im2col.cuh [BE] Remove unusued channels arg in col2im (#142336) 2024-12-09 01:49:41 +00:00
Indexing.cu c10::string_view -> std::string_view in aten (#141903) 2024-12-07 23:23:52 +00:00
IndexKernel.cpp [4/N] Avoid copy in std::get (#142285) 2024-12-09 07:59:35 +00:00
IndexKernel.cu add fp8 support to index_cuda (#144747) 2025-01-17 22:53:23 +00:00
IndexKernel.h Modernize C++ code in aten/src/ATen/ (#141424) 2024-11-24 02:15:19 +00:00
int4mm.cu [BE] Use C++17 convetion methods in CUDA kernels (#136575) 2024-09-25 04:30:01 +00:00
jit_utils.cpp [ATen][CUDA] Implement 128 bit vectorization v2 (#145746) 2025-01-31 06:42:08 +00:00
jit_utils.h [ATen][CUDA] Implement 128 bit vectorization v2 (#145746) 2025-01-31 06:42:08 +00:00
JitLoops.cuh [BE] Use nested namespace in ATen/native/cuda (#136570) 2024-09-24 22:19:10 +00:00
KernelUtils.cuh [AMD] Fix torch ck backend build with 6.2.1 (#138434) 2024-10-21 06:38:38 +00:00
laguerre_polynomial_l.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
LaunchUtils.h
layer_norm_kernel.cu [ROCm] fix torch.layer_norm invalid configuration problem when input is large tensor (#144007) 2025-01-07 19:17:02 +00:00
LegacyThrustHelpers.cu
legendre_polynomial_p.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
Lerp.cu Fix torch.lerp RuntimeError when weight is CPU scalar while input & end are CUDA tensor (#141820) 2024-12-09 18:14:54 +00:00
LinearAlgebra.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
LinearAlgebraStubs.cpp c10::string_view -> std::string_view in aten (#141903) 2024-12-07 23:23:52 +00:00
LogAddExpKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
LogcumsumexpKernel.cu
Loops.cuh [3/N] Replace at::detail::Array with std::array (#141324) 2024-11-24 18:17:34 +00:00
Loss.cu Remove unneeded optional dereference (#141578) 2024-12-12 04:34:43 +00:00
LossCTC.cu [CUDA][64-bit indexing] Fix some existing problematic int64_t _ = blockIdx.* * blockDim.* code (#142010) 2024-12-19 00:55:11 +00:00
Math.cuh [cuda] Simplify the sinc function a bit. (#146774) 2025-02-09 20:09:34 +00:00
MaxMinElementwiseKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
MaxUnpooling.cu [CUDA][64-bit indexing] Fix some existing problematic int64_t _ = blockIdx.* * blockDim.* code (#142010) 2024-12-19 00:55:11 +00:00
MemoryAccess.cuh [ATen][CUDA] Implement 128 bit vectorization v2 (#145746) 2025-01-31 06:42:08 +00:00
MiscUtils.h
MixedDtypesLinear.cu [cutlass-3] Update third-party/cutlass-3 from 3.4 to 3.5.1 (#143515) 2025-01-02 18:45:11 +00:00
modified_bessel_i0.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
modified_bessel_i1.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
modified_bessel_k0.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
modified_bessel_k1.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
MultiLabelMarginCriterion.cu
MultiMarginLoss.cu
MultinomialKernel.cu Revert "[ROCm] remove caffe2 from hipify (#137157)" 2024-10-08 17:45:45 +00:00
MultiTensorApply.cuh correctly keep track of processed tensors for foreach reductions (#140103) 2024-11-08 23:04:53 +00:00
NaiveConvolutionTranspose2d.cu [6/N] Fix Wextra-semi warning (#139605) 2024-11-04 13:43:16 +00:00
NaiveConvolutionTranspose3d.cu [6/N] Fix Wextra-semi warning (#139605) 2024-11-04 13:43:16 +00:00
NaiveDilatedConvolution.cu [6/N] Fix Wextra-semi warning (#139605) 2024-11-04 13:43:16 +00:00
NLLLoss2d.cu
Nonzero.cu Implements nonzero_static on cuda (#141838) 2024-12-11 06:44:48 +00:00
Normalization.cu Eliminate c10::value_or_else (#138818) 2024-10-25 17:59:01 +00:00
Normalization.cuh c10::string_view -> std::string_view in aten (#141903) 2024-12-07 23:23:52 +00:00
PersistentSoftmax.cuh Improve softmax's perf in cuda (#144679) 2025-01-23 00:02:57 +00:00
PointwiseOpsKernel.cu Add support for CPU scalar in addcmul (#143264) 2024-12-18 04:43:29 +00:00
Pow.cuh [BE] Use C++17 convetion methods in CUDA kernels (#136575) 2024-09-25 04:30:01 +00:00
PowKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
Randperm.cu
Randperm.cuh [4/N] Avoid copy in std::get (#142285) 2024-12-09 07:59:35 +00:00
RangeFactories.cu
RecordStream.cu
Reduce.cu
Reduce.cuh [ROCm] Tune 3d tensor sums when not using fastest dimension (#146170) 2025-02-04 04:02:16 +00:00
ReduceAMinMaxKernel.cu
ReduceArgMaxKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
ReduceArgMinKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
ReduceLogicKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
ReduceMaxValuesKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
ReduceMinValuesKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
ReduceMomentKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
ReduceNormKernel.cu
ReduceOps.cpp [6/N] Fix Wextra-semi warning (#139605) 2024-11-04 13:43:16 +00:00
ReduceOps.h Modernize C++ code in aten/src/ATen/ (#141424) 2024-11-24 02:15:19 +00:00
ReduceSumProdKernel.cu [ROCM] Enable *_load_dwordx4 ISA for BFloat16 and Half. (#141397) 2024-12-12 03:27:49 +00:00
reduction_template.cuh [BE] Use C++17 convetion methods in CUDA kernels (#136575) 2024-09-25 04:30:01 +00:00
ReflectionPad.cu Add determinmistic kernel for reflection2d (#136241) 2025-01-29 20:34:03 +00:00
RenormKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
Repeat.cu [CUDA][64-bit indexing] Fix some existing problematic int64_t _ = blockIdx.* * blockDim.* code (#142010) 2024-12-19 00:55:11 +00:00
ReplicationPadding.cu [CUDA][64-bit indexing] Fix some existing problematic int64_t _ = blockIdx.* * blockDim.* code (#142010) 2024-12-19 00:55:11 +00:00
Resize.cpp Make Context to be Device-agnostic Step by Step (1/N) (#136519) (#138155) 2024-10-17 20:58:56 +00:00
Resize.h
RNN.cu Eliminate c10::value_or_else (#138818) 2024-10-25 17:59:01 +00:00
RowwiseScaledMM.cu Build RowwiseScaledMM.cu for SM89 (#145676) 2025-02-01 11:44:58 +00:00
RowwiseScaledMM.h
RreluWithNoise.cu [4/N] Avoid copy in std::get (#142285) 2024-12-09 07:59:35 +00:00
scaled_modified_bessel_k0.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
scaled_modified_bessel_k1.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
ScanKernels.cpp Implement deterministic scan (#140887) 2024-11-19 23:43:26 +00:00
ScanKernels.h
ScanUtils.cuh [cumsum][CUDA][64-bit indexing] Add 64-bit indexing path for cumsum (#143696) 2024-12-24 03:45:28 +00:00
ScatterGatherKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
SegmentReduce.cu [CUDA][64-bit indexing] Fix some existing problematic int64_t _ = blockIdx.* * blockDim.* code (#142010) 2024-12-19 00:55:11 +00:00
Shape.cu [2/N] Avoid copy in std::get (#141826) 2024-12-02 00:16:48 +00:00
shifted_chebyshev_polynomial_t.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
shifted_chebyshev_polynomial_u.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
shifted_chebyshev_polynomial_v.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
shifted_chebyshev_polynomial_w.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
SoftMax.cu Improve softmax's perf in cuda (#144679) 2025-01-23 00:02:57 +00:00
Sort.cpp Support torch.bool in torch.sort + CUDA (#139409) 2024-11-06 00:02:54 +00:00
Sort.cu Remove unused <ATen/core/Array.h> inclusion (#143701) 2024-12-22 14:30:11 +00:00
Sort.h
SortImpl.cu
Sorting.cpp
Sorting.cu Remove deprecated alias macro(1/3) (#137556) 2024-10-21 17:32:32 +00:00
Sorting.h Modernize C++ code in aten/src/ATen/ (#141424) 2024-11-24 02:15:19 +00:00
SortingCommon.cuh Recover non-standard bool test for msort (#139870) 2024-11-11 02:00:34 +00:00
SortingRadixSelect.cuh [BE] Use nested namespace in ATen/native/cuda (#136570) 2024-09-24 22:19:10 +00:00
SortStable.cu Remove unused <ATen/core/Array.h> inclusion (#143701) 2024-12-22 14:30:11 +00:00
SortStable.h
SortUtils.cuh [BE] Use nested namespace in ATen/native/cuda (#136570) 2024-09-24 22:19:10 +00:00
SparseBinaryOpIntersectionKernel.cu [6/N] Fix Wextra-semi warning (#139605) 2024-11-04 13:43:16 +00:00
SparseMM.cu Remove deprecated alias macro(1/3) (#137556) 2024-10-21 17:32:32 +00:00
SpectralOps.cpp Implement AcceleratorHooksInterface's virtual functions deviceCount() and getCurrentDevice() for CUDA and XPU (#136752) 2024-10-03 14:44:58 +00:00
SpectralOps.cu [1/N] Remove inclusion of ATen/core/Array.h (#122064) 2024-11-18 08:50:28 +00:00
spherical_bessel_j0.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
StepKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
SummaryOps.cu [ROCm][Windows] Fix isnan integer overload errors on MS STL (#146605) 2025-02-06 23:44:11 +00:00
TensorCompare.cpp [6/N] Fix Wextra-semi warning (#139605) 2024-11-04 13:43:16 +00:00
TensorCompare.cu Add FP8 support for eye (#139974) 2024-12-24 10:00:23 +00:00
TensorFactories.cu [CUDA][64-bit indexing] Fix some existing problematic int64_t _ = blockIdx.* * blockDim.* code (#142010) 2024-12-19 00:55:11 +00:00
TensorModeKernel.cpp [6/N] Fix Wextra-semi warning (#139605) 2024-11-04 13:43:16 +00:00
TensorModeKernel.cu
TensorModeKernel.cuh [BE] Use nested namespace in ATen/native/cuda (#136570) 2024-09-24 22:19:10 +00:00
TensorModeKernel.h Modernize C++ code in aten/src/ATen/ (#141424) 2024-11-24 02:15:19 +00:00
TensorShape.cu
TensorShapeCUDA.cpp
TensorTopK.cpp
TensorTopK.cu Removes threadfence from topk kernel to improve AMD performance (#145536) 2025-01-29 01:29:15 +00:00
TensorTopK.h Modernize C++ code in aten/src/ATen/ (#141424) 2024-11-24 02:15:19 +00:00
TensorTransformations.cu [CUDA][64-bit indexing] Fix some existing problematic int64_t _ = blockIdx.* * blockDim.* code (#142010) 2024-12-19 00:55:11 +00:00
thread_constants.h [ATen][CUDA] Implement 128 bit vectorization v2 (#145746) 2025-01-31 06:42:08 +00:00
TriangularOps.cu
UnaryComplexKernels.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
UnaryFractionKernels.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
UnaryGammaKernels.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
UnaryGeometricAcoshKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
UnaryGeometricAcosKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
UnaryGeometricAsinhKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
UnaryGeometricAsinKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
UnaryGeometricAtanhKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
UnaryGeometricAtanKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
UnaryGeometricCoshKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
UnaryGeometricCosKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
UnaryGeometricSinhKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
UnaryGeometricSinKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
UnaryGeometricTanhKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
UnaryGeometricTanKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
UnaryLogKernels.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
UnaryOpsKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
UnarySignKernels.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
UnarySpecialOpsKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
UnfoldBackwardKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00
Unique.cu
UniqueCub.cu
UniqueCub.cuh [BE] Use nested namespace in ATen/native/cuda (#136570) 2024-09-24 22:19:10 +00:00
UpSample.cuh [BE] Use nested namespace in ATen/native/cuda (#136570) 2024-09-24 22:19:10 +00:00
UpSampleBicubic2d.cu
UpSampleBilinear2d.cu Revert "Remove C10_DEPRECATED (#138406)" 2024-10-22 18:00:41 +00:00
UpSampleLinear1d.cu
UpSampleNearest1d.cu
UpSampleNearest2d.cu [64-bit][CUDA] Upsample2D 64-bit indexing fix attempt 2 (#141923) 2025-01-04 02:30:38 +00:00
UpSampleNearest3d.cu [64-bit] Int64 casting for UpSampleNearest3D (#144865) 2025-01-29 19:30:09 +00:00
UpSampleTrilinear3d.cu
ValidateCompressedIndicesKernel.cu
vol2col.cuh [BE] Use nested namespace in ATen/native/cuda (#136570) 2024-09-24 22:19:10 +00:00
WeightNorm.cu
ZetaKernel.cu [5/N] Fix Wextra-semi warning (#139465) 2024-11-03 20:40:50 +00:00