pytorch

mirror of https://github.com/saymrwulf/pytorch.git synced 2026-05-14 20:57:59 +00:00


Eddie Yan 9ee506bd93 [CUDA][cuBLAS] Add fp16 accumulate option to cuBLAS/cuBLASLt (#144441) Test for `cublasGemmEx` added, still need to figure out the best way to exercise the other APIs... Pull Request resolved: https://github.com/pytorch/pytorch/pull/144441 Approved by: https://github.com/Chillee, https://github.com/malfet		2025-02-06 19:04:50 +00:00
_coreml	PEP585 update - torch/_higher_order_ops torch/_subclasses torch/backends torch/compiler torch/cuda torch/masked torch/mtia torch/nested (#145202)	2025-01-20 22:37:26 +00:00
_nnapi	PEP585 update - torch/_higher_order_ops torch/_subclasses torch/backends torch/compiler torch/cuda torch/masked torch/mtia torch/nested (#145202)	2025-01-20 22:37:26 +00:00
cpu
cuda	[CUDA][cuBLAS] Add fp16 accumulate option to cuBLAS/cuBLASLt (#144441)	2025-02-06 19:04:50 +00:00
cudnn
cusparselt	Revert "[sparse] add search for optimal alg_id to torch.compile (#137427)"	2024-10-24 17:27:06 +00:00
kleidiai	Revert "Reverting the PR adding Kleidiai-based int4 kernels (#145392)" (#145505)	2025-01-23 18:50:59 +00:00
mha	[BE] replace incorrect .. note:: invocations (#142868)	2024-12-11 19:58:18 +00:00
mkl
mkldnn
mps
nnpack
openmp
opt_einsum
quantized	PEP585 update - torch/_higher_order_ops torch/_subclasses torch/backends torch/compiler torch/cuda torch/masked torch/mtia torch/nested (#145202)	2025-01-20 22:37:26 +00:00
xeon	PEP585 update - torch/_higher_order_ops torch/_subclasses torch/backends torch/compiler torch/cuda torch/masked torch/mtia torch/nested (#145202)	2025-01-20 22:37:26 +00:00
xnnpack
__init__.py	Revert "Reverting the PR adding Kleidiai-based int4 kernels (#145392)" (#145505)	2025-01-23 18:50:59 +00:00