pytorch/test
Adnan Akhundov 2ead6c2f6e Skip launching kernels with zero grid in AOT Inductor (#110312)
Summary: when the grid is computed in terms of unbacked `SymInt`s, it can turn out to be zero-size. This causes a CUDA error on `cuLaunchKernel` in the AOT Inductor codegen.

When the grid contains unbacked `SymInt`s, this PR adds a check around the `launchKernel` call in AOT Inductor's C++ wrapper codegen to make sure that the grid is not zero-size.
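
For illustration only, the guard described above can be sketched in Python. This is not the Inductor codegen itself (the actual check is emitted into the generated C++ wrapper); the helper name `launch_if_nonzero_grid` and its signature are hypothetical.

```python
from typing import Callable, Sequence


def launch_if_nonzero_grid(grid: Sequence[int], launch: Callable[[], None]) -> bool:
    """Hypothetical helper: call launch() only if every grid dimension is positive.

    Returns True when the kernel was launched, False when it was skipped.
    """
    # A zero-size grid means there is no work to do, and passing it to the
    # launch call would raise an error, so the launch is skipped entirely.
    if any(dim <= 0 for dim in grid):
        return False
    launch()
    return True


# Usage: the callable stands in for the real kernel launch.
launched = launch_if_nonzero_grid((0, 1, 1), lambda: print("launching"))
print(launched)
```

The check costs a few integer comparisons at runtime, which is why it only needs to be emitted when the grid size is not known ahead of time.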

Pull Request resolved: https://github.com/pytorch/pytorch/pull/110312
Approved by: https://github.com/chenyang78
2023-09-30 09:12:56 +00:00
ao/sparsity
autograd
backends/xeon
benchmark_utils
bottleneck_test
cpp [Reland] Add -Wdeprecated and related fixes (#110019) 2023-09-28 03:34:29 +00:00
cpp_api_parity
cpp_extensions
custom_backend
custom_operator Add torch.ops.import_module (#110090) 2023-09-27 13:56:47 +00:00
distributed Define the public API for torch.distributed.fsdp (#109922) 2023-09-28 02:15:58 +00:00
distributions [BE]: enable ruff rules PLR1722 and PLW3301 (#109461) 2023-09-18 02:07:21 +00:00
dynamo [Dynamo] Add functional triton kernel wrapper (#110185) 2023-09-30 04:20:20 +00:00
edge [executorch][kernel reg] Allow kernel manual registration (#110086) 2023-09-27 16:04:20 +00:00
error_messages
expect Revert "[core IR] Add lift_fresh, split.Tensor, and unbind decompositions to core ATen decomp table (#110102)" 2023-09-28 19:03:25 +00:00
export dynamic_shapes + retrace exported program (#110276) 2023-09-29 21:06:46 +00:00
forward_backward_compatibility Flash Attention v2 (#105602) 2023-09-13 13:59:05 +00:00
functorch Replace node.meta source_fn with source_fn_stack (#108595) 2023-09-28 18:18:36 +00:00
fx Back out "Revert D49107540: [pytorch][PR] split by tag" (#109332) 2023-09-16 05:29:16 +00:00
inductor Skip launching kernels with zero grid in AOT Inductor (#110312) 2023-09-30 09:12:56 +00:00
jit update type() calling to not use unneeded device (#110163) 2023-09-28 17:34:46 +00:00
jit_hooks
lazy [LTC] correct AdaptiveAvgPool3d channel dim index for shape inference (#109822) 2023-09-29 22:54:12 +00:00
mobile
nn Ensure fill_ works when value is a view of self (#109835) 2023-09-26 17:12:48 +00:00
onnx add fp16 support for gemm (#99498) 2023-09-28 01:03:50 +00:00
onnx_caffe2 Revert "[ONNX] Remove the depreacated function _export (#109763)" 2023-09-25 17:47:21 +00:00
optim Add capturable ASGD impl (#107857) 2023-09-07 06:30:30 +00:00
package
profiler [profiler] Show shapes for lists of tensors in chrome traces #109263 (#109751) 2023-09-26 01:03:54 +00:00
quantization [quant] Enable quantization for wav2letter (#109830) 2023-09-29 00:47:34 +00:00
scripts
test_img
torch_np MAINT: pytorchify torch._numpy tests: core/ and fft/ (#109815) 2023-09-29 22:36:13 +00:00
typing Improve type annotations for jit.script (#108782) 2023-09-13 19:20:25 +00:00
_test_bazel.py
allowlist_for_publicAPI.json
conftest.py
create_dummy_torchscript_model.py
delete.py
HowToWriteTestsUsingFileCheck.md
linear.py
load_torchscript_model.py
minioptest_failures_dict.json [generate_opcheck_tests] Enable using same failures_dict for multiple testclasses (#110164) 2023-09-28 17:56:45 +00:00
mkl_verbose.py
mkldnn_verbose.py
run_doctests.sh
run_test.py [BE] Add sharding data by default to metrics (#110035) 2023-09-26 17:06:49 +00:00
simulate_nccl_errors.py
test_ao_sparsity.py
test_autocast.py Allow float dtype when Autocast CPU Disabled (#107348) 2023-09-01 00:49:44 +00:00
test_autograd.py Setup_context does not contain default values of forward() (#108561) 2023-09-19 16:23:52 +00:00
test_autograd_fallback.py
test_binary_ufuncs.py
test_bundled_images.py
test_bundled_inputs.py
test_comparison_utils.py
test_compile_benchmark_util.py
test_complex.py Don't fastpath conj copy when conj/neg bit mismatch (#108881) 2023-09-08 20:44:43 +00:00
test_content_store.py
test_cpp_api_parity.py Reland: Remove remaining global set_default_dtype calls from tests (#108088) 2023-09-07 03:04:34 +00:00
test_cpp_extensions_aot.py add torch.device to python type (#108116) 2023-09-18 02:20:30 +00:00
test_cpp_extensions_jit.py
test_cpp_extensions_open_device_registration.py
test_cuda.py [CUDA][CUDA Graphs] Fix CUDAGraph::reset function (#108896) 2023-09-11 19:49:31 +00:00
test_cuda_expandable_segments.py
test_cuda_multigpu.py
test_cuda_nvml_based_avail.py
test_cuda_primary_ctx.py
test_cuda_sanitizer.py
test_cuda_trace.py
test_custom_ops.py [generate_opcheck_tests] Enable using same failures_dict for multiple testclasses (#110164) 2023-09-28 17:56:45 +00:00
test_dataloader.py
test_datapipe.py Disable compile for massive data pipe test (#109063) 2023-09-12 00:15:52 +00:00
test_decomp.py [core IR] Add glu as a core decomposition (#110043) 2023-09-27 00:23:05 +00:00
test_deploy.py
test_determination.py
test_dispatch.py
test_dlpack.py Add DLPack bool support (#108486) 2023-09-08 17:55:33 +00:00
test_dynamic_shapes.py New export API with dynamic shape specifications instead of constraints (#108448) 2023-09-22 06:58:26 +00:00
test_expanded_weights.py
test_fake_tensor.py Define the public API for torch.distributed.fsdp (#109922) 2023-09-28 02:15:58 +00:00
test_flop_counter.py Flash Attention v2 (#105602) 2023-09-13 13:59:05 +00:00
test_foreach.py [foreach] Fix 0-size handling for real for real (#109402) 2023-09-26 17:38:20 +00:00
test_function_schema.py
test_functional_autograd_benchmark.py
test_functional_optim.py Cudagraphs support for compiled optimizers (#107504) 2023-08-31 20:47:18 +00:00
test_functionalization.py Make FunctionalTensor subclass to be more like functorch (interaction with ZeroTensor + Conjugate key) (#109023) 2023-09-22 07:09:04 +00:00
test_functionalization_of_rng_ops.py
test_futures.py
test_fx.py Replace node.meta source_fn with source_fn_stack (#108595) 2023-09-28 18:18:36 +00:00
test_fx_experimental.py
test_fx_passes.py
test_fx_reinplace_pass.py
test_hub.py
test_import_stats.py
test_indexing.py
test_itt.py
test_jit.py Reland: Remove remaining global set_default_dtype calls from tests (#108088) 2023-09-07 03:04:34 +00:00
test_jit_autocast.py
test_jit_disabled.py
test_jit_fuser.py
test_jit_fuser_legacy.py
test_jit_fuser_te.py [jit] Skip NNAPI, test_ivalue, CPU NNC tests in fbcode (#108937) 2023-09-11 22:42:30 +00:00
test_jit_legacy.py
test_jit_llga_fuser.py
test_jit_profiling.py
test_jit_simple.py
test_jit_string.py
test_jiterator.py
test_kernel_launch_checks.py
test_legacy_vmap.py
test_license.py
test_linalg.py add fp16 support for gemm (#99498) 2023-09-28 01:03:50 +00:00
test_logging.py
test_masked.py
test_maskedtensor.py
test_matmul_cuda.py Add better error messaging to scaled_mm (#108454) 2023-09-07 21:26:47 +00:00
test_meta.py add fp16 support for gemm (#99498) 2023-09-28 01:03:50 +00:00
test_metal.py
test_mkl_verbose.py
test_mkldnn.py add fp16 support for gemm (#99498) 2023-09-28 01:03:50 +00:00
test_mkldnn_fusion.py
test_mkldnn_verbose.py
test_mobile_optimizer.py
test_model_dump.py
test_model_exports_to_core_aten.py [aten decomp] Update sdpa decom (#108371) 2023-09-03 15:17:08 +00:00
test_module_init.py
test_modules.py
test_monitor.py
test_mps.py add fp16 support for gemm (#99498) 2023-09-28 01:03:50 +00:00
test_multiprocessing.py
test_multiprocessing_spawn.py
test_namedtensor.py
test_namedtuple_return_api.py
test_native_functions.py
test_native_mha.py
test_nestedtensor.py Pickle support for NT (#110219) 2023-09-29 15:30:06 +00:00
test_nn.py add Half support for BatchNorm on CPU (#102070) 2023-09-19 10:43:33 +00:00
test_nnapi.py
test_numba_integration.py
test_numpy_interop.py Implement numpy(force=True) (#109636) 2023-09-20 20:06:13 +00:00
test_openmp.py
test_ops.py [core IR] Add a core decomposition for floor_divide (#110046) 2023-09-26 08:39:21 +00:00
test_ops_fwd_gradients.py Reland: Remove remaining global set_default_dtype calls from tests (#108088) 2023-09-07 03:04:34 +00:00
test_ops_gradients.py Reland: Remove remaining global set_default_dtype calls from tests (#108088) 2023-09-07 03:04:34 +00:00
test_ops_jit.py Reland: Remove remaining global set_default_dtype calls from tests (#108088) 2023-09-07 03:04:34 +00:00
test_optim.py
test_out_dtype_op.py Allow registering decomps for HigherOrderOp; add decomp for out_dtype (#108080) 2023-08-31 03:15:38 +00:00
test_overrides.py Disabled UserWarnings for some public functions in torch.overrides (#109890) 2023-09-23 20:40:04 +00:00
test_package.py
test_per_overload_api.py
test_prims.py fix infinite loop with primtorch and .to(meta) (#109632) 2023-09-22 07:09:04 +00:00
test_proxy_tensor.py Add masked_select abstract impl (#110103) 2023-09-27 04:07:58 +00:00
test_pruning_op.py
test_public_bindings.py Rename torch._C._TensorBase to TensorBase (#109940) 2023-09-25 19:10:22 +00:00
test_python_dispatch.py Rename torch._C._TensorBase to TensorBase (#109940) 2023-09-25 19:10:22 +00:00
test_pytree.py Trace through pytree API with dynamo. (#108533) 2023-09-20 00:04:56 +00:00
test_quantization.py [quant][pt2e] Enable constant folding for quantize ops (#109343) 2023-09-27 06:04:45 +00:00
test_reductions.py
test_scatter_gather_ops.py
test_schema_check.py
test_segment_reductions.py
test_serialization.py When byteorder record is missing load as little endian by default (#108343) 2023-09-04 15:20:22 +00:00
test_set_default_mobile_cpu_allocator.py
test_shape_ops.py
test_show_pickle.py
test_sort_and_select.py
test_sparse.py fall back to eager on NotImplementedError (#107863) 2023-09-07 21:22:20 +00:00
test_sparse_csr.py Move has_triton to top level triton utils so that dynamo can also access (#109832) 2023-09-22 19:33:41 +00:00
test_sparse_semi_structured.py Minor fixes in semi-structured sparse code (#105595) 2023-09-25 14:06:08 +00:00
test_spectral_ops.py decomposition spectral ops fixes (#108360) 2023-09-09 04:48:09 +00:00
test_stateless.py
test_static_runtime.py
test_subclass.py
test_sympy_utils.py
test_tensor_creation_ops.py Reland: Remove remaining global set_default_dtype calls from tests (#108088) 2023-09-07 03:04:34 +00:00
test_tensorboard.py Add tensorboard to pip requirements (#109349) 2023-09-15 10:39:48 +00:00
test_tensorexpr.py
test_tensorexpr_pybind.py
test_testing.py [Inductor CUTLASS backend] Step 1: Inductor config for cuda / cutlass, util functions. (#107802) 2023-09-12 17:44:32 +00:00
test_throughput_benchmark.py
test_torch.py Rename torch._C._TensorBase to TensorBase (#109940) 2023-09-25 19:10:22 +00:00
test_transformers.py Adding Backward Support for NestedTensors and FlashAttention (#97485) 2023-09-29 21:34:47 +00:00
test_type_hints.py
test_type_info.py Add finfo properties for float8 dtypes (#109744) 2023-09-21 03:41:48 +00:00
test_type_promotion.py
test_typing.py
test_unary_ufuncs.py
test_utils.py Clean up test_external_module_register (#110254) 2023-09-29 17:02:13 +00:00
test_view_ops.py Implement numpy(force=True) (#109636) 2023-09-20 20:06:13 +00:00
test_vulkan.py
test_weak.py removing some redundant str splits (#106089) 2023-09-01 00:22:58 +00:00
test_xnnpack_integration.py