pytorch/test
Pritam Damania bf85642c4c Remove lock from GraphTask::set_exception_without_signal. (#45867)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/45867

In most cases the lock ordering was hold a lock in local autograd and
then hold a lock in DistAutogradContext.

In case of `set_exception_without_signal` the lock order was in reverse and as
a result we saw potential deadlock issues in our TSAN tests. To fix this, I
removed the lock and instead just used std::atomic exchange.

In addition to this, I fixed TestE2E to ensure that we use the appropriate
timeout.

TestE2EProcessGroup was flaky for these two reasons and now is fixed.
ghstack-source-id: 113592709

Test Plan: waitforbuildbot.

Reviewed By: albanD

Differential Revision: D24120962

fbshipit-source-id: 12447b84ceae772b91e9a183c90d1e6340f44e66
2020-10-05 20:02:29 -07:00
..
backward_compatibility remove beta defaulting in smooth_l1_loss_backward. added to the bc whitelist (#45588) 2020-10-02 07:53:04 -07:00
bottleneck_test
cpp Remove lock from GraphTask::set_exception_without_signal. (#45867) 2020-10-05 20:02:29 -07:00
cpp_api_parity Support tuple inputs in NN Module test (#44853) 2020-09-28 22:05:05 -07:00
cpp_extensions Add barrier() at the end of init_process_group and new_group. (#45181) 2020-09-25 15:46:59 -07:00
custom_backend [JIT] Modify to_backend API so that it accepts wrapped modules (#43612) 2020-09-28 17:17:01 -07:00
custom_operator
distributed Rocm skip test cases (#45782) 2020-10-05 15:12:25 -07:00
error_messages
expect
fx [FX] Make output a non-special Node (#45599) 2020-10-02 17:08:17 -07:00
jit [jit] allow slicing multiple dimensions with indicies (#45239) 2020-10-05 15:03:54 -07:00
mobile
namespace_b torch.package - a way to package models and code (#45015) 2020-09-22 21:21:21 -07:00
onnx Revert D23398534: [pytorch][PR] [ONNX] Improve error handling for adaptive_pool 2020-10-05 15:16:59 -07:00
optim
package_a torch.package - a way to package models and code (#45015) 2020-09-22 21:21:21 -07:00
quantization [FX] Make Tracer.trace() just return a Graph (#45704) 2020-10-03 21:13:48 -07:00
scripts
test_img
type_hint_tests
HowToWriteTestsUsingFileCheck.md
module_a.py torch.package - a way to package models and code (#45015) 2020-09-22 21:21:21 -07:00
print_test_stats.py add circle ci job name dimension to report test stats (#45457) 2020-09-28 20:51:58 -07:00
run_test.py adding sharding option to run_test.py (#45583) 2020-10-02 11:21:51 -07:00
simulate_nccl_errors.py
test_autograd.py Avoid NaN values in torch.cdist backward for p<1 (#45720) 2020-10-05 16:19:29 -07:00
test_bundled_images.py
test_bundled_inputs.py
test_complex.py
test_cpp_api_parity.py
test_cpp_extensions_aot.py
test_cpp_extensions_jit.py
test_cuda.py Enable NamedTuple data type to work with DDP (#44220) 2020-10-02 13:33:08 -07:00
test_cuda_primary_ctx.py
test_dataloader.py Add ShuffleDataset with buffer (#45290) 2020-09-30 07:58:15 -07:00
test_determination.py
test_dispatch.py Align casing in test_dispatch with dispatch keys. (#44933) 2020-09-22 10:50:08 -07:00
test_distributions.py
test_expecttest.py
test_foreach.py Use MTA for amp grad unscaling, enforce op math type in MTA functors, and allow op lambdas (#44778) 2020-10-01 07:51:16 -07:00
test_function_schema.py
test_functional_autograd_benchmark.py
test_futures.py
test_fx.py [FX][WIP] Mutable Graph APIs (#45227) 2020-10-05 17:07:08 -07:00
test_indexing.py
test_jit.py Change type inferred from empty annotation (#45360) 2020-10-05 15:16:56 -07:00
test_jit_cuda_fuser.py remove skip annotations since we already disabled the tests wholesale (#45698) 2020-10-01 17:47:48 -07:00
test_jit_cuda_fuser_legacy.py Rename jobs and cli switches for testing GraphExecutor configurations to something a little bit more sensical. (#45715) 2020-10-05 11:43:28 -07:00
test_jit_cuda_fuser_profiling.py Rename jobs and cli switches for testing GraphExecutor configurations to something a little bit more sensical. (#45715) 2020-10-05 11:43:28 -07:00
test_jit_disabled.py
test_jit_fuser.py Adjust TF32 tests (#44240) 2020-09-24 10:25:58 -07:00
test_jit_fuser_legacy.py Rename jobs and cli switches for testing GraphExecutor configurations to something a little bit more sensical. (#45715) 2020-10-05 11:43:28 -07:00
test_jit_fuser_te.py Build shape expressions and remove outputs that are only used by aten::sizes (#45080) 2020-09-28 10:45:56 -07:00
test_jit_legacy.py Rename jobs and cli switches for testing GraphExecutor configurations to something a little bit more sensical. (#45715) 2020-10-05 11:43:28 -07:00
test_jit_profiling.py Rename jobs and cli switches for testing GraphExecutor configurations to something a little bit more sensical. (#45715) 2020-10-05 11:43:28 -07:00
test_jit_py3.py [JIT] Enable @unused syntax for ignoring properties (#45261) 2020-09-29 10:24:25 -07:00
test_jit_simple.py Rename jobs and cli switches for testing GraphExecutor configurations to something a little bit more sensical. (#45715) 2020-10-05 11:43:28 -07:00
test_jit_string.py
test_linalg.py Test torch.svd using complex float and double numbers (take 2) (#45795) 2020-10-03 11:33:28 -07:00
test_logging.py
test_mkldnn.py
test_mobile_optimizer.py [pytorch] Replace "blacklist" in test/test_mobile_optimizer.py (#45512) 2020-09-30 10:43:59 -07:00
test_multiprocessing.py
test_multiprocessing_spawn.py
test_namedtensor.py
test_namedtuple_return_api.py
test_native_functions.py Revert D23753711: [pytorch][PR] Add foreach APIs for binary ops with ScalarList 2020-09-24 11:55:49 -07:00
test_nn.py some documentation and style fixes to smooth_l1_loss (#45587) 2020-10-02 07:47:31 -07:00
test_numba_integration.py
test_op_aliases.py Migrate addr from the TH to Aten (CPU) (#44364) 2020-09-25 01:18:09 -07:00
test_openmp.py
test_ops.py Revert D24027761: Update backward definition for more operators and reenable tests in test_ops.py 2020-10-02 18:52:57 -07:00
test_optim.py Update test_multi_tensor_optimizers test (#45510) 2020-09-30 08:59:18 -07:00
test_overrides.py Complex gradcheck logic (#43208) 2020-09-20 22:05:04 -07:00
test_package.py [package] Add dependency viz (#45214) 2020-09-28 15:38:41 -07:00
test_profiler.py Source code level attribution in profiler (#43898) 2020-09-30 00:57:35 -07:00
test_pruning_op.py
test_quantization.py
test_serialization.py
test_show_pickle.py
test_sparse.py Sparse softmax support (CUDA) (#42307) 2020-09-24 00:07:30 -07:00
test_spectral_ops.py torch.fft: Multi-dimensional transforms (#44550) 2020-09-23 22:09:58 -07:00
test_static_runtime.py [static runtime] Split out graph preparation from runtime (#44131) 2020-09-28 13:01:23 -07:00
test_tensor_creation_ops.py Rocm skip test cases (#45782) 2020-10-05 15:12:25 -07:00
test_tensorboard.py
test_tensorexpr.py
test_throughput_benchmark.py
test_torch.py Rocm skip test cases (#45782) 2020-10-05 15:12:25 -07:00
test_type_hints.py
test_type_info.py
test_type_promotion.py [fix] type promotion atan2 (#43466) 2020-09-23 22:23:05 -07:00
test_unary_ufuncs.py [numpy] Add torch.nan_to_num (#44592) 2020-10-05 01:38:56 -07:00
test_utils.py Re-land: Add callgrind collection to Timer #44717 (#45586) 2020-09-30 17:43:06 -07:00
test_vmap.py
test_vulkan.py
test_xnnpack_integration.py