pytorch/test/cpp
Nikita Shulga 80ea6955af Add cuda-11.3+clang9 build workflow (take 2)
This makes it possible to detect unused captures in GPU code lambdas (gcc does not support this diagnostic)

Remove unused opts lambda capture in `ProcessGroupMPI.cpp` and `Distributions.cu`

Fix sign-compare in nvfuser benchmark and ignore signed/unsigned comparison in nvfuser tests
Fixes https://github.com/pytorch/pytorch/issues/75475 by aliasing CMAKE_CUDA_HOST_COMPILER to C_COMPILER when clang is used
Pull Request resolved: https://github.com/pytorch/pytorch/pull/75293
Approved by: https://github.com/atalman, https://github.com/seemethere
2022-04-11 17:13:01 +00:00
api Extend jit::load to work on flatbuffer file; Take 2 (#75256) 2022-04-06 17:54:01 +00:00
c10d Add cuda-11.3+clang9 build workflow (take 2) 2022-04-11 17:13:01 +00:00
common
dist_autograd
jit Add cuda-11.3+clang9 build workflow (take 2) 2022-04-11 17:13:01 +00:00
lazy [LT] Support diagonal op (#75230) 2022-04-08 19:49:42 +00:00
lite_interpreter_runtime [PyTorch] Add codegen unboxing ability (#69881) 2022-03-01 23:28:13 +00:00
monitor torch/monitor: merge Interval and FixedCount stats (#72009) 2022-01-30 23:21:59 +00:00
profiler [Profiler] Prefer TSC to wall clock when available (#73855) 2022-03-13 18:29:06 +00:00
rpc
tensorexpr Enable TE fuser to support user defined operator (#73073) 2022-04-07 04:36:39 +00:00
__init__.py