onnxruntime/tools
Weixing Zhang b4b1c6440a
Enable ORT with CUDA 11 toolkit (#4168)
* ORT on CUDA 11

1. Seperate HOROVOD and MPI
2. Seperate NCCL from HOROVOD in CMakeLists.txt
2. Remove dependency on external cub
3. cudnnSetRNNDescriptor is changed in cuDNN 8.0

* polish the code about MPI/NCCL in CMakeLists.txt and build.py

* check CUDA version

* ${MPI_INCLUDE_DIRS} should be PUBLIC

* sm30, sm50 are deprecated in CUDA 11 Toolkit

* update change based on code review feedback.

* add sm_52

* improve MPI/NCCL build path

Co-authored-by: Weixing Zhang <wezhan@microsoft.com>
2020-06-15 08:47:03 -07:00
..
ci_build Enable ORT with CUDA 11 toolkit (#4168) 2020-06-15 08:47:03 -07:00
nuget Enable .NET Core 2.0 and .NET Framework 4.6.1 in Microsoft.AI.MachineLearning NuGet package (#4125) 2020-06-09 09:08:19 -07:00
perf_util New perf metric - e2e throughput (#4085) 2020-06-01 12:11:34 -07:00
python add script to support update nodejs binding version (#4164) 2020-06-09 13:12:55 -07:00