onnxruntime/orttraining
Weixing Zhang b4b1c6440a
Enable ORT with CUDA 11 toolkit (#4168)
* ORT on CUDA 11

1. Seperate HOROVOD and MPI
2. Seperate NCCL from HOROVOD in CMakeLists.txt
2. Remove dependency on external cub
3. cudnnSetRNNDescriptor is changed in cuDNN 8.0

* polish the code about MPI/NCCL in CMakeLists.txt and build.py

* check CUDA version

* ${MPI_INCLUDE_DIRS} should be PUBLIC

* sm30, sm50 are deprecated in CUDA 11 Toolkit

* update change based on code review feedback.

* add sm_52

* improve MPI/NCCL build path

Co-authored-by: Weixing Zhang <wezhan@microsoft.com>
2020-06-15 08:47:03 -07:00
..
orttraining Enable ORT with CUDA 11 toolkit (#4168) 2020-06-15 08:47:03 -07:00
pytorch_frontend_examples Remove ORTModel from frontend API (#3825) 2020-05-11 18:20:33 -07:00
tools Remove evaluation output from training e2e test baseline data. (#4092) 2020-06-01 15:06:21 -07:00