onnxruntime/tools
Edward Chen 9810b9e02b
Reduce amount of compiled CUDA device code (#6118)
Move CudaKernel from cuda_common.h to a new separate header, cuda_kernel.h. Update include sites to use cuda_kernel.h instead if they need CudaKernel. Inclusions of cuda_common.h are now more lightweight.

Make corresponding changes for ROCM execution provider code.

Other minor cleanup.
2020-12-14 15:27:40 -08:00
..
ci_build Reduce amount of compiled CUDA device code (#6118) 2020-12-14 15:27:40 -08:00
nuget Enable C# .NET5 for WinML (#6120) 2020-12-14 15:05:15 -08:00
perf_util Update mysql-connector-java (#5802) 2020-11-16 14:09:14 -08:00
python Expand the documentation on using compiling EPs with a minimal build (#5893) 2020-12-02 09:12:36 +10:00
test Add Trilu custom op (#4537) 2020-08-17 14:42:26 -07:00