onnxruntime/cmake/external
Tianlei Wu 8b4517218b
Remove USE_CUTLASS flag (#19271)
### Description
Since Cutlass can be built with CUDA 11.4 (the minimum CUDA version for
the onnxruntime CUDA build), there is no need for a flag to disable
cutlass.

Changes:
(1) Revert https://github.com/microsoft/onnxruntime/pull/18761
(2) Remove the condition to build cutlass.
(3) Fix a few build errors and warnings found while testing the CUDA 11.4 build.

Note that SM 89 and 90 (including fp8) require CUDA 11.8 or later.
Flash attention and cutlass fused multihead attention will not be built
for CUDA < 11.6. It is recommended to build with CUDA 11.8 or above if
you want to support the latest GPUs.
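
The version rules above can be summarized in a small shell sketch. The function `cuda_feature_summary` and its output keys are hypothetical names used only for illustration; they are not part of onnxruntime or `build.sh`. It assumes the attention kernels are otherwise enabled and only encodes the thresholds stated in this description (11.4 minimum, 11.6 for the attention kernels, 11.8 for SM 89/90 and fp8).

```sh
# Hypothetical helper, not part of onnxruntime or build.sh.
# Prints which features this description expects for a CUDA toolkit
# version given as "major.minor" (for example 11.4).
cuda_feature_summary() {
    cfs_ver="$1"
    # Treat a bare major version such as "12" as "12.0".
    case "$cfs_ver" in
        *.*) ;;
        *) cfs_ver="$cfs_ver.0" ;;
    esac
    cfs_major="${cfs_ver%%.*}"
    cfs_minor="${cfs_ver#*.}"
    cfs_minor="${cfs_minor%%.*}"
    # Encode the version as one integer, e.g. 11.4 -> 1104.
    cfs_num=$((cfs_major * 100 + cfs_minor))

    # 11.4 is the minimum CUDA version for the onnxruntime CUDA build.
    if [ "$cfs_num" -lt 1104 ]; then
        echo "CUDA $1 is below 11.4, the minimum for the CUDA build" >&2
        return 1
    fi

    # Cutlass itself builds from CUDA 11.4 onwards.
    echo "cutlass=yes"

    # Flash attention and cutlass fused multihead attention need CUDA >= 11.6.
    if [ "$cfs_num" -ge 1106 ]; then
        echo "fused_attention=yes"
    else
        echo "fused_attention=no"
    fi

    # SM 89 and 90 (including fp8) need CUDA >= 11.8.
    if [ "$cfs_num" -ge 1108 ]; then
        echo "sm89_sm90_fp8=yes"
    else
        echo "sm89_sm90_fp8=no"
    fi
}
```

For example, `cuda_feature_summary 11.4` prints `cutlass=yes`, `fused_attention=no` and `sm89_sm90_fp8=no`, which matches a CUDA 11.4 build that also passes `--disable_types float8`.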

This change should be included in 1.17.0; otherwise, the release branch
might fail to build with CUDA 11.4.

Tests:
(1) Build with flash attention and efficient attention off: **passed**
(2) Build with CUDA 11.4: **passed**

Example build command used on Ubuntu 20.04:
```
export CUDA_HOME=/usr/local/cuda-11.4
export CUDNN_HOME=/usr/lib/x86_64-linux-gnu/
export CUDACXX=/usr/local/cuda-11.4/bin/nvcc

sh build.sh --config Release  --build_shared_lib --parallel  --use_cuda --cuda_version 11.4 \
            --cuda_home $CUDA_HOME --cudnn_home $CUDNN_HOME --build_wheel --skip_tests \
            --cmake_extra_defines CMAKE_CUDA_ARCHITECTURES=80 \
            --disable_types float8
```

2024-01-25 16:57:58 -08:00
emsdk@4e2496141e update to emsdk-3.1.51 (#18844) 2024-01-12 16:04:33 -08:00
git.Win32.2.41.03.patch Fix ability to use patch on Windows CI machines (#18356) 2023-11-11 07:32:14 +10:00
libprotobuf-mutator@7a2ed51a6b
onnx@b86cc54efc use onnx rel-1.15.0, update cgman, cmake/external and requirement hash (#18177) 2023-10-31 14:58:21 -07:00
abseil-cpp.cmake Update abseil to a release tag and register neural_speed (#19255) 2024-01-24 14:37:39 -08:00
abseil-cpp.natvis Update abseil to a release tag and register neural_speed (#19255) 2024-01-24 14:37:39 -08:00
composable_kernel.cmake [ROCm] Update CK version (#17628) 2023-11-13 15:43:38 -08:00
cutlass.cmake Remove USE_CUTLASS flag (#19271) 2024-01-25 16:57:58 -08:00
dml.cmake Update DirectML nuget version to 1.13.1 (#19122) 2024-01-15 19:04:41 -08:00
dnnl.cmake [DNNL] add Arm Compute Library (ACL) backend for dnnl execution provider (#15847) 2023-12-01 09:16:44 -08:00
eigen.cmake Fix ability to use patch on Windows CI machines (#18356) 2023-11-11 07:32:14 +10:00
extensions.cmake Update C/C++ dependencies: abseil, date, nsync, googletest, wil, mp11, cpuinfo and safeint (#15470) 2023-09-08 13:35:04 -07:00
find_snpe.cmake
FindNumPy.cmake
helper_functions.cmake
ipp-crypto.cmake
mimalloc.cmake
neural_speed.cmake Update abseil to a release tag and register neural_speed (#19255) 2024-01-24 14:37:39 -08:00
onnx_minimal.cmake
onnx_protobuf.natvis Fix visualization issues with Attribute/Tensor protos (#17188) 2023-08-16 13:56:51 -07:00
onnxruntime_external_deps.cmake Download protoc for all Apple host builds, remove protoc build from iOS packaging pipeline. (#19209) 2024-01-19 15:30:09 -08:00
protobuf_function.cmake
pybind11.cmake
pyxir.cmake
tvm.cmake
wil.cmake Rework WIL dependency retrieval/usage (#17130) 2023-08-15 09:11:46 -07:00
xnnpack.cmake Enable RISC-V 64-bit Cross-Compiling Support for ONNX Runtime on Linux (#19238) 2024-01-24 16:27:05 -08:00