onnxruntime/tools/ci_build/github
Tianlei Wu b3fc9b5a0e
[CUDA] upgrade cutlass to 3.5.0 (#20940)
### Description
Upgrade cutlass to 3.5 to fix build errors using CUDA 12.4 or 12.5 in
Windows
- [x] Upgrade cutlass to 3.5.0.
- [x] Fix flash attention build error with latest cutlass header files
and APIs. This fix is provided by @wangyems.
- [x] Update efficient attention to use new cutlass fmha interface.
- [x] Patch cutlass to fix `hrsqrt` not found error for sm < 53.
- [x] Disable TF32 Staged Accumulation to fix blkq4_fp16_gemm_sm80_test
build error for cuda 11.8 to 12.3.
- [x] Disable TRT 10 deprecate warnings. 

The following are not included in this PR:
* TRT provider replaces the deprecated APIs.
* Fix blkq4_fp16_gemm_sm80_test build error for cuda 12.4 or 12.5. This
test is not built by default unless you add `--cmake_extra_defines
onnxruntime_ENABLE_CUDA_EP_INTERNAL_TESTS=ON` in build command.

To integrate to rel-1.18.1: Either bring in other changes (like onnx
1.16.1), or generate manifest and upload a new ONNX Runtime Build Time
Deps artifact based on rel-1.18.1.

### Motivation and Context
https://github.com/microsoft/onnxruntime/issues/19891
https://github.com/microsoft/onnxruntime/issues/20924
https://github.com/microsoft/onnxruntime/issues/20953
2024-06-11 13:32:15 -07:00
..
android Remove deprecated "mobile" packages (#20941) 2024-06-07 16:20:32 -05:00
apple Remove deprecated "mobile" packages (#20941) 2024-06-07 16:20:32 -05:00
azure-pipelines [CUDA] upgrade cutlass to 3.5.0 (#20940) 2024-06-11 13:32:15 -07:00
js Remove deprecated "mobile" packages (#20941) 2024-06-07 16:20:32 -05:00
linux Updating cudnn from 8 to 9 on exsiting cuda 12 docker image (#20925) 2024-06-11 09:37:16 -07:00
pai fix rocm ci pipeline (#19525) 2024-02-15 00:02:08 -08:00
windows Fix Nuget Cuda pipeline package pipeline (#20741) 2024-05-24 09:15:57 -07:00
Doxyfile_csharp.cfg