mirror of
https://github.com/saymrwulf/onnxruntime.git
synced 2026-05-21 21:52:11 +00:00
### Description Upgrade cutlass to 3.5 to fix build errors using CUDA 12.4 or 12.5 in Windows - [x] Upgrade cutlass to 3.5.0. - [x] Fix flash attention build error with latest cutlass header files and APIs. This fix is provided by @wangyems. - [x] Update efficient attention to use new cutlass fmha interface. - [x] Patch cutlass to fix `hrsqrt` not found error for sm < 53. - [x] Disable TF32 Staged Accumulation to fix blkq4_fp16_gemm_sm80_test build error for cuda 11.8 to 12.3. - [x] Disable TRT 10 deprecate warnings. The following are not included in this PR: * TRT provider replaces the deprecated APIs. * Fix blkq4_fp16_gemm_sm80_test build error for cuda 12.4 or 12.5. This test is not built by default unless you add `--cmake_extra_defines onnxruntime_ENABLE_CUDA_EP_INTERNAL_TESTS=ON` in build command. To integrate to rel-1.18.1: Either bring in other changes (like onnx 1.16.1), or generate manifest and upload a new ONNX Runtime Build Time Deps artifact based on rel-1.18.1. ### Motivation and Context https://github.com/microsoft/onnxruntime/issues/19891 https://github.com/microsoft/onnxruntime/issues/20924 https://github.com/microsoft/onnxruntime/issues/20953 |
||
|---|---|---|
| .. | ||
| generated | ||
| cgmanifest.json | ||
| generate_cgmanifest.py | ||
| print_submodule_info.py | ||
| README.md | ||
CGManifest Files
This directory contains CGManifest (cgmanifest.json) files. See here for details.
cgmanifests/generated/cgmanifest.json
This file contains generated CGManifest entries.
It covers these dependencies:
- git submodules
- dependencies from the Dockerfile
tools/ci_build/github/linux/docker/Dockerfile.manylinux2014_cuda11 - the entries in ../cmake/deps.txt
If any of these dependencies change, this file should be updated. When updating, please regenerate instead of editing manually.
How to Generate
- Change to the repository root directory.
- Ensure the git submodules are checked out and up to date. For example, with:
$ git submodule update --init --recursive - Run the generator script:
$ python cgmanifests/generate_cgmanifest.py --username <xxx> --token <your_access_token>
Please supply your github username and access token to the script. If you don't have a token, you can generate one at https://github.com/settings/tokens. This is for authenticating with Github REST API so that you would not hit the rate limit.
cgmanifests/cgmanifest.json
This file contains non-generated CGManifest entries. Please edit directly as needed.