onnxruntime/tools/ci_build
Yifan Li 5c3c7643db
Update range of gpu arch (#23309)
### Description
<!-- Describe your changes. -->
* Remove deprecated gpu arch to control nuget/python package size
(latest TRT supports sm75 Turing and newer arch)
* Add 90 to support blackwell series in next release (86;89 not
considered as adding them will rapidly increase package size)

| arch_range | Python-cuda12 | Nuget-cuda12 |
| -------------- |
------------------------------------------------------------ |
---------------------------------- |
| 60;61;70;75;80 | Linux: 279MB Win: 267MB | Linux: 247MB Win: 235MB |
| 75;80 | Linux: 174MB Win: 162MB | Linux: 168MB Win: 156MB |
| **75;80;90** | **Linux: 299MB Win: 277MB** | **Linux: 294MB Win:
271MB** |
| 75;80;86;89 | [Linux: MB Win:
390MB](https://aiinfra.visualstudio.com/Lotus/_build/results?buildId=647457&view=results)
| Linux: 416MB Win: 383MB |
| 75;80;86;89;90 | [Linux: MB Win:
505MB](https://aiinfra.visualstudio.com/Lotus/_build/results?buildId=646536&view=results)
| Linux: 541MB Win: 498MB |

### Motivation and Context
<!-- - Why is this change required? What problem does it solve?
- If it fixes an open issue, please link to the issue here. -->

Callout: While adding sm90 support, the build of cuda11.8+cudnn8 will be
dropped in the coming ORT release,
as the build has issue with blackwell (mentioned in comments) and demand
on cuda 11 is minor, according to internal ort-cuda11 repo.
2025-01-14 14:27:34 -08:00
..
github Update range of gpu arch (#23309) 2025-01-14 14:27:34 -08:00
requirements Update transformers test requirements (#22911) 2024-11-22 09:45:12 -08:00
__init__.py
amd_hipify.py fix issue when build with hipblasLt on rocm6.1 (#22553) 2024-10-28 13:57:08 +08:00
build.py Add VCPKG triplet files (#23298) 2025-01-09 16:18:51 -08:00
compile_triton.py
coverage.py
gen_def.py Initial WebGPU EP checkin (#22318) 2024-10-08 16:10:46 -07:00
get_docker_image.py Fix docker image layer caching to avoid redundant docker building and transient connection exceptions. (#21612) 2024-08-06 21:37:09 +08:00
logger.py
op_registration_utils.py
op_registration_validator.py
patch_manylinux.py
policheck_exclusions.xml
reduce_op_kernels.py Update ruff and clang-format versions (#21479) 2024-07-24 11:50:11 -07:00
replace_urls_in_deps.py Update ruff and clang-format versions (#21479) 2024-07-24 11:50:11 -07:00
set-trigger-rules.py Cleanup code (#22827) 2024-11-19 14:13:33 -08:00
update_tsaoptions.py
upload_python_package_to_azure_storage.py Update ruff and clang-format versions (#21479) 2024-07-24 11:50:11 -07:00