onnxruntime/tools/ci_build
Tianlei Wu 94b1791974
Upgrade CUTLASS to v2.11 and add sequence length threshold for cutlass FMHA (#14401)
### Description
Add sequence length threshold for triggering cutlass FMHA in FP32. See
performance test results in
https://github.com/microsoft/onnxruntime/pull/14343 to see how this
threshold is selected.

Upgrade cutlass to v2.11 and update deps.txt and cgmanifest for nuget
pipeline build (test build:
https://aiinfra.visualstudio.com/Lotus/_build/results?buildId=268574&view=results)
2023-01-25 09:43:48 -08:00
..
github Upgrade CUTLASS to v2.11 and add sequence length threshold for cutlass FMHA (#14401) 2023-01-25 09:43:48 -08:00
__init__.py
amd_hipify.py ROCm header path updates (#14170) 2023-01-16 10:28:13 +08:00
build.py Unit test modification for TensorRT EP (#14339) 2023-01-18 21:30:19 -08:00
clean_docker_image_cache.py
coverage.py
gen_def.py Add CANN EP (#12416) 2022-09-22 14:53:40 -07:00
get_docker_image.py Improve dependency management (#13523) 2022-12-01 09:51:59 -08:00
logger.py
op_registration_utils.py Update kernel matching logic: decouple from op schemas and remove kernel def hashes (#12791) 2022-09-20 14:24:59 -07:00
op_registration_validator.py Update CUDA ArgMin/ArgMax op kernels to have end version 11 since opset 12+ is not supported yet. (#13983) 2022-12-21 19:01:00 -05:00
policheck_exclusions.xml Exculde hipify option from policheck (#13431) 2022-10-25 16:35:16 +08:00
reduce_op_kernels.py Update kernel matching logic: decouple from op schemas and remove kernel def hashes (#12791) 2022-09-20 14:24:59 -07:00
replace_urls_in_deps.py Move C/C++ deps' URLs to deps.txt (#13769) 2022-11-29 18:06:35 -08:00
requirements.txt fix reshape fusion error in numpy 1.24 (#14231) 2023-01-11 10:37:41 -08:00
update_tsaoptions.py Add license header to some files. (#13074) 2022-09-23 18:46:02 -07:00
upload_python_package_to_azure_storage.py