onnxruntime/tools
Tianlei Wu 94b1791974
Upgrade CUTLASS to v2.11 and add sequence length threshold for cutlass FMHA (#14401)
### Description
Add a sequence length threshold for triggering CUTLASS FMHA in FP32. The
performance test results in
https://github.com/microsoft/onnxruntime/pull/14343 show how this
threshold was selected.

Upgrade CUTLASS to v2.11 and update deps.txt and cgmanifest for the NuGet
pipeline build (test build:
https://aiinfra.visualstudio.com/Lotus/_build/results?buildId=268574&view=results).
2023-01-25 09:43:48 -08:00
android_custom_build Android package custom build script update (#14403) 2023-01-25 09:19:05 -08:00
ci_build Upgrade CUTLASS to v2.11 and add sequence length threshold for cutlass FMHA (#14401) 2023-01-25 09:43:48 -08:00
doc
nuget Openvino ep 2022.3 v4.3 (#14210) 2023-01-11 16:31:26 -08:00
perf_view
python Free OrtStatus in ASSERT_ORT_STATUS_OK, make run_android_emulator.py work with newer JDK version (#14369) 2023-01-20 09:27:47 -08:00