onnxruntime/tools
Changming Sun b7ef81a034
Move Linux GPU CI pipeline to A10 (#23235)
Move Linux GPU CI pipeline to A10 machines which are more advanced.
Retire onnxruntime-Linux-GPU-T4 machine pool.
Disable run_lean_attention test because the new machines do not have
enough shared memory.

```
skip loading trt attention kernel fmha_mhca_fp16_128_256_sm86_kernel because no enough shared memory
[E:onnxruntime:, sequential_executor.cc:505 ExecuteKernel] Non-zero status code returned while running MultiHeadAttention node. Name:'MultiHeadAttention_0' Status Message: CUDA error cudaErrorInvalidValue:invalid argument
```
2025-01-04 19:11:37 -08:00
..
android_custom_build Update all JDK version to 17 (#22786) 2024-11-12 11:42:18 -08:00
ci_build Move Linux GPU CI pipeline to A10 (#23235) 2025-01-04 19:11:37 -08:00
doc Update ruff and clang-format versions (#21479) 2024-07-24 11:50:11 -07:00
nuget OVEP 1.21.0 Development Updates (#23080) 2024-12-11 22:26:32 -08:00
perf_view
python Refactor emulator start and stop functions for clarity and efficiency (#22861) 2024-12-02 09:29:17 -08:00
scripts Cleanup code (#22827) 2024-11-19 14:13:33 -08:00