onnxruntime/onnxruntime
Changming Sun b7ef81a034
Move Linux GPU CI pipeline to A10 (#23235)
Move Linux GPU CI pipeline to A10 machines which are more advanced.
Retire onnxruntime-Linux-GPU-T4 machine pool.
Disable run_lean_attention test because the new machines do not have
enough shared memory.

```
skip loading trt attention kernel fmha_mhca_fp16_128_256_sm86_kernel because no enough shared memory
[E:onnxruntime:, sequential_executor.cc:505 ExecuteKernel] Non-zero status code returned while running MultiHeadAttention node. Name:'MultiHeadAttention_0' Status Message: CUDA error cudaErrorInvalidValue:invalid argument
```
2025-01-04 19:11:37 -08:00
..
contrib_ops [CUDA] Make cubins const (#23225) 2024-12-31 16:20:21 -08:00
core [webgpu] Add kernel type to profile info (#23167) 2025-01-03 14:28:48 -08:00
lora Accomodate BE platforms. Make sure we always write flatbuffers LE (#22375) 2024-10-11 09:14:44 -07:00
python Integrate onnx 1.17.0 (#21897) 2024-12-24 09:02:02 -08:00
test Move Linux GPU CI pipeline to A10 (#23235) 2025-01-04 19:11:37 -08:00
tool/etw
wasm [WebNN] Fixed WebNN Module undefined issue (#22795) 2024-11-11 21:31:24 -08:00
__init__.py bumps up version in main from 1.20 -> 1.21 (#22482) 2024-10-17 12:32:35 -07:00
ReformatSource.ps1
ReformatSourcePython.bat
VSCodeCoverage.runsettings