onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-06-09 00:30:53 +00:00

History

Changming Sun b7ef81a034 Move Linux GPU CI pipeline to A10 (#23235 ) Move Linux GPU CI pipeline to A10 machines which are more advanced. Retire onnxruntime-Linux-GPU-T4 machine pool. Disable run_lean_attention test because the new machines do not have enough shared memory. ``` skip loading trt attention kernel fmha_mhca_fp16_128_256_sm86_kernel because no enough shared memory [E:onnxruntime:, sequential_executor.cc:505 ExecuteKernel] Non-zero status code returned while running MultiHeadAttention node. Name:'MultiHeadAttention_0' Status Message: CUDA error cudaErrorInvalidValue:invalid argument ```		2025-01-04 19:11:37 -08:00
..
contrib_ops	[CUDA] Make cubins const (#23225 )	2024-12-31 16:20:21 -08:00
core	[webgpu] Add kernel type to profile info (#23167 )	2025-01-03 14:28:48 -08:00
lora	Accomodate BE platforms. Make sure we always write flatbuffers LE (#22375 )	2024-10-11 09:14:44 -07:00
python	Integrate onnx 1.17.0 (#21897 )	2024-12-24 09:02:02 -08:00
test	Move Linux GPU CI pipeline to A10 (#23235 )	2025-01-04 19:11:37 -08:00
tool/etw
wasm	[WebNN] Fixed WebNN Module undefined issue (#22795 )	2024-11-11 21:31:24 -08:00
__init__.py	bumps up version in main from 1.20 -> 1.21 (#22482 )	2024-10-17 12:32:35 -07:00
ReformatSource.ps1
ReformatSourcePython.bat
VSCodeCoverage.runsettings