onnxruntime/onnxruntime
Hariharan Seshadri 42399dfd2b
Fix a potential race in the CUDA TopK kernel (#19917)
### Description
If `K` is supplied as a tensor input, the kernel updates a mutable
member of the `TopK` class and bases the compute on that member. This
is likely to cause data races when concurrent `Run()` calls supply
different `K` values.
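
The pattern described above can be illustrated with a minimal CPU-only sketch. It is not the actual ONNX Runtime code: `TopKSketch`, `ComputeRacy`, `ComputeLocal`, and `LargestK` are hypothetical names chosen for illustration, and the real kernel runs on CUDA. The sketch only assumes what the description states, namely that a per-call `K` was stored in a mutable member shared by all calls.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <functional>
#include <vector>

// Hypothetical stand-in for a kernel object shared by concurrent Run() calls.
class TopKSketch {
 public:
  // Racy pattern: the method is const, yet it writes the per-call K into a
  // mutable member and then reads it back. Another thread may overwrite k_
  // between the write and the read.
  std::vector<float> ComputeRacy(const std::vector<float>& input, int64_t k) const {
    k_ = k;                      // shared write: a data race under concurrency
    return LargestK(input, k_);  // may observe a different call's K
  }

  // Safer pattern: keep the per-call K in a local variable, so every call
  // works on its own copy and no shared state is modified.
  std::vector<float> ComputeLocal(const std::vector<float>& input, int64_t k) const {
    const int64_t k_local = k;
    return LargestK(input, k_local);
  }

 private:
  // Returns the k largest values in descending order (all values if k is larger).
  static std::vector<float> LargestK(std::vector<float> values, int64_t k) {
    assert(k >= 0);
    const std::size_t count = std::min(static_cast<std::size_t>(k), values.size());
    const auto middle = values.begin() + static_cast<std::ptrdiff_t>(count);
    std::partial_sort(values.begin(), middle, values.end(), std::greater<float>());
    values.resize(count);
    return values;
  }

  mutable int64_t k_ = 0;  // shared across calls; the source of the race
};
```

Both methods return the same result when called from a single thread; the difference only shows up when several threads call into the same object with different `K` values.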


### Motivation and Context
Fix potential race in CUDA TopK kernel
2024-03-14 18:13:47 -07:00
contrib_ops Update replacing MultiHeadAttention with GroupQueryAttention (#19882) 2024-03-13 14:10:52 -07:00
core Fix a potential race in the CUDA TopK kernel (#19917) 2024-03-14 18:13:47 -07:00
python [Cuda] Demo multiple cuda graphs and user compute stream (#19883) 2024-03-14 13:48:37 -07:00
test [MLAS][AArch64] SQ4BitGemm CompInt8 multi-block implementation (#19826) 2024-03-14 13:05:42 -07:00
tool/etw
wasm [js/webgpu] Support capture and replay for jsep (#18989) 2024-01-30 18:28:03 -08:00
__init__.py [ORT 1.17.0 release] Bump up version to 1.18.0 (#19170) 2024-01-17 11:18:32 -08:00
ReformatSource.ps1
ReformatSourcePython.bat
VSCodeCoverage.runsettings