onnxruntime/onnxruntime
cao lei 2c525a79b1
Add new API KernelContext_GetScratchBuffer (#19809)
### Description
Add a new API, KernelContext_GetScratchBuffer, that returns a scratch
buffer from the kernel context.


### Motivation and Context
The ORT extension project needs KernelContext_GetScratchBuffer so that
its GroupQueryAttention custom op can obtain a scratch buffer from the
kernel context.
2024-03-13 19:41:15 -07:00
contrib_ops Update replacing MultiHeadAttention with GroupQueryAttention (#19882) 2024-03-13 14:10:52 -07:00
core Add new API KernelContext_GetScratchBuffer (#19809) 2024-03-13 19:41:15 -07:00
python Update replacing MultiHeadAttention with GroupQueryAttention (#19882) 2024-03-13 14:10:52 -07:00
test Set seed for DynamicQuantizeMatMul tests (#19896) 2024-03-13 17:49:55 -07:00
tool/etw
wasm [js/webgpu] Support capture and replay for jsep (#18989) 2024-01-30 18:28:03 -08:00
__init__.py [ORT 1.17.0 release] Bump up version to 1.18.0 (#19170) 2024-01-17 11:18:32 -08:00
ReformatSource.ps1
ReformatSourcePython.bat
VSCodeCoverage.runsettings