onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-06-06 00:03:22 +00:00

History

cao lei 2c525a79b1 Add new API KernelContext_GetScratchBuffer (#19809) ### Description Add new API KernelContext_GetScratchBuffer to get a scratch buffer from the kernel context. ### Motivation and Context The new API will be used in the ORT extension project for the GroupQueryAttention custom op.		2024-03-13 19:41:15 -07:00
..
contrib_ops	Update replacing MultiHeadAttention with GroupQueryAttention (#19882)	2024-03-13 14:10:52 -07:00
core	Add new API KernelContext_GetScratchBuffer (#19809)	2024-03-13 19:41:15 -07:00
python	Update replacing MultiHeadAttention with GroupQueryAttention (#19882)	2024-03-13 14:10:52 -07:00
test	Set seed for DynamicQuantizeMatMul tests (#19896)	2024-03-13 17:49:55 -07:00
tool/etw
wasm	[js/webgpu] Support capture and replay for jsep (#18989)	2024-01-30 18:28:03 -08:00
__init__.py	[ORT 1.17.0 release] Bump up version to 1.18.0 (#19170)	2024-01-17 11:18:32 -08:00
ReformatSource.ps1
ReformatSourcePython.bat
VSCodeCoverage.runsettings