onnxruntime/onnxruntime/core
cao lei 966fa74597
Add 2 C API for ort extension (#19808)
### Description
Add 2 C APIs for the ORT extension project:
- KernelInfo_GetAllocator
- OrtCustomOp::GetMayInplace


### Motivation and Context
Add 2 C APIs for the ORT extension project, which will use them
for the GroupQueryAttention custom op.
2024-03-14 06:00:41 -07:00
..
codegen Replace some ORT_ENFORCE with ORT_THROW_IF_ERROR (#18812) 2023-12-14 10:14:22 -08:00
common Define recomputable op list with domain/opset (#19722) 2024-03-07 09:12:12 +08:00
dll
dlpack
eager
flatbuffers Introduce a Nominal Checkpoint for On-Device Training (#19232) 2024-01-30 22:11:25 -08:00
framework Refine logging for execution plan print (#19777) 2024-03-14 16:31:32 +08:00
graph Add SpaceToDepth and DepthToSpace CUDA NHWC Ops (#19646) 2024-03-06 12:35:55 -08:00
language_interop_ops
mickey Adding cuda kernel (optimized for sm80) for block-wise 4b quantized float 16 GEMM. (#18619) 2024-03-05 09:37:45 -08:00
mlas [MLAS AArch64] SQNBitGemm optimization (#19272) 2024-01-30 14:29:12 -08:00
optimizer Add support for LRN NHWC OPs (#19866) 2024-03-13 17:52:07 -07:00
platform fix memory mapping on Windows (#19623) 2024-02-25 08:50:45 -08:00
providers Add new API KernelContext_GetScratchBuffer (#19809) 2024-03-13 19:41:15 -07:00
quantization
session Add 2 C API for ort extension (#19808) 2024-03-14 06:00:41 -07:00
util Adding cuda kernel (optimized for sm80) for block-wise 4b quantized float 16 GEMM. (#18619) 2024-03-05 09:37:45 -08:00