C API
Q: Why have a C API? Why not just live in a C++ world?
A: We want to distribute onnxruntime as a DLL that .NET languages can call through P/Invoke, and P/Invoke can only bind to plain C entry points. A C API is the only option we have.
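To make that concrete, here is a minimal sketch of what a P/Invoke-friendly export looks like. The MY_API macro and MyAdd function are hypothetical illustrations, not onnxruntime symbols:

```c
/* Hypothetical example; MY_API and MyAdd are not onnxruntime symbols. */
#ifdef _WIN32
  #define MY_API __declspec(dllexport)
#else
  #define MY_API __attribute__((visibility("default")))
#endif

#ifdef __cplusplus
extern "C" {  /* suppress C++ name mangling so the symbol name is predictable */
#endif

MY_API int MyAdd(int a, int b);  /* plain C signature: bindable via [DllImport] */

#ifdef __cplusplus
}
#endif
```

A C# caller can then declare `[DllImport("mylib")] static extern int MyAdd(int a, int b);` and call it directly.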
Q: Is it only for .NET?
A: No. It is designed for:
- Creating language bindings for onnxruntime, e.g. C#, Python, Java, ...
- Dynamic linking in general, which brings benefits such as avoiding diamond dependency problems.
Q: Can I export C++ types and functions across DLL or shared object (.so) boundaries?
A: You can, but it is not good practice: C++ has no standard ABI, so classes, templates, and exceptions may not survive the boundary when the library and its consumer are built with different compilers or standard library versions. We won't do it in this project.
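What a C API does instead is hide the C++ objects behind opaque handles. A minimal sketch of the pattern, with hypothetical names rather than the real onnxruntime symbols:

```c
/* Opaque-handle pattern; the MySession names are illustrative only. */
typedef struct MySession MySession;  /* layout stays hidden inside the library */

MySession* MySessionCreate(const char* model_path);  /* constructor analogue */
int        MySessionRun(MySession* session);         /* method analogue */
void       MySessionDestroy(MySession* session);     /* destructor analogue;
                                                        frees memory on the
                                                        library's side */
```

Because callers only ever hold a pointer, the library can change its internal C++ types freely without breaking the ABI.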
What's inside
- Creating an InferenceSession from an on-disk model file and a set of SessionOptions.
- Registering custom loggers.
- Registering custom allocators.
- Registering predefined execution providers and setting their priority order. ONNXRuntime has a set of predefined execution providers, such as CUDA and MKLDNN. Users can register providers with their InferenceSession; the order of registration is also the order of preference (see the sketch after this list).
- Running a model with inputs. These inputs must be in CPU memory, not GPU memory. If the model has multiple outputs, users can specify which outputs they want.
- Converting an in-memory ONNX tensor, encoded in protobuf format, into a pointer that can be used as model input.
- Setting the thread pool size for each session.
- Dynamically loading custom ops.
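To illustrate the priority rule from the provider bullet above, here is a tiny self-contained model of it. The register_provider function is invented for illustration; the real registration calls are declared in onnxruntime_c_api.h:

```c
#include <stdio.h>

/* Conceptual model only, not real onnxruntime calls: it encodes just the
 * rule that earlier registration means higher preference. */
static const char* providers[8];
static int n_providers = 0;

static void register_provider(const char* name) {
  providers[n_providers++] = name;  /* earlier slot = tried first */
}

int main(void) {
  register_provider("CUDA");  /* preferred: gets first chance at each op */
  register_provider("CPU");   /* fallback for ops the CUDA provider lacks */
  for (int i = 0; i < n_providers; ++i)
    printf("priority %d: %s\n", i, providers[i]);
  return 0;
}
```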
How to use it
- Include onnxruntime_c_api.h.
- Call ONNXRuntimeInitialize.
- Create a session: ONNXRuntimeCreateInferenceSession(env, model_uri, nullptr, ...).
- Create a tensor:
  - ONNXRuntimeCreateAllocatorInfo
  - ONNXRuntimeCreateTensorWithDataAsONNXValue
- Call ONNXRuntimeRunInference.
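Putting those steps together, a minimal end-to-end sketch looks roughly like the following. The function names are the ones listed above, but the parameter lists, type names (ONNXEnv, ONNXSession, ONNXValue, ...), and enum values are assumptions for illustration; consult onnxruntime_c_api.h for the authoritative prototypes and check the status value each call returns:

```c
#include "onnxruntime_c_api.h"

/* Sketch only: parameter lists and type names below are assumed, and error
 * handling is omitted; see onnxruntime_c_api.h for the real prototypes. */
int main(void) {
  /* 1. Initialize the runtime once per process (log level/id assumed). */
  ONNXEnv* env = NULL;
  ONNXRuntimeInitialize(LOGGING_LEVEL_WARNING, "demo", &env);

  /* 2. Create a session from an on-disk model; NULL = default options. */
  ONNXSession* session = NULL;
  ONNXRuntimeCreateInferenceSession(env, "model.onnx", NULL, &session);

  /* 3. Describe where the input buffer lives: CPU memory, as required. */
  ONNXRuntimeAllocatorInfo* info = NULL;
  ONNXRuntimeCreateAllocatorInfo("Cpu", 0 /* flags assumed */, &info);

  /* 4. Wrap an existing float buffer as a tensor-typed ONNXValue without
   *    copying; the buffer must stay alive while the value is in use. */
  float data[4] = {1.f, 2.f, 3.f, 4.f};
  size_t shape[2] = {2, 2};
  ONNXValue* input = NULL;
  ONNXRuntimeCreateTensorWithDataAsONNXValue(info, data, sizeof(data),
                                             shape, 2, &input);

  /* 5. Run: bind model-specific input/output names to values. */
  const char* input_names[]  = {"X"};
  const char* output_names[] = {"Y"};
  ONNXValue* output = NULL;
  ONNXRuntimeRunInference(session, input_names,
                          (const ONNXValue* const*)&input, 1,
                          output_names, 1, &output);

  /* Read results from `output`, then release every object created above
   * with the matching release functions from the header. */
  return 0;
}
```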