onnxruntime/java
Edward Chen 454f77cd94
Update kernel matching logic: decouple from op schemas and remove kernel def hashes (#12791)
# Motivation
Currently, ORT minimal builds use kernel def hashes to map from nodes to
kernels to execute when loading the model. As the kernel def hashes must
be known ahead of time, this works for statically registered kernels.
This works well for the CPU EP.
For this approach to work, the kernel def hashes must also be known at
ORT format model conversion time, which means the EP with statically
registered kernels must also be enabled then. This is not an issue for
the always-available CPU EP. However, we do not want to require that any
EP which statically registers kernels is always available too.
Consequently, we explore another approach to match nodes to kernels that
does not rely on kernel def hashes. An added benefit of this is the
possibility of moving away from kernel def hashes completely, which
would eliminate the maintenance burden of keeping the hashes stable.

# Approach
In a full build, ORT uses some information from the ONNX op schema to
match a node to a kernel. We want to avoid including the ONNX op schema
in a minimal build to reduce binary size. Essentially, we take the
necessary information from the ONNX op schema and make it available in a
minimal build.
We decouple the ONNX op schema from the kernel matching logic. The
kernel matching logic instead relies on per-op information which can
either be obtained from the ONNX op schema or another source.
This per-op information must be available in a minimal build when there
are no ONNX op schemas. We put it in the ORT format model.
Existing uses of kernel def hashes to look up kernels are replaced
with the updated kernel matching logic. We no longer store
kernel def hashes in the ORT format model’s session state and runtime
optimization representations. We no longer keep the logic to
generate and ensure stability of kernel def hashes.
2022-09-20 14:24:59 -07:00
..
src Update kernel matching logic: decouple from op schemas and remove kernel def hashes (#12791) 2022-09-20 14:24:59 -07:00
testdata
build-android.gradle Update protobuf-java to 3.20.1 (#10420) 2022-05-11 07:52:12 -07:00
build.gradle Update protobuf-java to 3.20.1 (#10420) 2022-05-11 07:52:12 -07:00
README.md
settings-android.gradle
settings.gradle

ONNX Runtime Java API

This directory contains the Java language binding for the ONNX runtime. Java Native Interface (JNI) is used to allow for seamless calls to ONNX runtime from Java.

Usage

This document pertains to developing, building, running, and testing the API itself in your local environment. For general purpose usage of the publicly distributed API, please see the general Java API documentation.

Building

Use the main project's build instructions with the --build_java option.

Requirements

JDK version 8 or later is required. The Gradle build system is required and used here to manage the Java project's dependency management, compilation, testing, and assembly. You may use your system Gradle installation installed on your PATH. Version 6 or newer is recommended. Optionally, you may use your own Gradle wrapper which will be locked to a version specified in the build.gradle configuration. This can be done once by using system Gradle installation to invoke the wrapper task in the java project's directory: cd REPO_ROOT/java && gradle wrapper Any installed wrapper is gitignored.

Build Output

The build will generate output in $REPO_ROOT/build/$OS/$CONFIGURATION/java/build:

  • docs/javadoc/ - HTML javadoc
  • reports/ - detailed test results and other reports
  • libs/onnxruntime-VERSION.jar - JAR with compiled classes, platform-specific JNI shared library, and platform-specific onnxruntime shared library.

Build System Overview

The main CMake build system delegates building and testing to Gradle. This allows the CMake system to ensure all of the C/C++ compilation is achieved prior to the Java build. The Java build depends on C/C++ onnxruntime shared library and a C JNI shared library (source located in the src/main/native directory). The JNI shared library is the glue that allows for Java to call functions in onnxruntime shared library. Given the fact that CMake injects native dependencies during CMake builds, some gradle tasks (primarily, build, test, and check) may fail.

When running the build script, CMake will compile the onnxruntime target and the JNI glue onnxruntime4j_jni target and expose the resulting libraries in a place where Gradle can ingest them. Upon successful compilation of those targets, a special Gradle task to build will be executed. The results will be placed in the output directory stated above.

Advanced Loading

The default behavior is to load the shared libraries using classpath resources. If your use case requires custom loading of the shared libraries, please consult the javadoc in the package-info.java or OnnxRuntime.java files.

Development

Code Formatting

Spotless is used to keep the code properly formatted. Gradle's spotlessCheck task will show any misformatted code. Gradle's spotlessApply task will try to fix the formatting. Misformatted code will raise failures when checks are ran during test run.

JNI Headers

When adding or updating native methods in the Java files, it may be necessary to examine the relevant JNI headers in build/headers/ai_onnxruntime*.h. These files can be manually generated using Gradle's compileJava task which will compile the Java and update the header files accordingly. Then the corresponding C files in ./src/main/native/ai_onnxruntime*.c may be updated and the build can be ran.

Dependencies

The Java API does not have any runtime or compile dependencies currently.