onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-05-17 21:10:43 +00:00

Author	SHA1	Message	Date
Adam Pocock	aed43f429a	[java] Enable output pinning in OrtSession and OrtTrainingSession (#16835 )	2023-09-26 01:49:13 -07:00
Adam Pocock	03c3e91b0d	[java] Relaxing CoreML test (#16777 ) ### Description Reduces precision on the CoreML provider test as it returns slightly different answers than the other tested providers. Checked on a 2020 13" M1 MBP. ### Motivation and Context Fixes Java CoreML test failure after #16763.	2023-08-09 11:43:05 -07:00
Adam Pocock	a1bb670536	[java] Fp16 fix for android/react native (#16832 ) ### Description This PR splits out the FP16 conversions into a separate package we can override in the android build with a version which works on old versions of Android. I'm not sure the android build system changes are correct as I haven't got an android build environment configured on my workstation. @YUNQIUGUO if the CI build fails we should follow up offline to get my environment configured so I can iterate on it. ### Motivation and Context Fixes the CI failure after #16703.	2023-07-25 12:31:32 -07:00
Adam Pocock	a8e776b78b	[java] Adds support for fp16 and bf16 tensors (#16703 ) ### Description The Java API currently only supports fp16 output tensors which it automatically casts to floats on the way out. This PR adds support for creating fp16 and bf16 tensors (from `java.nio.Buffer` objects or as the output of models, creation from Java short arrays is not supported), along with efficient methods for casting `FloatBuffer` into `ShortBuffer` filled with fp16 or bf16 values and vice versa. The fp16 conversions use a trick to pull in the efficient conversion methods added to Java 20, falling back to ports of the MLAS methods otherwise. The Java 20 methods can be special cased by the C2 JIT compiler to emit the single instruction on x86 and ARM which converts fp32<->fp16, or the vectorized versions thereof, so they should be quite a bit faster than the MLAS ported one. ### Motivation and Context fp16 and bf16 are increasingly popular formats and we've had several requests for this functionality. Fixes #7003. cc @yuslepukhin @cassiebreviu --------- Co-authored-by: Scott McKay <Scott.McKay@microsoft.com>	2023-07-21 21:14:41 +10:00
Adam Pocock	ba91457183	[java] Adding addExternalInitializers and addInitializer to OrtSession.SessionOptions (#16198 ) ### Description Adds support for adding external initializers or overriding initializers to a session options from Java. ### Motivation and Context We want to instantiate large models from Java without filesystem access. cc @yuslepukhin	2023-07-05 12:51:59 -07:00
Baiju Meswani	10ba1e270c	Minimal Build for On-Device Training (#16326 ) 🛠️ __Changes in this pull request:__ This pull request introduces two significant changes to the project: - Changing on device training checkpoint format: The current implementation stores the on device training checkpoint as a sequence of tensors in multiple files inside a checkpoint folder, which can be inefficient in terms of storage and performance. In this PR, I have modified the checkpoint format to utilize the flatbuffer table to save the checkpoint to a single file, providing a more compact and efficient representation. The changes around this are twofold: - Add the checkpoint flatbuffer schema that will generate the necessary checkpoint source files. - Update the checkpoint saving and loading functionality to use the new format. - Adding support for onnxruntime minimal build: To support scenarios where binary size is a constraint, I made changes to ensure that the training build can work well with the minimal build. 🔍 __Open Issues:__ - In order to extract the optimizer type, the existing implementation re-loaded the onnx optimizer model and parsed it. This is no longer possible, since the model format can either be onnx or ort. One idea is to do the same for ort format optimizer model. This needs some investigation. - Changes to the offline tooling to generate ort format training artifacts. - End-to-end training example showcasing the use of the minimal training build. - Add support for export model for inferencing in a minimal build.	2023-06-22 12:27:23 -07:00
Adam Pocock	bca49d62a0	Fixing CoreML in Java (#16231 ) ### Description The name of the flag we set when compiling the JNI binding to enable the CoreML EP changed at some point in the past. This PR fixes it by updating the flag in the JNI. I also added a quick smoke test for the CoreML provider to make sure it doesn't crash and can be enabled. ### Motivation and Context All the EPs should work as expected in Java. Fixes #16230.	2023-06-07 12:24:57 -07:00
Adam Pocock	3c2a11f2f1	[java] Allow the creation of boolean tensors from ByteBuffer (#15556 ) ### Description The tensor creation code now allows the creation of boolean tensors from non-direct `ByteBuffer` instances. It previously only allowed them from arrays and direct `ByteBuffer` instances and this fixes that inconsistency. The boolean tensor test has been updated to cover all three cases. ### Motivation and Context Fixes #15509.	2023-06-05 09:58:50 -07:00
Adam Pocock	8a1a40ac63	[Java] CheckpointState AddProperty & GetProperty support (#15730 )	2023-04-28 09:52:52 -07:00
Ashwini Khade	ccb2243ee7	Update build option for training in java to enable_training_api (#15638 ) ### Description Updating the build option for enabling training in java builds from ENABLE_TRAINING -> ENABLE_TRAINING_APIS. In the native codebase ENABLE_TRAINING is used for enabling full training and ENABLE_TRAINING_APIS is used for creating the lte builds with training apis. Making the change to sync the naming convention across all the language bindings. It was a bit confusing to see ENABLE_TRAINING when debugging the android build failures for training. Making this change just to improve readability of logs during debugging. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->	2023-04-24 11:53:08 -07:00
Adam Pocock	ef11032c89	[java] Allows the creation and extraction of zero length tensors (#15116 ) ### Description Allows the creation of zero length tensors via the buffer path (the array path with zero length arrays still throws as the validation logic to check it's not ragged would require more intrusive revision), and allows the `tensor.getValue()` method to return a Java multidimensional array with a zero dimension. Also added a test for the creation and extraction behaviour. ### Motivation and Context The Python interface can return zero length tensors (e.g. if object detection doesn't find any objects), and before this PR in Java calling `tensor.getValue()` throws an exception with a confusing error message. Fixes #7270 & #15107.	2023-04-05 10:49:59 -07:00
Edward Chen	c46c7ccba5	Update Gradle version (#14862 ) - Update Gradle version used in most places from 6.8.3 to 8.0.1. Update Android Gradle Plugin version where applicable. Not updated in this change: React Native Android projects (under `js/react_native/`). That can be done later along with updating the React Native projects. - Add Gradle wrapper in `java/` to make it easier to consistently use a specific Gradle version.	2023-03-08 12:22:06 -08:00
Adam Pocock	47f00b5d49	[Java] Initial on device training support (#14027 ) contributor: @Craigacp	2023-03-08 10:01:08 -08:00
Adam Pocock	150043f74f	Adds a Java accessor for GetVersionString (#14876 ) ### Description Java part of #14873.	2023-03-07 09:46:56 -08:00
Scott McKay	114f18357a	Add Java and Objective-C bindings for RegisterCustomOpsUsingFunction. (#14256 ) Description Add bindings for Android and iOS. Motivation and Context Enable mobile app linking against ort-extensions library and registering the custom ops with ORT.	2023-01-13 09:04:26 -08:00
Adam Pocock	dd2c031d95	[java] Sparse tensor support (#10653 ) Description: Adds support for creating and receiving sparse tensors in the ORT Java API. CSRC and COO tensors as inputs are tested, but there is no op which accepts a block sparse tensor to test. COO tensors are tested as outputs, but there is no op which emits a CSRC or block sparse tensor to test. Motivation and Context - Why is this change required? What problem does it solve? Request to expose ORT sparse tensor support in Java. cc @yuslepukhin	2022-11-22 10:29:24 -08:00
Adam Pocock	388d3cf847	[Java] Fix OnnxSequence semantics (#13012 ) Previously OnnxSequence would flatten out a list of tensors into a single output array assuming they were all scalar values. This doesn't accurately represent the semantics of an ONNX sequence, but was what the semantics appeared to be years ago when I first wrote that class. This PR changes it so that the `getValue` method on `OnnxSequence` unwraps the sequence and returns `List<? extends OnnxValue>` allowing the user to process the individual ONNX values separately. It's done this way rather than returning a multidimensional array for a tensor and a Java map for a map as multidimensional arrays are very inefficient in Java and best practice when operating with a OnnxTensor in Java is to use a `java.nio.ByteBuffer`. So allowing users to access each `OnnxTensor`s individually allows them to control how the data is materialised on the Java heap.	2022-09-28 15:53:30 -07:00
RandySheriffH	a83a9ed6b0	Remove miscellaneous nuphar configs (#13070 ) Remove a handful of nuphar related configurations after deprecation. Co-authored-by: Randy Shuai <rashuai@microsoft.com>	2022-09-26 13:41:28 -07:00
Edward Chen	454f77cd94	Update kernel matching logic: decouple from op schemas and remove kernel def hashes (#12791 ) # Motivation Currently, ORT minimal builds use kernel def hashes to map from nodes to kernels to execute when loading the model. As the kernel def hashes must be known ahead of time, this works for statically registered kernels. This works well for the CPU EP. For this approach to work, the kernel def hashes must also be known at ORT format model conversion time, which means the EP with statically registered kernels must also be enabled then. This is not an issue for the always-available CPU EP. However, we do not want to require that any EP which statically registers kernels is always available too. Consequently, we explore another approach to match nodes to kernels that does not rely on kernel def hashes. An added benefit of this is the possibility of moving away from kernel def hashes completely, which would eliminate the maintenance burden of keeping the hashes stable. # Approach In a full build, ORT uses some information from the ONNX op schema to match a node to a kernel. We want to avoid including the ONNX op schema in a minimal build to reduce binary size. Essentially, we take the necessary information from the ONNX op schema and make it available in a minimal build. We decouple the ONNX op schema from the kernel matching logic. The kernel matching logic instead relies on per-op information which can either be obtained from the ONNX op schema or another source. This per-op information must be available in a minimal build when there are no ONNX op schemas. We put it in the ORT format model. Existing uses of kernel def hashes to look up kernels are replaced with the updated kernel matching logic. We no longer store kernel def hashes in the ORT format model’s session state and runtime optimization representations. We no longer keep the logic to generate and ensure stability of kernel def hashes.	2022-09-20 14:24:59 -07:00
Cheng	76d17b0f48	Add java API for xnnpack (#12788 ) * Add java API for xnnpack * provider option support * a more general interface for creating EP	2022-09-03 08:29:40 +08:00
Yulong Wang	1a402a3f25	replace 'master' branch ref to 'main' for onnx repo (#12678 )	2022-08-30 13:41:42 -07:00
Yulong Wang	c144acc534	Replace 'master' branch ref to 'main' in the code (#12547 )	2022-08-22 10:48:12 -07:00
Mina Asham	6cd1931a93	Specify list/map capacity when initializing where possible (#11110 ) * Specify list/map capacity when initializing where possible - This really depends on the use case, but in some cases the array/map resizing can be slightly costly, there is effectively no downside setting the initial capacity for a collection if we know for sure its final size * Supply list/map capacity when initializing where possible - This really depends on the use case, but in some cases the array/map resizing can be slightly costly, there is effectively no downside setting the initial capacity for a collection if we know for sure its final size - Introduce an extra utility to help creating maps with expected capacity * Move utility function to OrtUtil and drop MapUtil, also add Java doc to method * Move test to the right class	2022-04-27 20:59:18 -07:00
Adam Pocock	9616ad483f	[Java] Support configuring CUDA and TensorRT execution providers (#10697 ) Java side parts for configuring CUDA and TensorRT. Adding tests for CUDA and TensorRT. Refactoring library loading logic as provider options need to have their shared library loaded before they can be constructed.	2022-03-30 14:26:51 -07:00
Adam Pocock	4ef81b142d	Making the Java tests faster by optionally disabling ones which require running multiple JVMs. (#10811 )	2022-03-08 22:19:37 -08:00
Adam Pocock	f856608599	[java] Changes OrtEnvironment so it can't be closed by users (#10670 ) * Changes OrtEnvironment so it can't be closed by users. * Fix the formatting and add a same instance check.	2022-02-28 21:03:40 -08:00
Adam Pocock	e47434ea12	[java] Adding the graph description to the exposed model metadata. (#10318 )	2022-02-28 10:05:03 -08:00
Ye Wang	2803a9465d	Add example of registering custom cuda op as shared lib (#10025 )	2022-01-05 09:22:15 -08:00
Guoyu Wang	9cedb12c89	Add App Center e2e test to Android package (#9653 ) * Add app center test for Android pipeline * update android test path * Add logging of system abi * fix java spotless error	2021-11-03 22:56:46 -07:00
Jeff Daily	c8789d3047	[ROCm] static re-hipify of CUDA EP to ROCm EP, now a shared provider (#8877 ) * re-hipify all rocm EP sources * fix all other files affected by re-hipify * add cuda_provider_factory.h to amd_hipify.py * do not use cudnn_conv_algo_search in ROCm EP, missing reduce min registration * Fix ReduceConsts template specialization introduced in #9101. Fixes the error when building for ROCm 4.3.1: error: too many template headers for onnxruntime::rocm::ReduceConsts<__half>::One (should be 0) * fix flake8 error in amd_hipify.py * speed up hipify with concurrent.futures * flake8 fix in amd_hipify.py	2021-10-14 15:15:51 -07:00
Guoyu Wang	ddafe50199	Fix Android build break after Virtual Environment update to 20210919 (#9163 )	2021-09-23 10:07:18 -07:00
Guoyu Wang	8404a2d011	Add NNAPI E2E test for Android java package (#8912 ) * Add NNAPI E2E test for Android java package * address cr comment	2021-08-31 17:34:33 -07:00
Guoyu Wang	6a1939252f	Fix Android java API failure (#8865 ) * Fix Android Package break * Without java fix -- pipeline should fail * With java fix, should pass now * address CR comments	2021-08-27 15:58:56 -07:00
Frank Liu	002e427c5b	Add UINT8 datatype support to Java (#8401 ) Add UINT8 datatype support Add inference test for UINT8 model	2021-07-22 17:11:49 -07:00
Adam Pocock	9a6fa057c8	[Java] Allow extraction of multidimensional String tensors (#8452 ) Fixing a bug where String tensors would always be single dimensional in Java.	2021-07-22 13:19:49 -07:00
Adam Pocock	55b26b6951	[Java] Adds support for DNNL, OpenVINO, TensorRT shared providers and refactors the CUDA shared provider loader (#8013 )	2021-07-20 22:33:15 -07:00
Adam Pocock	7ed9f5fc90	[Java] Fixing the creation of OnnxTensors from scalars, adding tests (#8023 ) * Fixing the creation of OnnxTensors from scalars, adding tests. * Documentation fixes from the review.	2021-06-24 13:21:35 -07:00
Guoyu Wang	a47a234b7e	Add minsdkver for AAR and AndroidTest (#7669 )	2021-05-12 16:01:25 -07:00
Rachel Guo	d8cf960412	Add android test app to validate Java API for ORT-Mobile Android (#7477 ) * test * [gwang] make cmake compile work * [gwang] enble build apks * some build update * add simple sigmoid test android project and cmake * add build.py * refine and remove unused import lib * address CR comments * remove unnecessary files * add README.md * minor update * remove * minor change * fix ci failure and minor update * fix typo in project folder * remove * remove and minor update * refine * minor fix * fix * fix typo * add gradle spotlessApply task to fix CI failure * fix * enable spotlessApply in build gradle * revert some changes * minor fix * run spotless apply for format * address CR comments and fix CI version and format * refine * Refine * address comments * refine * refine * modify * reformat * resolve version conflicts * minor update * minor update * address comments * minor update Co-authored-by: Guoyu Wang <wanggy@outlook.com>	2021-05-04 15:39:14 -07:00
Adam Pocock	0100f336d7	[java] Adds support for OrtEnvironment thread pools (#6406 ) * Updates for Gradle 7. * Adding support for OrtThreadingOptions into the Java API. * Fixing a typo in the JNI code. * Adding a test for the environment's thread pool. * Fix cuda test, add comment to failure. * Updating build.gradle	2021-01-27 13:25:22 -08:00
Adam Pocock	fddbd8935c	Adding Java support for getAvailableProviders and other small methods (#5366 ) * Adding Java support for getAvailableProviders, addFreeDimensionOverrideByName, disablePerSessionThreads and getProfilingStartTimeNs. * Fixing copyright years, running spotless and adding javadoc and an accessor to OrtProvider. * Renaming OrtSession.getProfilingStartTimeInNs. * Removing ngraph as it's been deprecated.	2020-11-24 21:42:57 -08:00
Adam Pocock	d26c71f55c	[java] Fixing the buffer semantics. (#5223 ) * [java] Fixing the buffer semantics. * Renaming bufferCapacity to bufferRemaining. * Adding a cast to char* so the pointer arithmetic works on Windows.	2020-09-22 21:29:01 -07:00
Guoyu Wang	e30530d9ea	Add java API for AddSessionConfigEntry (#5241 ) * Add session option config entry API for java * Java format * Add extra test verification * Address PR comments * Update comments Co-authored-by: gwang0000 <62914304+gwang0000@users.noreply.github.com>	2020-09-22 14:51:39 -07:00
Changming Sun	2ab3a19728	Enlarge the read buffer size in C#/Java test code (#4150 ) 1. Enlarge the read buffer size further, so that our code can run even faster. TODO: need apply the similar changes to python some other language bindings. 2. Add coreml_VGG16_ImageNet to the test exclusion set of x86_32. It is not a new model but previously we didn't run the test against x86_32.	2020-06-08 16:13:11 -07:00
Dmitri Smirnov	afca0d15ee	Create Java publishing pipeline (#3944 ) Create CPU and GPu Java publishing pipelines. Final jars are tested on all platforms. However, signing and publishing to maven are manual steps.	2020-06-01 18:18:57 -07:00
Adam Pocock	9d2d1eb6f6	[java] Adds a CUDA test (#3956 ) * [java] - adding a cuda enabled test. * Adding --build_java to the windows gpu ci pipeline. * Removing a stray line from the unit tests that always enabled CUDA for Java.	2020-05-18 12:05:51 -07:00
Adam Pocock	d38b79c6e5	[Java] Adding missing methods on Session, SessionOptions and RunOptions (v2) (#3832 ) * java - adding support for custom op libraries. * Adding support for RunOptions and additional methods for SessionOptions and OrtSession. As a result OrtEnvironment.LoggingLevel moved to be a top level enum called OrtLoggingLevel. * java - adding unit tests for RunOptions and SessionOptions. * java - removing unused releaseNamesHandle method * java - add test for custom op library. * java - adding log verbosity methods, and tests for the same. * java - fixes for custom op loading test on Windows. * Cleanup after rebase on master.	2020-05-06 01:19:46 -07:00
Dmitri Smirnov	5db30a470e	[Java] Tidying up the sample MNIST code (#3824 ) * Updating the Java sample to load MNIST in libsvm format. * java - code formatting fix. Co-authored-by: Adam Pocock <adam.pocock@oracle.com>	2020-05-05 14:34:13 -07:00
Adam Pocock	3dd3f84116	[Java] Adding model metadata support (#3573 ) * java - adding deployment information to build.gradle. * java - adding support for model metadata.	2020-04-21 02:28:15 -07:00
Adam Pocock	c91527235a	[Java] Add support for map and sequence information on output nodes (#3468 )	2020-04-16 02:29:23 -07:00

1 2

57 commits