onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-02 03:55:34 +00:00

Author	SHA1	Message	Date
Edward Chen	c46c7ccba5	Update Gradle version (#14862 ) - Update Gradle version used in most places from 6.8.3 to 8.0.1. Update Android Gradle Plugin version where applicable. Not updated in this change: React Native Android projects (under `js/react_native/`). That can be done later along with updating the React Native projects. - Add Gradle wrapper in `java/` to make it easier to consistently use a specific Gradle version.	2023-03-08 12:22:06 -08:00
Changming Sun	d9436407b6	Use safe allocator for JNI code (#13999 ) ### Description Use a customized allocarray function to replace the original malloc calls to avoid integer overflow. ### Motivation and Context Fix Prefast warnings. Fixed [AB#8990](https://aiinfra.visualstudio.com/6a833879-cd9b-44a4-a9de-adc2d818f13c/_workitems/edit/8990) Fixed [AB#8991](https://aiinfra.visualstudio.com/6a833879-cd9b-44a4-a9de-adc2d818f13c/_workitems/edit/8991) Fixed [AB#9016](https://aiinfra.visualstudio.com/6a833879-cd9b-44a4-a9de-adc2d818f13c/_workitems/edit/9016)	2023-03-08 11:40:55 -08:00
Adam Pocock	47f00b5d49	[Java] Initial on device training support (#14027 ) contributor: @Craigacp	2023-03-08 10:01:08 -08:00
Adam Pocock	150043f74f	Adds a Java accessor for GetVersionString (#14876 ) ### Description Java part of #14873.	2023-03-07 09:46:56 -08:00
Erick Muñoz	d1533c27eb	[oneDNN] Improved thread handling (#13618 ) * Added the OrtDnnlProviderOptions structure to expose configuration options to the user * The number of threads can be defined by the user with the -i flag on the perftest * Number of threads can also be configured via the OMP_NUM_THREADS environment variable * The number of threads defined in the OrtDnnlProviderOptions is prioritized over the environment variable ### Description Avoids thread oversubscription caused by OpenMP allocating the maximum number of threads possible for oneDNN EP. Added support for the OrtDnnlProviderOptions, this will allow for more EP customization capabilities, and allows for user defined number of threads. ### Motivation and Context - Improves performances and allows for user to fine tune the number of threads	2023-01-31 14:37:13 -08:00
Wei-Sheng Chin	679ae7ff33	[Java] Fix warnings (#14076 ) Fix C6011, C6385, C6386 found by Visual Studio. Basically, I set the maximum number of options for every EP to 128. To my knowledge, 128 is big enough to support all EPs. For support arbitrary number of EP options, we probably need #13999 and create a "std::vector"-like struct in C language.	2023-01-30 09:22:28 -08:00
Scott McKay	114f18357a	Add Java and Objective-C bindings for RegisterCustomOpsUsingFunction. (#14256 ) Description Add bindings for Android and iOS. Motivation and Context Enable mobile app linking against ort-extensions library and registering the custom ops with ORT.	2023-01-13 09:04:26 -08:00
Adam Pocock	dd2c031d95	[java] Sparse tensor support (#10653 ) Description: Adds support for creating and receiving sparse tensors in the ORT Java API. CSRC and COO tensors as inputs are tested, but there is no op which accepts a block sparse tensor to test. COO tensors are tested as outputs, but there is no op which emits a CSRC or block sparse tensor to test. Motivation and Context - Why is this change required? What problem does it solve? Request to expose ORT sparse tensor support in Java. cc @yuslepukhin	2022-11-22 10:29:24 -08:00
Adam Pocock	388d3cf847	[Java] Fix OnnxSequence semantics (#13012 ) Previously OnnxSequence would flatten out a list of tensors into a single output array assuming they were all scalar values. This doesn't accurately represent the semantics of an ONNX sequence, but was what the semantics appeared to be years ago when I first wrote that class. This PR changes it so that the `getValue` method on `OnnxSequence` unwraps the sequence and returns `List<? extends OnnxValue>` allowing the user to process the individual ONNX values separately. It's done this way rather than returning a multidimensional array for a tensor and a Java map for a map as multidimensional arrays are very inefficient in Java and best practice when operating with a OnnxTensor in Java is to use a `java.nio.ByteBuffer`. So allowing users to access each `OnnxTensor`s individually allows them to control how the data is materialised on the Java heap.	2022-09-28 15:53:30 -07:00
RandySheriffH	77a066c700	Drop nuphar from java API (#13107 ) Drop nuphar from: - java API - tvm.cmake - run_build.sh	2022-09-26 17:06:08 -07:00
RandySheriffH	a83a9ed6b0	Remove miscellaneous nuphar configs (#13070 ) Remove a handful of nuphar related configurations after deprecation. Co-authored-by: Randy Shuai <rashuai@microsoft.com>	2022-09-26 13:41:28 -07:00
Adam Pocock	5d55b0730e	[Java] JNI refactor for OrtJniUtil (#12516 ) Refactoring more JNI methods in OrtJniUtil. Make the strings const. Removing unnecessary use of OrtAllocator.	2022-09-08 17:04:42 -07:00
Cheng	76d17b0f48	Add java API for xnnpack (#12788 ) * Add java API for xnnpack * provider option support * a more general interface for creating EP	2022-09-03 08:29:40 +08:00
Nat Kershaw (MSFT)	0757d51334	Fix Java api docs broken link (#12686 )	2022-08-24 09:56:51 -07:00
Adam Pocock	733db31420	[Java] JNI refactor for OrtSession (#12496 ) Refactor JNI error reporting	2022-08-16 13:43:06 -07:00
Adam Pocock	8a86b346a5	[Java] JNI refactor for ONNX Tensor (#12281 ) Working on JNI refactor for OnnxTensor. Simplifying the error handling logic in createTensor. Collapsing casting branches and migrating to ONNX element type enum. Disable cpplint for JNI C files.	2022-08-08 12:48:30 -07:00
Adam Pocock	e0ed9f0f2f	[java] First part of the JNI error handling rewrite (#12013 ) Description: This fixes error handling in the JNI code in OnnxMap, OnnxSequence, OnnxRuntime, RunOptions. SessionOptions and OrtEnvironment are correct as is. The bulk of the work will be in rewriting OnnxTensor, OnnxSparseTensor (after the merge of #10653) and OrtSession, along with the helper methods in OrtJniUtil. I plan to tackle those in separate PRs to reduce the amount of code to review. Motivation and Context - Why is this change required? What problem does it solve? The current native interop code doesn't return control to Java immediately on throwing an exception from an ORT error code, which can cause incorrect interactions with native ORT, and issues with exception propagation on the Java side. - If it fixes an open issue, please link to the issue here. Partial work towards solving #11451.	2022-07-12 15:16:54 -07:00
Wenbing Li	479e71a7a8	enable the extensions custom build for java and android (#11823 )	2022-07-05 10:34:14 -07:00
Mina Asham	6cd1931a93	Specify list/map capacity when initializing where possible (#11110 ) * Specify list/map capacity when initializing where possible - This really depends on the use case, but in some cases the array/map resizing can be slightly costly, there is effectively no downside setting the initial capacity for a collection if we know for sure its final size * Supply list/map capacity when initializing where possible - This really depends on the use case, but in some cases the array/map resizing can be slightly costly, there is effectively no downside setting the initial capacity for a collection if we know for sure its final size - Introduce an extra utility to help creating maps with expected capacity * Move utility function to OrtUtil and drop MapUtil, also add Java doc to method * Move test to the right class	2022-04-27 20:59:18 -07:00
Adam Pocock	9616ad483f	[Java] Support configuring CUDA and TensorRT execution providers (#10697 ) Java side parts for configuring CUDA and TensorRT. Adding tests for CUDA and TensorRT. Refactoring library loading logic as provider options need to have their shared library loaded before they can be constructed.	2022-03-30 14:26:51 -07:00
Adam Pocock	f856608599	[java] Changes OrtEnvironment so it can't be closed by users (#10670 ) * Changes OrtEnvironment so it can't be closed by users. * Fix the formatting and add a same instance check.	2022-02-28 21:03:40 -08:00
Adam Pocock	e47434ea12	[java] Adding the graph description to the exposed model metadata. (#10318 )	2022-02-28 10:05:03 -08:00
Valery Chernov	1cdc23aba4	[TVM EP] Rename Standalone TVM (STVM) Execution Provider to TVM EP (#10260 ) * update java API for STVM EP. Issue is from PR#10019 * use_stvm -> use_tvm * rename stvm worktree * STVMAllocator -> TVMAllocator * StvmExecutionProviderInfo -> TvmExecutionProviderInfo * stvm -> tvm for cpu_targets. resolve onnxruntime::tvm and origin tvm namespaces conflict * STVMRunner -> TVMRunner * StvmExecutionProvider -> TvmExecutionProvider * tvm::env_vars * StvmProviderFactory -> TvmProviderFactory * rename factory funcs * StvmCPUDataTransfer -> TvmCPUDataTransfer * small clean * STVMFuncState -> TVMFuncState * USE_TVM -> NUPHAR_USE_TVM * USE_STVM -> USE_TVM * python API: providers.stvm -> providers.tvm. clean TVM_EP.md * clean build scripts #1 * clean build scripts, java frontend and others #2 * once more clean #3 * fix build of nuphar tvm test * final transfer stvm namespace to onnxruntime::tvm * rename stvm->tvm * NUPHAR_USE_TVM -> USE_NUPHAR_TVM * small fixes for correct CI tests * clean after rebase. Last renaming stvm to tvm, separate TVM and Nuphar in cmake and build files * update CUDA support for TVM EP * roll back CudaNN home check * ERROR for not positive input shape dimension instead of WARNING * update documentation for CUDA * small corrections after review * update GPU description * update GPU description * misprints were fixed * cleaned up error msgs Co-authored-by: Valery Chernov <valery.chernov@deelvin.com> Co-authored-by: KJlaccHoeUM9l <wotpricol@mail.ru> Co-authored-by: Thierry Moreau <tmoreau@octoml.ai>	2022-02-15 10:21:02 +01:00
Shucai Xiao	ce103ace93	Amdmigraphx fix build error (#9272 ) * fix build error * rename a missing api for the MIGraphX EP	2022-01-10 15:18:43 -08:00
Valery Chernov	b327e89efa	Standalone TVM Executor Provider (#10019 ) * squashed commit for standalone tvm execution provider * critical fix for correct python build with stvm ep * get tuning log file from ep options. It has priority over AUTOTVM_TUNING_LOG * updates and fixes * update parsing of stvm provider options * add support of external data for onnx model * add conditional dump of subgraphs * remove unused code * get input tensor shapes through provider options. get output shapes for fixed input ones by TVM API * support AUTO_TVM tuning log file inside ORT. Selector for Ansor and Auto_TVM is provider option (tuning_type) * add fp16 * add functionality of conversion of model layout to NHWC if need. Necessary parameter was added to STVM provider options * fix license text in header. fix log format * small fixes * fix issues from flake8 * remove model proto construction from GetCapability * reserve memory for vector of DLTensors * add simple tutorial for STVM EP * STVM docs * jroesch/tvm -> apache/tvm * remove dead code, unneccessary logs and comments * fix in readme * improve tutorial notebook * tvm update * update STVM_EP.md * fix default value * update STVM_EP.md * some TODOs for the future development * shorten long lines * add hyperlink to STVM_EP.md * fix Linux CI error * fix error in csharp test Co-authored-by: Jared Roesch <jroesch@octoml.ai> Co-authored-by: Valery Chernov <valery.chernov@deelvin.com> Co-authored-by: KJlaccHoeUM9l <wotpricol@mail.ru>	2021-12-15 16:59:20 -08:00
Hariharan Seshadri	bbeceb7541	Support optional type in ORT (#8339 )	2021-11-04 15:01:42 -07:00
Jeff Daily	c8789d3047	[ROCm] static re-hipify of CUDA EP to ROCm EP, now a shared provider (#8877 ) * re-hipify all rocm EP sources * fix all other files affected by re-hipify * add cuda_provider_factory.h to amd_hipify.py * do not use cudnn_conv_algo_search in ROCm EP, missing reduce min registration * Fix ReduceConsts template specialization introduced in #9101. Fixes the error when building for ROCm 4.3.1: error: too many template headers for onnxruntime::rocm::ReduceConsts<__half>::One (should be 0) * fix flake8 error in amd_hipify.py * speed up hipify with concurrent.futures * flake8 fix in amd_hipify.py	2021-10-14 15:15:51 -07:00
Guoyu Wang	bee5c26580	Add CPU_ONLY runtime option to NNAPI EP (#9066 ) * Add NNAPI cpu only option * update java * Update comments	2021-09-15 15:50:18 -07:00
Guoyu Wang	6a1939252f	Fix Android java API failure (#8865 ) * Fix Android Package break * Without java fix -- pipeline should fail * With java fix, should pass now * address CR comments	2021-08-27 15:58:56 -07:00
Frank Liu	002e427c5b	Add UINT8 datatype support to Java (#8401 ) Add UINT8 datatype support Add inference test for UINT8 model	2021-07-22 17:11:49 -07:00
Adam Pocock	9a6fa057c8	[Java] Allow extraction of multidimensional String tensors (#8452 ) Fixing a bug where String tensors would always be single dimensional in Java.	2021-07-22 13:19:49 -07:00
Adam Pocock	55b26b6951	[Java] Adds support for DNNL, OpenVINO, TensorRT shared providers and refactors the CUDA shared provider loader (#8013 )	2021-07-20 22:33:15 -07:00
Ryan Hill	cc9f793b48	Move one function from cuda_provider_factory.h (#8407 )	2021-07-19 17:55:59 -07:00
Adam Pocock	7ed9f5fc90	[Java] Fixing the creation of OnnxTensors from scalars, adding tests (#8023 ) * Fixing the creation of OnnxTensors from scalars, adding tests. * Documentation fixes from the review.	2021-06-24 13:21:35 -07:00
Guoyu Wang	9003df5d87	Fix 32bit Android java API crash (#8122 ) * Fix 32bit Android java API crash * fix code formating	2021-06-22 17:41:11 -07:00
Ryan Hill	c99aa3a3f3	Ryanunderhill/cuda shared (#7626 ) * First iteration of making cuda a shared provider. Separated out shared OpKernel change, so doing this to merge with that change. * More cuda shared library refactoring * More cuda shared library refactoring * More build options tested, converted the training ops over. * Fix merge breaks * Fix submodules * Fix submodules * Fix submodules * Fix python * Fix compile errors * Duplicate symbol fix * Test fix for ROCM provider * Another ROCM test workaround * ROCM Build Test * ROCM build fix * ROCM * ROCM * ROCM * ROCM * ROCM * ROCM test * Reduce header dependencies * Remove redundant namespace * Test fix for linux * Fix linux build * Fix Eigen build error * Fix unused parameter warning * Test link error * Another linker test * Linker test * Linker test * Another test * Another build test * Fix linux link error * Build test * Fix control flow ops to use common base class with core code * Remove extra qualifiers * Fix template syntax for linux * Fix cuda memory leak * Fix pybind * Test disabling cast * Cleanup * Restore cuda in test * Remove more header dependencies * Test not adding cuda provider to session * Make GetProviderInfo_CUDA throw * No-op cuda provider creation * Fix some setup issues * Fix memory cleanup on unload * Diagnostics * Don't unload library * Add diagnostics * Fix deleting registry at right time. * Test disabling profiler * Fix merge break * Revert profiler change * Move unloading of shared providers into Environment * Free more global allocations before library unloads * Add more diagnostics * Move unloading back to the OrtEnv as there are multiple Environments created during a session. Remove some library dependencies for tests. * Fix more cmake files * ERROR -> WARNING * Fix python shutdown * Test not using dml in pipeline * Change python version and disable dml * Update python version * Test adding unload method for shared providers * Disable DLL test * Python test * Revert "Python test" This reverts commit `c7ec2cfe98`. * Revert "Disable DLL test" This reverts commit `e901cb93aa`. * Revert "Test adding unload method for shared providers" This reverts commit `c427b78799`. * Point to RyanWinGPU * Revert python version * Fix id_to_allocator_map * Another python exit test * Remove extra debug messages Try a more clean python shutdown through DllMain * Revert DllMain idea, it didn't work * Merge conflicts * Fix merge with master issues. * Comments * Undo edit to file * Cleanup + new training ops * Revert yml changes * Fix another merge error * ROCM fix * ROCM fix v2 * Put back Linux hack, it is necessary * Stupid fixes * Fix submodule out of sync * ROCM fix 3 * ROCM 4 * Test java fix * Fix typos * Java test on my VM * Fix build error * Spotless fix * Leave temp file around to load properly * Fix cleanup on exit * Fix break * Java comments * Remove LongformerAttentionBase workaround * Spotless fix * Switch yml back to regular build pool * Revert "Switch yml back to regular build pool" This reverts commit `be35fc2a5a`. * Code review feedback * Fix errors due to merge * Spotless fix * Fix minimal build * Java fix for non cuda case * Java fix for CPU build * Fix Nuphar? * Fix nuphar 2 * Fix formatting * Revert "Remove LongformerAttentionBase workaround" This reverts commit `648679b370`. * Training fix * Another java fix * Formatting * Formatting * For orttraining * Last orttraining build fix... * training fixes * Fix test provider error * Missing pass command * Removed in wrong spot * Python typo * Python typos * Python crash on exit, possibly due to unloading of libraries. * Remove test_execution_provider from training build Only enable python atexit on windows Remove assert on provider library exit * Still can't unload providers in python, alas. * Disable Nvtx temporarily * MPI Kernels for Training * MPI Kernels part 2 * Patch through INcclService * Oops, wrong CMakeLists * Missing namespace * Fix missing () * Move INcclService::GetInstance around to link nicer * Missing } * Missing MPI libraries for Cuda * Add extra GetType functions used by MPI * Missing Nccl library * Remove LOGS statements as a test * Add in a couple more missing GetType methods * Update comments * Missed a logging reference in mpi_context.h * Convert aten_op to shared (due to marge with master) * Test moving DistributedRunContext instance into shared provider layer (with purpose error to verify it's being built properly) * Test passed, now with fix * Missing static * Oops, scope DistributedRunContext to just NCCL * Merge related issues and code review feedback. * Merge error * Bump to rel-1.9.1 (#7684) * Formatting * Code review feedback for Java build on non Windows * Remove cupti library dependency from core library * Test Java pipeline fix * Linux build fix * Revert "Linux build fix" This reverts commit `a73a811516`. * Revert "Remove cupti library dependency from core library" This reverts commit `6a889ee8bf`. * Packaging pipeline fixes to copy cuda shared provider for tensorrt & standard packages * Add cuda to Tensorrt nuget package * onnxruntime_common still has a cuda header dependency Co-authored-by: ashbhandare <ash.bhandare@gmail.com>	2021-05-20 07:53:47 -07:00
Changming Sun	7b003967b1	Add static code analyzer to Windows CPU/GPU CI builds and fix the warnings (#7489 )	2021-04-29 11:54:57 -07:00
Changming Sun	d68cedfa85	Fix some C/C++ warnings in the jni part (#7385 )	2021-04-28 14:25:58 -07:00
Guoyu Wang	4969431eba	Fix codeql java warning (#7280 )	2021-04-08 11:08:12 -07:00
Adam Pocock	5a473216b7	[Java] Adds extra providers (#6770 ) Add providers for CoreML, ROCM, NNAPI, ArmNN Adding the structs for OrtCUDAProviderOptions and OrtOpenVINOProviderOptions Updating NNAPI flags. Adding the new CoreML flag. Adding hooks to the build system to tell Java about the new providers.	2021-02-24 10:25:05 -08:00
Adam Pocock	77d0eb3f56	Fixing a leak in OnnxSequences with String keys or values. (#6473 )	2021-01-28 11:28:56 -08:00
Adam Pocock	0100f336d7	[java] Adds support for OrtEnvironment thread pools (#6406 ) * Updates for Gradle 7. * Adding support for OrtThreadingOptions into the Java API. * Fixing a typo in the JNI code. * Adding a test for the environment's thread pool. * Fix cuda test, add comment to failure. * Updating build.gradle	2021-01-27 13:25:22 -08:00
Dmitri Smirnov	6d0fb3ebb3	Java: Set C language warnings to W4 and adjust JNI code (#6347 ) Set /W3 for C language and fix up JNI warnings.	2021-01-14 15:04:47 -08:00
Adam Pocock	396074d2a8	Fixing OrtEnvironment.getEnvironment() so it doesn't print a warning if the environment already exists with a non-default name. (#5973 )	2020-12-01 15:21:06 -08:00
Adam Pocock	fddbd8935c	Adding Java support for getAvailableProviders and other small methods (#5366 ) * Adding Java support for getAvailableProviders, addFreeDimensionOverrideByName, disablePerSessionThreads and getProfilingStartTimeNs. * Fixing copyright years, running spotless and adding javadoc and an accessor to OrtProvider. * Renaming OrtSession.getProfilingStartTimeInNs. * Removing ngraph as it's been deprecated.	2020-11-24 21:42:57 -08:00
Adam Pocock	8b83c51a35	[Java] Initial Apple Silicon support (#5891 ) * Rearranging checks in onnxruntime_mlas.cmake to pickup Apple Silicon. On an M1 Macbook Pro clang reports: $ clang -dumpmachine arm64-apple-darwin20.1.0 So the regex check needs to look for "arm64" first, as otherwise it matches 32-bit ARM and you get NEON compilation failures. * Adding Java side library loading support for Apple Silicon (and other aarch64 architectures). * Adding Qgemm fix from @tracysh * Fixes the java packaging on Windows. * Missed a check in the java platform detector.	2020-11-24 15:51:40 -08:00
S. Manohar Karlapalem	ff58f621fa	Remove nGraph Execution Provider (#5858 ) * Remove nGraph Execution Provider Pursuant to nGraph deprecation notice: https://github.com/microsoft/onnxruntime/blob/master/docs/execution_providers/nGraph-ExecutionProvider.md#deprecation-notice Deprecation Notice \| \| \| \| --- \| --- \| \| Deprecation Begins \| June 1, 2020 \| \| Removal Date \| December 1, 2020 \| Starting with the OpenVINO™ toolkit 2020.2 release, all of the features previously available through nGraph have been merged into the OpenVINO™ toolkit. As a result, all the features previously available through ONNX RT Execution Provider for nGraph have been merged with ONNX RT Execution Provider for OpenVINO™ toolkit. Therefore, ONNX RT Execution Provider for nGraph will be deprecated starting June 1, 2020 and will be completely removed on December 1, 2020. Users are recommended to migrate to the ONNX RT Execution Provider for OpenVINO™ toolkit as the unified solution for all AI inferencing on Intel® hardware. * Remove nGraph Licence info from ThirdPartyNotices.txt * Use simple Test.Run() for tests without EP exclusions To be consistent with rest of test code. * Remove nGraph EP functions from Java code	2020-11-19 16:47:55 -08:00
Guoyu Wang	261462be0d	Change NNAPI runtime options to use uint32_t (#5863 ) * Change nnapi options unsigned long -> uint32_t * Move options from long to int in java code	2020-11-19 13:38:49 -08:00
Adam Pocock	d1d82065b9	[Java] Fixes an error allocating large direct byte buffers during OnnxTensor creation (#5619 ) * Fixing an error with allocating large direct byte buffers during tensor creation. * Removing the redundant overflow check.	2020-11-05 15:02:41 -08:00
Guoyu Wang	a2b551ff08	Add runtime options for NNAPI EP (#5576 ) * Add options for nnapi ep * Add nnapi flags test * add comments * Add flag comments * Make the flags bitset const * Fix build break * Add stub changes to java and c# api * Fix java related build break * Fix java build break * Switch to bit flags instead of bitset	2020-11-04 10:08:43 -08:00

1 2

73 commits