onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-05-17 21:10:43 +00:00

Author	SHA1	Message	Date
Edward Chen	c46c7ccba5	Update Gradle version (#14862 ) - Update Gradle version used in most places from 6.8.3 to 8.0.1. Update Android Gradle Plugin version where applicable. Not updated in this change: React Native Android projects (under `js/react_native/`). That can be done later along with updating the React Native projects. - Add Gradle wrapper in `java/` to make it easier to consistently use a specific Gradle version.	2023-03-08 12:22:06 -08:00
Adam Pocock	47f00b5d49	[Java] Initial on device training support (#14027 ) contributor: @Craigacp	2023-03-08 10:01:08 -08:00
Ashwini Khade	f71ac9859e	Update acpt image in the training pipeline (#14855 ) ### Description Current pipeline refers to an old image which is causing test failures. Updating the image to the latest one. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? Fixes pipeline failure: https://dev.azure.com/onnxruntime/onnxruntime/_build?definitionId=198 - If it fixes an open issue, please link to the issue here. -->	2023-03-07 14:10:32 -08:00
Changming Sun	3e08a67dd6	Add Linux ARM64 CI pipeline (#14904 )	2023-03-06 21:47:10 -08:00
Adrian Lizarraga	d45b47945c	Linux QNN Pipeline: fix build error reporting (#14922 ) ### Description Split up the ORT build step in the Linux QNN CI Pipeline. ### Motivation and Context Build errors were not being immediately reported at the end of the build step. The build step currently concatenates multiple shell commands, and the return code for the last (mkdir) was being reported. This PR ensures that the return code of the `python build.py ...` command is reported for the build step.	2023-03-06 17:49:35 -08:00
Changming Sun	c1155b70c5	Remove 37 and 50 from CUDA compute archs (#14874 ) ### Description To reduce CUDA package's size a little bit. 37 is for Tesla K80. Azure's NC-series uses it, but in most cases CUDA can dynamic generate device code .	2023-03-03 12:24:21 -08:00
Yi Zhang	8c454a76e0	Check Mac silicon package name (#14898 ) ### Description 1. add comments 2. check Mac silicon package name ### Motivation and Context There isn't Mac silicon Agent in ADO. We couldn't add smoking test to test the wheel can be installed. But We can check whether the package name is correct to avoid the mistake in 1.14 release. Test run https://dev.azure.com/aiinfra/Lotus/_build/results?buildId=283100&view=logs&j=fe710151-df7c-5aa4-0cea-cf5331faa499&t=3182cefe-2612-53c6-4445-e5b3e0c4ac57	2023-03-03 18:27:54 +08:00
Changming Sun	f3b6664384	Remove Python 3.7 from the python packaging pipeline (#14887 ) ### Description 1. Remove Python 3.7 from the python packaging pipeline. It is planned for the next release and approved by the PMs. Also we will add 3.11, but it will be addressed in another PR. 2. Stop generating python packages based on Ubuntu 18.04 which will reach EOL next month. We will either replace them with Ubuntu 20.04 or a CentOS 8 variant.	2023-03-02 19:44:49 -08:00
Chun-Wei Chen	70a31e047a	Consume ONNX 1.13.1 in ONNX Runtime (#14812 ) ### Description <!-- Describe your changes. --> Consume ONNX 1.13.1 in ONNX Runtime. (ONNX 1.13.0 to ONNX 1.13.1) ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> ONNX 1.13.1 patch was just released yesterday. This PR is making ORT's ONNX submodule consistent with the latest released ONNX. Not sure whether this PR is really needed, but let me make it ready. Previous PR for testing ONNX 1.13.1rc2 : https://github.com/microsoft/onnxruntime/pull/14634. Fixed [AB#13174](https://aiinfra.visualstudio.com/6a833879-cd9b-44a4-a9de-adc2d818f13c/_workitems/edit/13174) .	2023-03-02 14:57:35 -08:00
Hector Li	c6074f3a4b	OnnxRuntime QNN EP (#14791 ) ### Description Integrate Qualcomm QNN SDK to enable inference on QC hexagon NPU devices ### Motivation and Context Enable Ort inference on QC hexagon NPU devices. --------- Co-authored-by: Satya Jandhyala <sajandhy@microsoft.com> Co-authored-by: Adrian Lizarraga <adlizarraga@microsoft.com> Co-authored-by: Adrian Lizarraga <adrianlm2@gmail.com>	2023-03-01 13:48:20 -08:00
Scott McKay	b7fde84341	Changes to support standalone custom ops in a minimal build. (#14497 ) ### Description <!-- Describe your changes. --> Changes to support standalone custom ops in a minimal build. Also incorporates changes from #14492 (needed to test builds prior to that being checked in). We first need to save the schema info from the operators used by the standalone op invoker in the ORT format model. Add mechanism for that. Merge the kernel lookup logic so the same is used in full and minimal build. NOTE: the version matching is now consistent with all other kernel lookups, and the call to CreateOp MUST use the exact version for the operator. Previously matching wasn't as strict, but this can lead to the incorrect kernel being chosen. Add tests. NOTE: There is currently no way to detect the ops/types/opsets used inside these custom ops as they don't exist until we create kernels, which is after model loading completes (which is the point the ORT format model is saved). Due to that they have to be manually added to the configuration used to do the reduced ops build. That shouldn't be too hard for the custom op author to add given the custom op implementation is specifying the op, opset and type constraints (i.e. they have the info and it's just a case of capturing/formatting it correctly). ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> Enable usage of the standalone op invoker by custom ops in a minimal build. --------- Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>	2023-03-01 11:22:54 +10:00
Yulong Wang	69c5edb11b	[wasm] upgrade emsdk from 3.1.19 to 3.1.32 (#14818 ) ### Description upgrade emsdk from 3.1.19 to 3.1.32 also add explicit config for stack size (1MB).	2023-02-28 11:06:09 -08:00
Yi Zhang	6320decf04	increase Test GPU Job's timeout to 8 hours (#14850 ) ### Description <!-- Describe your changes. --> ### Motivation and Context In practice, 6 hours is not enough to finish the job.	2023-02-28 18:52:03 +08:00
Yi Zhang	0be20dc0f6	Run GPU test job after all CPU test jobs succeed. (#14833 ) ### Description Make GPU job depends on all CPU jobs ### Motivation and Context GPU resources are very limited in packaging pipeline. And GPU test job is very time consuming. Only one CPU job fails, the workflow fails, so the GPU job is meaningless. To utilize GPU resources more efficiently, run GPU job only after all CPU jobs succeed. ###test pipeline https://dev.azure.com/aiinfra/Lotus/_build/results?buildId=280905&view=results	2023-02-28 07:44:51 +08:00
Yulong Wang	6b83ad9659	[js/web] allow unittest (onnxruntime_test_all) to run in browser (#14820 ) ### Description allow onnxruntime_test_all to run in browser for WebAssembly build (use flag `--wasm_run_tests_in_browser`). To output the logs from stdout correctly, this test needs to be build with `--enable_wasm_threads`.	2023-02-24 16:45:33 -08:00
Rachel Guo	0700788b6e	Disable e2e android react native CI test temporarily (#14803 ) ### Description <!-- Describe your changes. --> Disable e2e android react native test temporarily to unblock the CI failure with no easy fix. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> Temp solution to unblock CI failure.	2023-02-24 09:32:18 -08:00
Jian Chen	29428cd9dc	Cjian/pr into main for 1.14.1 fix (#14805 ) ### Description <!-- Describe your changes. --> PR a change made to 1.14.1 into Main branch as well. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->	2023-02-23 18:10:57 -08:00
Jian Chen	62ee0c8110	Migrating ORT Extensions from Git submodule to cmake FetchContent (#14298 ) ### Description <!-- Describe your changes. --> Merging extensions from Git submodule to cmake FetchContent ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> --------- Co-authored-by: Changming Sun <chasun@microsoft.com> Co-authored-by: Jian Chen <jchen351@MacBook-Pro.local>	2023-02-22 19:42:36 -08:00
Edward Chen	b3b9be19b1	Update clang-tidy path for updated Mac image. (#14760 ) Update clang-tidy path for updated Mac image. Fix Objective-C static analysis build.	2023-02-22 09:00:42 -08:00
Edward Chen	ad78579b66	Update java/build.gradle to not use deprecated features that were removed in gradle 8.0. (#14733 ) ### Description <!-- Describe your changes. --> Update java/build.gradle to not use deprecated features that were removed in gradle 8.0. Also move gradle wrapper setup from a script into a step template. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> Fix builds which use hosted Mac agents and gradle. Recently the system version of gradle got upgraded to 8.0. Even though we use an older gradle wrapper version, java/build.gradle is still processed with gradle 8.0 in the initial call to `gradle wrapper`.	2023-02-20 11:19:49 +08:00
Wei-Sheng Chin	7b31bcda2e	Disable LazyTensor-ORT Test (#14703 ) As title since LazyTensor is replaced by Dynamo in PyTorch 2.0.	2023-02-17 17:46:51 +08:00
Patrice Vignola	ce9a71620f	Fix DML release build (#14661 ) ### Description Fixes the DML release build for 1.14.1. This was initially fixed by https://github.com/microsoft/onnxruntime/pull/13417 for 1.13.1, but the changes didn't make their way back to the main branch.	2023-02-13 17:31:11 -08:00
Tang, Cheng	8f34c8c8ed	Introduce collective ops to ort inference build (#14399 ) ### Description Introduce collective ops into onnxruntime inference build, including 1) AllReduce and AllGather schema in contrib op, controlled by USE_MPI flag 2) AllReduce and AllGather kernel in cuda EP, controlled by ORT_USE_NCCL flag ### Motivation and Context Enable the collective ops in onnxruntime inference build so we have the ability to run distributed inference with multiple GPUs. The original ncclAllReduce ops in training build require quite complex configurations, which is not suitable for inference case, and it already broken. so we introduce a new implementation. --------- Co-authored-by: Cheng Tang <chenta@microsoft.com@orttrainingdev9.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>	2023-02-07 13:47:48 -08:00
RandySheriffH	b6bec54341	Revert mimalloc from v2.0.9 to v2.0.3 (#14603 ) Revert mimalloc from v2.0.9 to v2.0.3 to silence build error in [post-merge ](https://aiinfra.visualstudio.com/Lotus/_build/results?buildId=273075&view=logs&j=f019f681-ae8f-5ee4-d119-02530df66a84&t=6c90c65c-2ab2-56af-633f-b5631256a8e1&l=351) pipeline. New dependency version was generated [here](https://aiinfra.visualstudio.com/Lotus/_artifacts/feed/Lotus/UPack/onnxruntime_build_dependencies/overview/1.0.29). Co-authored-by: Randy Shuai <rashuai@microsoft.com> Co-authored-by: rui-ren <ruiren1225@gmail.com>	2023-02-07 09:58:25 -08:00
Baiju Meswani	68a402e739	Add support for python 3.10 for onnxruntime-training cuda and cpu (#14100 )	2023-02-02 11:32:41 -08:00
RandySheriffH	01cafe89f0	Specify deps in deps.txt and manifest (#14530 ) Specify new deps and update cgmanifest.json. --------- Co-authored-by: Randy Shuai <rashuai@microsoft.com>	2023-02-02 09:44:57 -08:00
Baiju Meswani	7954976e0a	Fix python packaging pipeline (#14533 ) fix onnx and protobuf inconsistencies in python packaging pipeline.	2023-02-02 13:11:18 +08:00
Yulong Wang	0578eeff91	upgrade EsrpCodeSigning from v1 to v2 (#14531 ) ### Description This change upgrade EsrpCodeSigning from v1 to v2 in our build pipeline.	2023-02-02 13:08:26 +08:00
Yi Zhang	80f807c03d	upgrade protobuf to 3.20.2 and onnx to 1.13 (#14279 ) ### Description upgrade protobuf to 3.20.2, same as onnx 1.13.0 ### Motivation and Context Per component governance requirement and Fixes #14060 unused-parameter error occurs in 2 conditions. 1. compile protolbuf `onnxruntime_src/cmake/external/protobuf/src/google/protobuf/repeated_ptr_field.h:752:66: error: unused parameter ‘prototype’ [-Werror=unused-parameter]` 2. include onnx_pb.h ``` 2023-01-28T10:20:15.0410853Z FAILED: CMakeFiles/onnxruntime_pybind11_state.dir/onnxruntime_src/onnxruntime/python/onnxruntime_pybind_iobinding.cc.o ...... 2023-01-28T10:20:15.0466024Z from /build/Debug/_deps/onnx-src/onnx/onnx_pb.h:51, 2023-01-28T10:20:15.0466958Z from /onnxruntime_src/include/onnxruntime/core/framework/to_tensor_proto_element_type.h:10, .... 2023-01-28T10:20:15.0609678Z /build/Debug/_deps/onnx-build/onnx/onnx-operators-ml.pb.h:1178:25: required from here 2023-01-28T10:20:15.0610895Z /onnxruntime_src/cmake/external/protobuf/src/google/protobuf/repeated_ptr_field.h:752:66: error: unused parameter ‘prototype’ [-Werror=unused-parameter] 2023-01-28T10:20:15.0611707Z cc1plus: all warnings being treated as errors ``` https://dev.azure.com/onnxruntime/2a773b67-e88b-4c7f-9fc0-87d31fea8ef2/_apis/build/builds/874605/logs/22	2023-01-31 12:55:09 -08:00
cloudhan	3b6d551c35	Enable ccache for HIP objects (#14465 ) This enables HIP compiler to be launched with `ccache` when build with `--use_cache`	2023-01-28 22:34:24 +08:00
Vincent Wang	7aecb2150f	Fix onnxruntime-CI-nightly-ort-pipeline Failure (#14464 ) PyTorch skipped version 1.14 and jumped to 2.0, while the image for the onnxruntime-CI-nightly-ort-pipeline is still using nightly-ubuntu2004-cu116-py38-torch1140dev. Switch to the new torch version image to fix the failure of the pipeline.	2023-01-28 16:05:56 +08:00
Tianlei Wu	94b1791974	Upgrade CUTLASS to v2.11 and add sequence length threshold for cutlass FMHA (#14401 ) ### Description Add sequence length threshold for triggering cutlass FMHA in FP32. See performance test results in https://github.com/microsoft/onnxruntime/pull/14343 to see how this threshold is selected. Upgrade cutlass to v2.11 and update deps.txt and cgmanifest for nuget pipeline build (test build: https://aiinfra.visualstudio.com/Lotus/_build/results?buildId=268574&view=results)	2023-01-25 09:43:48 -08:00
Edward Chen	7cc9aed314	Android package custom build script update (#14403 ) Update Android package custom build script. - Use later version of various dependencies (CMake, JDK, Android command line tools, Android NDK, Ubuntu). The CMake version was too old for the current ORT code. - Do in-container build in a directory that is not shared with the host. Resolves some file permission issues and speeds up file access. Add a nightly build to make sure the script works with the latest ORT.	2023-01-25 09:19:05 -08:00
Yi Zhang	cf3661ff6d	Revert "Allow PostAnalysis@2 task to continue on error for Windows_Pa… (#14375 ) …ckaging_CPU_x86_default (#14332)" This reverts commit `a491f33f54`. ### Description ### Motivation and Context It looks an ADO issue. Now, it's recovered. It could be reenabled.	2023-01-21 09:32:39 +08:00
Edward Chen	3b382ea7e1	Free OrtStatus in ASSERT_ORT_STATUS_OK, make run_android_emulator.py work with newer JDK version (#14369 ) - Free OrtStatus in ASSERT_ORT_STATUS_OK in model_tests.cc - Make run_android_emulator.py work with newer JDK version	2023-01-20 09:27:47 -08:00
Yi Zhang	3d6cea14f4	Remove intermedia obj files once build finished (#14361 ) ### Description Remove intermedia obj files and reenable cache ### Motivation and Context Recently, training_debug_x64 pipeline often failed due to not enough space. It could free nearly 8G space by deleting obj files. So, the compilation cache can be reenabled	2023-01-20 13:37:15 +08:00
Edward Chen	ae0e090c7b	Fix post merge jobs pipeline build issues (#14346 ) - Fix debug node inputs outputs nullptr dereference with ONNX optional types. - Fix model test memory leak. - Convert jobs to stages in post-merge-jobs.yml to allow a subset of builds to be enabled when running manually. - Fix buffer overrun in CumSum op exposed by Mimalloc build.	2023-01-19 11:16:42 -08:00
Yi Zhang	b51415b0ea	disable cache for training_x64_debug (#14358 ) ### Description disable cache to save disk space for training_x64_debug ### Motivation and Context To mitigate not enough disk space in training_x64_debug first.	2023-01-19 15:08:34 +08:00
Adrian Lizarraga	a491f33f54	Allow PostAnalysis@2 task to continue on error for Windows_Packaging_CPU_x86_default (#14332 ) ### Description Allows the PostAnalysis@2 task for windows CI jobs to continue even if an error is encountered. ### Motivation and Context This is a temporary workaround that enables the `Windows_Packaging_CPU_x86_default` job within the Zip-Nuget-Java-NodeJS packaging pipeline to finish. A recent update to dotnet 6 has broken the PostAnalysis task for this job. This task was originally added by https://github.com/microsoft/onnxruntime/pull/13694	2023-01-18 19:54:48 -08:00
Rui Ren	904e63633a	increase the time limit as more unit tests added (#14327 ) ### Description Pipeline failed because we added more unit tests, reference: https://dev.azure.com/onnxruntime/onnxruntime/_build/results?buildId=863643&view=logs&j=7536d2cd-87d4-54fe-4891-bfbbf2741d83&t=305229be-e8ba-5189-ca61-fcb77d866478 Now we have: [2430 tests]( https://dev.azure.com/onnxruntime/onnxruntime/_build/results?buildId=863619&view=logs&j=7536d2cd-87d4-54fe-4891-bfbbf2741d83&t=4efd38bc-b0da-5f98-81a8-ea2885f78448&l=43853) Previously we had: [2422 tests](https://dev.azure.com/onnxruntime/onnxruntime/_build/results?buildId=859543&view=logs&j=7536d2cd-87d4-54fe-4891-bfbbf2741d83&t=4efd38bc-b0da-5f98-81a8-ea2885f78448&l=43640) - Timeout error as we have 2 hour threshold ``` jobs: - job: Linux_Build timeoutInMinutes: 120 variables: skipComponentGovernanceDetection: true ``` ### Motivation and Context - Increase the timeoutInMinutes to `150`	2023-01-18 15:51:21 -08:00
Guenther Schmuelling	60290393f3	enable ort-extensions in wasm release builds (#14239 ) enable ort-extensions in wasm release builds. sentence piece, gpt2, bert and word piece tokenizers for now. wasm size will grow from 8.4MB to 8.9MB.	2023-01-17 12:39:13 -08:00
Yi Zhang	fb801d58b1	Add Cache in Linux CPU Aten Pipeline (#14313 ) ### Description Add compilation cache in Linux CPU Aten Pipeline. The pipeline could be completed in 6 minutes at best. ### Motivation and Context 1. Accelerate the pipeline. 2. It's the shortest pipeline with docker image. I'll use it to try moving the storage of linux docker image from ACR to ADO pipeline cache.	2023-01-17 10:49:29 +08:00
Yi Zhang	6d60dc24fe	install shared deps script (#14234 ) ### Description Add a new install_shared_deps.sh ### Motivation and Context Azcopy, Ninja, Node.js and CCache are all needed, but they are copied everywhere.	2023-01-16 18:27:29 +08:00
Yi Zhang	2a82f95040	Increase package python test pipeline timeout limit (#14288 ) ### Description Increase python test pipeline timeout limit. So far, It's a known issue for tensortRT8.5.	2023-01-14 13:46:09 +08:00
PeixuanZuo	d3a09cf77f	[ROCm] use pytest-xdist for fast pytest (#14261 ) ### Description Use pytest-xdist to distribute tests across multiple CPUs to speed up test execution. Use pytest-rerunfailures to rerun failed test in case of pytest-xdist crash. `pytest -n 16` can reduce pytest time from 80 minutes to 20 minutes. ### Motivation and Context Now kernel explorer pytest of ROCm CI takes nearly 1 hour 20 minutes. It will take longer time when we add more tunableOp in the future.	2023-01-13 16:57:50 +08:00
Scott McKay	b9ecd428c1	Add ability to register custom ops by specifying a function name (#14177 ) ### Description <!-- Describe your changes. --> Use dlsym/GetProcAddress to lookup a custom ops registration function by name and call it. This will be better on mobile platforms where the custom ops library is linked against, and there isn't necessarily a filesystem that a library path can be loaded from. Alternative is to wire up passing in the address of the function, but that has multiple complications which differ by platform. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> Enable using ort and ort-ext packages on mobile platforms. Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>	2023-01-12 15:11:34 +10:00
sfatimar	7654cd50e8	Openvino ep 2022.3 v4.3 (#14210 ) ### Description Changes to incorporate OpenVINO EP 2022.3 ### Motivation and Context This change is required to incorportate OpenVINO EP 2022.3 - If it fixes an open issue, please link to the issue here. --> Co-authored-by: mohsinmx <mohsinx.mohammad@intel.com> Co-authored-by: Preetha Veeramalai <preetha.veeramalai@intel.com> Co-authored-by: Aravind <aravindx.gunda@intel.com> Co-authored-by: mayavijx <mayax.vijayan@intel.com> Co-authored-by: flexci <mohsinmx>	2023-01-11 16:31:26 -08:00
RandySheriffH	83ad562826	Rename CloudEP to AzureEP (#14175 ) Rename CloudEP to AzureEP. Co-authored-by: Randy Shuai <rashuai@microsoft.com>	2023-01-11 12:25:04 -08:00
Ashwini Khade	d92c663f28	Create dedicated build for training api (#14136 ) ### Description Enable creating dedicated build for on device training. With this PR we can build a lean binary for on device training using flag --enable_training_apis. This binary includes only the essentials like training ops, optimizers etc and NOT features like Aten fallback, strided tensors, gradient builders etc . This binary also removes all the deprecated components like training::TrainingSession and OrtTrainer etc ### Motivation and Context This enables our partners to create a lean binary for on device training.	2023-01-10 20:58:04 -08:00
PeixuanZuo	33367fa2dc	[MIGraphX] update the MIGraphX version used in ORT to rocm-5.4.0 (#14184 ) ### Description Update the MIGraphX version used in ORT to rocm-5.4.0 ### Motivation and Context The previous branch migraphx_for_ort has stopped updating, it is too far away from the MIgraphX latest release branch. More discussion here: https://github.com/microsoft/onnxruntime/issues/14126#issuecomment-1373201049 Co-authored-by: peixuanzuo <peixuanzuo@linmif39a000004.zvflicr54joexhdgnhvmxrxygg.phxx.internal.cloudapp.net>	2023-01-10 13:40:25 +08:00

1 2 3 4 5 ...

1334 commits