onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-06-03 23:49:44 +00:00

Author	SHA1	Message	Date
Yifan Li	808215366d	Fix Multi GPU TensorRT tests (#17269 ) ### Description * Integrate `trt_multi_gpu` test stage in ORT post merge CI (Win-2xA10 vm) * Deprecate Linux MultiGPU TRT CI (This vm will be deprecated soon) * Add multi gpu support to existing C# test cases * Deprecate unfunctional flag `--enable_multi_device_tests` ### Motivation and Context * Two contexts of replacing Linux MultiGPU TRT CI: * Flag `--enable_multi_device_tests` is not functional, which cannot detect issues like #17036 * The Linux-2xM60 VM of this CI pool is about to be deprecated 9/6/23. Need to enable this test in other dualGPU vm pool.	2023-08-25 20:30:45 -07:00
Arthur Islamov	c262879214	Added DML and CUDA provider support in onnxruntime-node (#16050 ) ### Description I've added changes to support CUDA and DML (only on Windows, on other platforms it will throw an error) ### Motivation and Context It fixes this feature request https://github.com/microsoft/onnxruntime/issues/14127 which is tracked here https://github.com/microsoft/onnxruntime/issues/14529 I was working on StableDiffusion implementation for node.js and it is very slow on CPU, so GPU support is essential. Here is a working demo with a patched and precompiled version https://github.com/dakenf/stable-diffusion-nodejs ---------	2023-08-25 16:57:06 -07:00
Yi Zhang	9cd33e07b4	Readd Tests in Window GPU Reduced Ops workflow (#17294 ) ### Description Add single test step in Window GPU Reduced Ops workflow ### Motivation and Context The old workflow's building and testing were running in one command. In PR #17263, the test step was removed by mistake. So, readd it. How to consolidate the test step is in consideration.	2023-08-25 15:56:59 +08:00
Yi Zhang	756eda2cc4	Windows CI build steps template (#17263 ) ### Description 1. New windows ci build steps template. 2. Remove useless variables. ### Motivation and Context 1. Make it easier to apply build cache to all windows CIs. 2. Other team's devs only need to take care of build options ###Comparision Before: `9f21f694cf/tools/ci_build/github/azure-pipelines/win-gpu-tensorrt-ci-pipeline.yml (L19-L82)` After: `b4c1f2261b/tools/ci_build/github/azure-pipelines/win-gpu-tensorrt-ci-pipeline.yml (L35-L54)`	2023-08-25 05:58:49 +08:00
Jian Chen	33415b9da4	Removing 10.14 suffix from osx nuget package (#17277 ) ### Description <!-- Describe your changes. --> ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->	2023-08-24 08:51:54 -07:00
cloudhan	87bef1f3f2	Move composable_kernel to deps.txt (#17245 )	2023-08-23 17:39:16 -07:00
Yi Zhang	61a79436e2	Common pre-build steps of Windows CI (#16970 ) ### Description Unify some pre-build common steps. ### Motivation and Context In the long run, other devs should only focus on build option and test commands. It would reduce mistakes and maintenance cost to use common template steps. There will be more PRs to achieve the goal.	2023-08-22 18:09:55 +08:00
cloudhan	4e6cec4d09	Update ck and enable test (#16383 ) Apply the fix in https://github.com/ROCmSoftwarePlatform/composable_kernel/issues/728 Introduce more kernel instances and allow the introduction of streamk and splitk.	2023-08-22 11:08:55 +08:00
Baiju Meswani	aae9a52e8b	Avoid pushing cpu package to https://download.onnxruntime.ai/ (#17238 )	2023-08-21 15:47:07 -07:00
Changming Sun	e2b6827a59	Add a CUDA 12.x pipeline and improve install_third_party_deps.ps1 (#17231 ) ### Description 1. Add a CUDA 12.x pipeline 2. Improve install_third_party_deps.ps1: avoid using Start-process. Directly call the command instead. ### Motivation and Context Since our official packages and all CI pipelines still use CUDA 11.x, we need extra pipelines to validate our source code level compatibility with CUDA 12.x. BTW for sure the prebuilt binaries in our release page are not compatible with CUDA 12.x. Do not report bugs for that. AB#15152	2023-08-21 13:04:36 -07:00
Chi Lo	9445539e2c	Update dependency for deps.txt (#17220 ) https://github.com/microsoft/onnxruntime/pull/17059 updates deps.txt and we also need to update cgmanifest.json and upload the files to Azure DevOps https://aiinfra.visualstudio.com/Lotus/_build/results?buildId=342803&view=results for testing	2023-08-19 00:43:25 -07:00
Edward Chen	d6cd41cfc1	[CoreML EP] Add Shape, Gather, and Slice ops (#17153 ) Add CoreML EP shape related ops: - Shape - Gather - Slice Add support for int64/int32 inputs in CoreML EP.	2023-08-18 22:34:34 -07:00
Yulong Wang	3426954525	disable browser stack tests (#17224 ) ### Description disable browser stack tests	2023-08-18 17:14:12 -07:00
Changming Sun	6db72165eb	Fix python packaging test pipeline (#17204 ) ### Description 1. Fix python packaging test pipeline. There was an error in tools/ci_build/github/linux/run_python_tests.sh that it installed a released version of onnxruntime python package from pypi.org to run the test. Supposedly it should pick one from the current build. 2. Refactor the pipeline to allow choosing cmake build type from the web UI when manually trigger a build. Now this feature is for Linux only. Because I don't want to change too much when we are about to cut a release branch. After that I will expand it to all platforms. This feature is useful for debugging pipeline issues, also, we may consider having a nightly pipeline to run all tests in Debug mode which may catch extra bugs because in debug mode we can enforce range check. Test run: https://aiinfra.visualstudio.com/Lotus/_build/results?buildId=342674&view=results ### Motivation and Context Currently the pipeline has a crash error. AB#18580	2023-08-18 14:51:26 -07:00
Adrian Lizarraga	6ee4be724b	Update LICENSE name in NuGet packaging pipelines (#17183 ) ### Description Updates NuGet packaging pipelines to use the correct license name. ### Motivation and Context The license name changed. See https://github.com/microsoft/onnxruntime/pull/17170 The QNN_Windows_Nuget and Zip-Nuget-* pipelines will not run without this update.	2023-08-17 22:22:19 -07:00
Changming Sun	0cccbcc47b	Move DML build job's Prefast task to a CPU machine pool (#17192 ) ### Description Move DML build job's Prefast task to a CPU machine pool which has larger memory. The current one runs out of memory in every run. ### Motivation and Context To fix the broken python packaging pipeline.	2023-08-17 13:16:29 -07:00
Jian Chen	e0022d061f	Set web-ci-pipeline.yml only triggered when related fields are updated (#17148 ) - 'js/web' - 'js/node' - 'onnxruntime/core/providers/js' is updated ### Description <!-- Describe your changes. --> ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->	2023-08-17 12:55:35 -07:00
Adrian Lizarraga	96b1ff610b	Add CI and PR validation triggers to QNN Windows x64 Pipeline yaml (#17178 ) ### Description Adds continuous integration and pull-requestion validation triggers directly to the yaml file for the Windows x64 QNN CI Pipeline. ### Motivation and Context There have been various unit tests failures that break the QNN_Windows_Nuget pipeline, which builds QNN EP for Windows x64. This PR ensures that QNN EP is built and tested on a Windows x64 image for every pull request.	2023-08-16 11:44:54 -07:00
Jian Chen	8998b6811d	Fix NPM Packaging Pipeline (#17182 ) ### Description <!-- Describe your changes. --> ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->	2023-08-15 22:56:38 -07:00
Adam Louly	c647e3e8ab	Run nightly pipeline tests from the commit id. (#17162 ) ### Description The onnxruntime-CI-nightly-ort-pipeline encounters occasional failures due to synchronization discrepancies between the ACPT nightly image and the repository. We are addressing this by executing tests using the commit ID associated with the ort build within the ACPT image. --------- Co-authored-by: Adam Louly <adamlouly@microsoft.com@orttrainingdev9.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>	2023-08-15 12:07:38 -07:00
Changming Sun	8e203efc69	Cleanup cmake file (#17154 ) ### Description 1. Clean up cmake files. Remove some unused code 2. Remove the "Semmle" task from tools/ci_build/github/azure-pipelines/templates/win-ci.yml. Semmle is deprecated and replaced by CodeQL.	2023-08-15 10:51:33 -07:00
Changming Sun	2a22325005	Explicitly set JDK version when building ORT java package (#17147 ) ### Description Explicitly set JDK version when building ORT java package. This is to fix an internal build error.	2023-08-15 10:36:05 -07:00
Adrian Lizarraga	b734db1924	[QNN EP] Fix CI build on Windows x64 pipelines (#17152 ) ### Description - Disables Resize tests that use nearest mode on QNN CPU. - Fixes indentation problems on yaml for win x64 qnn pipeline. ### Motivation and Context The QNN windows Nuget pipeline does not run due to failing unit tests on Windows x64. These tests should not be enabled until we determine the rounding behavior of QNN's ResizeNearestNeighbor operator.	2023-08-14 21:03:14 -07:00
Baiju Meswani	289600b47d	ONNX Runtime training cpu package name for ADO (#17109 )	2023-08-14 11:32:35 -07:00
PeixuanZuo	be2200c00b	[ROCm] fix python package pipeline (#17136 ) ROCm python package pipeline failed because this PR(https://github.com/microsoft/onnxruntime/pull/16325) changed onnx version to a commit and we need to build onnx from source. Low protobuf version will cause build errors. This PR remove `cmake ` and `protobuf ` from Dockerfile, these two will install by `install_os_deps.sh`.	2023-08-14 11:22:43 -07:00
Jian Chen	45f52987a2	Web CI Pipeline Isolation (#17005 ) ### Description <!-- Describe your changes. --> ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->	2023-08-14 10:37:37 -07:00
Jian Chen	68ea9631af	Fix typo onnxruntimecpubuilpython (#17120 ) ### Description The correct name should be onnxruntimecpubuildpython ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> Co-authored-by: Yulong Wang <7679871+fs-eire@users.noreply.github.com>	2023-08-14 08:34:43 -07:00
Changming Sun	4728f20f9a	Fix CI build (#17118 ) ### Description Some pipelines are failing. It is because PR #16325 set ONNX version to `rel-1.14.1` . It is a branch name, not a commit or tag name. It means whenever the branch got a new commit, we will auto pick it and use it.	2023-08-11 10:56:38 -07:00
Edward Chen	e7e974b23f	Use double quotes so variable gets expanded. (#17105 )	2023-08-11 09:05:41 -07:00
Hector Li	344c41fdb9	[QNN EP] Update QNN to v2.13 (#17079 ) ### Description Update QNN SDK to v2.13, update some UTs accordingly	2023-08-10 20:47:55 -07:00
Yulong Wang	9cd4e5af68	[wasm] upgrade emsdk to 3.1.44 (#17069 ) ### Description This change upgrade emsdk to 3.1.44. Because backend is upgraded to LLVM 16, so need to fix a lot of build failures caused by "-Wshorten-64-to-32". most of the build failures comes from generated `onnx.pb.h`, and this can be fixed by including "core/graph/onnx_protobuf.h", which detects and ignore shorten-64-to-32 warnings.	2023-08-10 16:08:36 -07:00
Bowen Bao	6986981482	Bump ONNX version (#16325 ) ### Description Bump ONNX version to https://github.com/onnx/onnx/tree/rel-1.14.1 to include a fix for segfault when shape inferencing nested onnx functions. ### Motivation and Context Resolves #16170	2023-08-10 11:27:28 -07:00
PeixuanZuo	12837ba5c7	[ROCm] Update CI based on ubuntu 22.04 (#17076 ) - Update ROCm version to ROCm5.6 - Update CI based on ubuntu 22.04	2023-08-10 09:51:29 -07:00
RandySheriffH	a7542f48d6	Make AzureEP default for python and c# packaging (#17025 ) Make AzureEP default for python and c# packaging, with UT. --------- Co-authored-by: Randy Shuai <rashuai@microsoft.com>	2023-08-09 12:36:52 -07:00
Yulong Wang	56bced0581	[js/web] enable webgpu in browser unit test (#16310 ) ### Description enable webgpu in browser unit test. The CI pipeline uses Edge v113+ which enables WebGPU. === UPDATE on 08/07/2023: - add flags to Edge browser launch commandline so that Edge on CI agents can initialize WebGPU correctly. - ONLY enable webgpu on web release build. Other pipelines are using flag `-b=wasm,webgl,xnnpack` to specify the other 3 backends explicitly. - disable "Resize" related test failures. Once they are fixed the tests can be re-enabled. --------- Co-authored-by: Satya Jandhyala <satya.k.jandhyala@gmail.com>	2023-08-08 11:45:04 -07:00
Edward Chen	50719d2f8e	[iOS] Add script to get simulator device info. (#17012 ) Add script to get iOS simulator device info so we don't need to use hardcoded specifiers which may or may not refer to a valid simulator device. Add use-xcode-version step to a packaging pipeline so it uses a consistent version of Xcode.	2023-08-08 09:04:06 -07:00
Baiju Meswani	249917a093	Add mac and windows python packages for onnxruntime-training (#16993 )	2023-08-07 20:32:55 -07:00
Yifan Li	d6ce43db5e	[EP Perf] MemTest: Add Valgrind and fix addressSanitizer (#16930 ) ### Description 1. Add valgrind to existing ep_perf CI MemTest and parse ORT-TRT memLeak details 1. General Valgrind logs and logs related to ORT-TRT will be parsed in [CI artifacts](https://aiinfra.visualstudio.com/Lotus/_build/results?buildId=334122&view=artifacts&pathAsName=false&type=publishedArtifacts) 1. Logic: 1. Run valgrind with `onnxruntime-perf-test -e tensorrt` and export log to `valgrind.log` 2. Identify if any `definitely lost` memleak happened 1. For log paragraphs which show `definitely lost`, parse if they have keyword `TensorrtExecutionProvider`. 2. If so, extract these details to `ort_trt_memleak_detail.log`, and return `build failure` to EP Perf CI 3. Fix existing addressSanitizer and sync the squeezenet testcase with latest update from [ort-inference-example](https://github.com/microsoft/onnxruntime-inference-examples/blob/main/c_cxx/squeezenet/main.cpp) 1. Updates in short: Upgrade main.cpp to be using OrtTensorRTProviderOptionsV2 4. Reorder the 7-min-MemTest to be ahead of 9-hr-model-tests, and enable MemTest by default	2023-08-04 16:58:57 -07:00
Yulong Wang	5af8774a0b	[build] do init and precheck first (#16961 ) ### Description This change allows Web CI to do some check as the first step, so that if there are errors it won't launch the task to build web assembly, which is heavy. Checks includes: - "npm ci" in /js, /js/common and /js/web. this implicitly include: - typescript compiler in /js - typescript compiler in /js/common - webpack build in /js/common - typescript compiler in /js/web - ESLint on typescripts - clang-format formatter (.js, .ts, .cc, .h, .mm) - Prettier formatter (.json, .jsonc, .md) --------- Co-authored-by: Caroline Zhu <carolinezhu@microsoft.com@orttrainingdev7.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>	2023-08-04 16:44:45 -07:00
Yi Zhang	555414f1aa	Set PR trigger rules (#16987 ) ### Description Add a script to insert the trigger rules to workflow yamls. First step, skipp windows gpu and linux gpu workflow when there's only doc change ### Motivation and Context Make skipping workflows for doc change easily. [AB#18201](https://aiinfra.visualstudio.com/6a833879-cd9b-44a4-a9de-adc2d818f13c/_workitems/edit/18201)	2023-08-04 08:21:07 -07:00
Edward Chen	06096fcb31	Hardcode xcodebuild destination iOS simulator OS to 16.4. (#16982 )	2023-08-03 14:49:54 -07:00
Dmitri Smirnov	bd4d011142	[C#] Rename unreleased API, add utilities (#16806 ) ### Description 1. rename OrtValue.FillStringTensorElement to StringTensorSetElementAt . To the API user I think we're conceptually setting the string at an offset in the tensor with is roughly equivalent to `List<string> list ... list[index] = "value"`. 2. While working on new inference examples, I noticed that I am still inclined to use `DenseTensor` for N-D indexing. Added `GetStrides()` and `GetIndex()` from strides for long dims, so the user can obtain strides and translate N-D indices into a flat index to operate directly on the native `OrtValue` buffers. Expose these functions to the user. 3. Make sure we generate docs for C# public static functions.	2023-08-02 10:06:42 -07:00
Yulong Wang	4a2a248dd7	remove unused comments in mac CI yml file (#16964 ) ### Description remove unused comments in mac CI yml file	2023-08-01 20:52:12 -07:00
Yulong Wang	afac67bcc3	[build] fix the CI pipeline (#16962 ) ### Description There are currently multiple failures that blocking the CI pipelines so this PR has all of the fixes in order to make sure it passes the CI. Otherwise a single fix will still fail the CI. includes: #16960 #16958 Please help to make sure this PR get merged once CI passed. @snnn @carzh @guschmue Fixed: [AB#18118](https://aiinfra.visualstudio.com/6a833879-cd9b-44a4-a9de-adc2d818f13c/_workitems/edit/18118) --------- Co-authored-by: Caroline Zhu <carolinezhu@microsoft.com@orttrainingdev7.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>	2023-08-01 16:22:45 -07:00
Yulong Wang	969c95f73f	[js/common] a few fixes/revises to onnxruntime-common (#16853 ) ### Description - enable unit test for js/common in CI - add debug config in js/.vscode/launch.json - enable source map for js/common/test for debugging purposes; add source map files to ignore list - ignore js/common/test folder for npm packaging	2023-08-01 11:17:39 -07:00
Yi Zhang	c4e4b98fb2	replace one pool with onnxruntime-Win2022-GPU-T4 (#16953 ) ### Description replace one pool ### Motivation and Context onnxruntime-gpu-tensorrt8-winbuild-t4 would be deprecated	2023-08-01 21:02:56 +08:00
Changming Sun	73ddba964f	Update the MacOS/Linux build scripts that build/install protobuf from source (#16906 ) ### Description 1. As a follow-up of #16761, this PR allows build ORT on iOS/Android without the need to explicitly specify a protoc path. #16761 is for WASM. This one is for iOS/Android 2. Update the MacOS/Linux build scripts that build/install protobuf from source. Make them be more flexible. Add the support for RedHatEnterprise(ubi), which will needed for upgrading the base image from centos:7 to ubi:8. 3. Update tools/ci_build/github/pai/rocm-ci-pipeline-env.Dockerfile : the docker file's base image has preinstalled protobuf in /usr/local, we should uninstall them to avoid conflicts.	2023-07-31 10:51:48 -07:00
Yi Zhang	28a099fca8	unify the steps of downloading cuda sdk and setup env (#16896 ) ### Description The `%AGENT_TEMPDIRECTORY%\v11.8` is created in azcopy step. So, the set env step should be after the azcopy step. ### Motivation and Context Correct the previous logic Unify the step since multiple jobs are using it.	2023-07-31 10:25:04 -07:00
Scott McKay	21a71d52bd	Enable CodeQL for Android build as per 1CS requirement. (#16875 ) ### Description <!-- Describe your changes. --> Split stages for CPU and CPU+NNAPI builds as CodeQL is enabled at the stage level. We run it for CPU+NNAPI as that covers all the Android code. We don't want to run it for both as duplicate issues would be created for a problem in code included in both builds. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->	2023-07-28 17:54:23 +10:00
Yi Zhang	9f21f694cf	stop support to VS 2019 (#16892 ) ### Description Remove VS 2019 code. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->	2023-07-28 13:09:35 +08:00
Changming Sun	9dcbcf1d2f	Delete unused files (#16887 ) ### Description These yaml files and docker files are not used by any pipeline. If I were wrong, feel free to submit a PR to get the wrongly deleted file back from git history (git keeps everything forever).	2023-07-27 16:46:09 -07:00
Yi Zhang	bd95a8ea77	update onnxruntime-gpu-winbuild-T4 to onnxruntime-Win2022-GPU-T4 (#16838 ) ### Description ### Motivation and Context It's also used to upgrade visual studio to VS2022. onnxruntime-gpu-winbuild-T4 and onnxruntime-gpu-tensorrt8-winbuild-t4 are using the image based on one dev branch and VS2019 To avoid breaking the current CIs, we move jobs running on onnxruntime-gpu-winbuild-T4/onnxruntime-gpu-tensorrt8-winbuild-t4 to onnxruntime-Win2022-GPU-T4.	2023-07-27 08:38:20 -07:00
Wang, Mengni	fe463d4957	Support SmoothQuant for ORT static quantization (#16288 ) ### Description Support SmoothQuant for ORT static quantization via intel neural compressor > Note: Please use neural-compressor==2.2 to try SmoothQuant function. ### Motivation and Context For large language models (LLMs) with gigantic parameters, the systematic outliers make quantification of activations difficult. As a training free post-training quantization (PTQ) solution, SmoothQuant offline migrates this difficulty from activations to weights with a mathematically equivalent transformation. Integrating SmoothQuant into ORT quantization can benefit the accuracy of INT8 LLMs. --------- Signed-off-by: Mengni Wang <mengni.wang@intel.com>	2023-07-26 18:56:45 -07:00
Justin Chu	0c1a5098dc	Disable PERF* rules in ruff to allow better readability (#16834 ) ### Description Disable two PERF* rules in ruff to allow better readability. Rational commented inline. This change also removes the unused noqa directives because of the rule change. ### Motivation and Context Readability	2023-07-25 15:38:22 -07:00
Edward Chen	e01365f80b	Update upload_pod_archive_and_update_podspec.sh to take path pattern (#16810 ) Update upload_pod_archive_and_update_podspec.sh to take a pod archive path glob pattern. The actual pod archive path has a version suffix which changes.	2023-07-25 08:55:31 -07:00
Yi Zhang	38db5eca65	replace onnxruntime-Win-CPU-2019 with onnxruntime-Win-CPU-2022 (#16844 ) ### Description <!-- Describe your changes. --> ### Motivation and Context upgrade to VS2022	2023-07-25 23:05:34 +08:00
Yi Zhang	f88f0d8e36	Upgrade 4 stages in nuget pipeline to VS2022 (#16825 ) ### Description ### Motivation and Context Continue upgrading to VS2022 ### Verfication https://dev.azure.com/aiinfra/Lotus/_build/results?buildId=331377&view=results N.B. In practice, SDLNativeRules@3 doesn't support VS2019.	2023-07-25 14:22:39 +08:00
PeixuanZuo	8ede2f139e	[ROCm] Optimize ROCm CI pipeline 2 (#16691 ) - Set `KERNEL_EXPLORER_TEST_USE_CUPY=1` to replace numpy with cupy on kernel explorer test. KERNEL_EXPLORER_TEST_USE_CUPY=0 The CPU utilization is shown as below: ![image](https://github.com/microsoft/onnxruntime/assets/94887879/91724b78-0b4e-4cbd-ad88-83cad9976472) KERNEL_EXPLORER_TEST_USE_CUPY=1 The CPU utilization is shown as below: ![image](https://github.com/microsoft/onnxruntime/assets/94887879/58239911-667c-4d5f-bb78-deca60d0266f) - Use `Bash@3`. - Update shell script.	2023-07-24 13:57:48 +08:00
Yi Zhang	3252ff2cb7	Change DML GPU pool in Windows GPU workflow use Visual Studio 2022 (#16784 ) ### Description 1. use the pool with VS2022 2. upgrade System.Memory to 4.5.5 ### Motivation and Context Solve the build error while using VS2022: `[Failure] Msbuild failed when processing the file 'D:\a\_work\1\s\csharp\src\Microsoft.ML.OnnxRuntime\Microsoft.ML.OnnxRuntime.csproj' with message: Method not found: 'System.ReadOnlySpan`1<Char> Microsoft.IO.Path.GetFileName(System.ReadOnlySpan`1<Char>)'` Ref: https://stackoverflow.com/questions/73399777/azure-build-failing-due-to-method-not-found-system-readonlyspan1char-micros	2023-07-23 10:07:21 +08:00
Justin Chu	d79515041c	[Better Engineering] Bump ruff to 0.0.278 and fix new lint errors (#16789 ) Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * __->__ #16789 Bump ruff to 0.0.278 and fix new lint errors. I added noqa to all existing RUF012 errors which requires mutable class variables to be annotated with `ClassVar`, as well as all PERF issues. Signed-off-by: Justin Chu <justinchu@microsoft.com>	2023-07-21 12:53:41 -07:00
Baiju Meswani	538d2412ef	Objective-C Add Support to Create and Query String ORTValues (#16764 ) This pull request contains a few changes: 1. Adds support for string ort values. 2. Fixes the training minimal build (that was broken with #16601) by putting custom op registration behind #ifdefs 3. Fixes the iOS pod package generation (that was again broken with #16601) by explicitly providing paths to be copied during pod creation.	2023-07-20 17:39:29 -07:00
Adrian Lizarraga	a8c263f92c	[QNN EP] Update QNN SDK to 2.12 (#16750 ) ### Description - Updates the default QNN SDK to 2.12 for CI pipelines - Adds a disabled InstanceNormalization test for regression on QNN SDK 2.12 - Cleans up logs for unsupported ops. ### Motivation and Context Test with the latest QNN SDK.	2023-07-20 16:22:14 -07:00
Yi Zhang	c314d7724f	Update dml gpu pool to onnxruntime-Win2022-GPU-dml-A10 (#16765 ) ### Description onnxruntime-Win2022-GPU-dml-A10 is using VS2022. ### Motivation and Context 1. Upgrade VS2019 to VS2022 to fix prefast issue.	2023-07-20 16:52:13 +08:00
Edward Chen	fc1f463ff1	[ios] Enable training package in packaging pipeline (#16683 ) Build iOS training package in packaging pipeline. Refactor iOS packaging pipeline to build different package variants in parallel.	2023-07-19 19:55:00 -07:00
saurabh	24566058b3	ovep dockerfile and wheel docs changes (#16482 ) ### Description This PR is includes changes in the documentation of _readmeOV.rst_ file and also the changes in the dockerfile which enables to build ORT with latest OpenVINO 2023.0.0 ### Motivation and Context Modified the dockerfile to incorporate the latest version of OpenVINO (2023.0.0) for building Onnxruntime. The changes in the PR aim to improve the overall user experience by providing accurate and up-to-date documentation while leveraging latest OpenVINO 2023.0.0	2023-07-19 09:01:09 -07:00
Edward Chen	df8843c4a7	Upgrade old Python version in packaging pipeline (#16667 ) - Upgrade from Python 3.6 to 3.8 in packaging pipeline. - Raise build.py minimum required Python version.	2023-07-17 08:24:47 -07:00
Adrian Lizarraga	19169afe30	[QNN EP] Add option to skip unit tests in the QNN NuGet packaging pipeline (#16164 ) Add option to skip unit tests in the QNN NuGet packaging pipeline.	2023-07-14 10:52:05 -07:00
Yi Zhang	36b121d8c2	add more check to Web CI on cache restore (#16689 ) ### Description <!-- Describe your changes. --> ### Motivation and Context Make sure the data is correct.	2023-07-14 10:00:13 +08:00
Scott McKay	a3fc04ba74	Fix CodeCoverage pipeline (#16684 ) ### Description <!-- Describe your changes. --> Delete second reference to onnxruntime_api_tests_without_env in the code coverage commands. One was removed in #16373 and the duplicate wasn't noticed. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> Fix pipeline.	2023-07-14 07:47:04 +10:00
PeixuanZuo	ebc311365b	[ROCm] Optimize ROCm CI to reduce time (#16620 ) This PR mainly optimize ROCm CI test to reduce time and CPU utilization. - use smaller batch size on strided_batched_gemm/batched_gemm test - disable cpu training test - fix test_e2e_padding_elimination Occasional failures on ROCm.	2023-07-13 10:58:03 +08:00
Yi Zhang	f3b40abe29	Use pipeline cache to cache onnx node test data. (#16659 ) ### Description Use pipeline cache instead of reading data from the image. ### Motivation and Context 1. To reduce the browser dependency of custom image. 2. The onnx node test data is less than 30M and the cache download time is very short.	2023-07-13 09:26:27 +08:00
PeixuanZuo	596dbe277e	[ROCm] add upgrade to fix security issue (#16668 )	2023-07-12 17:57:18 +08:00
Scott McKay	ce68a4c06a	Fix Linux build failure when onnxruntime_DISABLE_ABSEIL=ON (#16373 ) ### Description <!-- Describe your changes. --> Add ort_value.h to session_options.h so OrtValue is defined. Update a unit test binary to add required include paths. Adding ort_value.h pulls in more data type headers. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> #16193	2023-07-12 11:23:18 +10:00
Yulong Wang	5b6c1394cb	[js/test] CI: use pre-downloaded testdata in image (#16562 ) ### Description update web CI to use pre-downloaded testdata in image	2023-07-10 22:22:14 -07:00
PeixuanZuo	2fd5e1cc39	[ROCm] fix shell bug (#16641 ) `set -ex` with `grep` will exit when grep doesn't meet any string.	2023-07-10 17:31:27 +08:00
PeixuanZuo	cb4bf4f5c8	[ROCm] Move ROCm build step on CPU only machine (#16596 ) - Move ROCm build step on CPU only machine - Add the performance data of the huggingface bert-large model on the MI200 - At the beginning of the test step, check the agent's GPU usage and kill the threads occupying the GPU, which may be left over from previous tasks that exited abnormally. - Use different docker images during the build and test steps. The difference is the `uid` and `user` when build docker image and create docker container.	2023-07-10 11:55:10 +08:00
Edward Chen	e22b0836e7	[objc] Update docs and fix static analysis build (#16617 ) - Update some documentation comments. - Use onnxruntime_training.h as the umbrella header so training API docs are included in generated docs. - Fix static analysis build.	2023-07-07 07:58:54 -07:00
Yi Zhang	fed08e070a	Add compiler cache in linux wasm build (#16579 ) ### Description Add compiler cache in wasm build to accelerate web ci ### Motivation and Context It could reduce the pipeline duration by 30 minutes. web ci could be completed in 2 hours with cache. https://dev.azure.com/onnxruntime/onnxruntime/_build/results?buildId=1053219&view=results	2023-07-06 06:58:48 +08:00
Vrajang Parikh	fd8ad9b950	Enable iOS packaging for training (#16525 ) ### Description Enable support for building iOS packages/CocoaPods with training API - Add `Training` Package variant and config files in current iOS packaging utilities to enable creation of training packages ### Motivation and Context This PR introduces new `Training` variant in `build_and_assemble_ios_pods.py` script which allows creating pods for iOS with training API enabled. The sample script to build training pods: ``` python3 tools/ci_build/github/apple/build_and_assemble_ios_pods.py --variant Training \ --build-settings-file tools/ci_build/github/apple/default_full_ios_training_framework_build_settings.json \ -b=-- path_to_protoc_exe=<path/to/protoc> ``` Note: build settings file should have `--enable_training` as a build parameter. Simply adding training packaging increases the duration of the Azure pipeline for packaging by 70 minutes. To address this issue, we need to parallelize pod creation. In order not to further strain the pipeline, the changes for training packaging will be added in another PR, which optimizes the packaging pipeline. --------- Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>	2023-07-05 13:27:59 -07:00
PeixuanZuo	e2526714e2	[ROCm] Move MIGraphX build step on CPU only machine (#16582 ) - Move MIGraphX build step on CPU only machine - Use ccache on build step - Not pass host uid into docker build process.	2023-07-05 13:55:28 +08:00
Wei-Sheng Chin	a0a5f57581	[DORT] Use new FX-to-ONNX exporter (#16450 ) The ONNX exporter in DORT have been moved to PyTorch as a formal feature. We therefore switch to consume the exporter from PyTorch instead of maintaining two duplicates.	2023-07-04 13:13:04 -07:00
PeixuanZuo	d540c7da0f	[ROCm] Add ROCm5.6 to python package pipeline (#16572 ) Add ROCm5.6 to python package pipeline.	2023-07-04 18:18:12 +08:00
pengwa	ac100ebb64	Fix orttraining-ortmodule-distributed CI (#16569 ) ### Fix orttraining-ortmodule-distributed CI https://pypi.org/project/pydantic/#history released version 2.0 1st July, Deepspeed has known issue on newer version of it (https://github.com/microsoft/DeepSpeed/issues/3280). So fix this by add similar check as DS did in https://github.com/microsoft/DeepSpeed/pull/3290	2023-07-03 13:18:59 +08:00
Scott McKay	2fd25de360	Use verbose logging in Android emulator in React Native CI (#16528 ) ### Description <!-- Describe your changes. --> Set emulator logging to verbose to see if it helps with intermittent React Native CI failures when emulator crashes at startup ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->	2023-06-30 11:51:20 +10:00
Yi Zhang	fb7e1f133f	[Fix] TSA Upload failed in nuget pipeline. (#16476 ) ### Description partially revert PR #16244. ### Motivation and Context npm pipeline couldn't triggered if nuget pipeline status is warning. ### Test Run https://dev.azure.com/aiinfra/Lotus/_build/results?buildId=321873&view=logs&s=b17bed5b-cc14-5026-390a-fb2feea063f2	2023-06-28 06:40:49 +08:00
Rachel Guo	892b1b19ea	[js/rn] limit x86_64 arch in detox xcodebuild for react native e2e test (#16460 ) ### Description <!-- Describe your changes. --> As title. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> Works with local onnxruntime-c pod in js/rn/e2e test.	2023-06-27 09:45:04 -07:00
guyang3532	4768ac5f30	Fix onnxruntime-CI-nightly-ort-pipeline Failure (#16495 ) The image for the onnxruntime-CI-nightly-ort-pipeline is too old. The ort package in the image is older than latest test code in nightly ci. This causes the nightly ci failed.	2023-06-27 23:19:23 +08:00
Yi Zhang	6e9541046e	extend react native ci timeout limit (#16469 ) ### Description <!-- Describe your changes. --> ### Motivation and Context 2 consecutive runs in npm pipeline failed due to time out	2023-06-27 08:44:03 +08:00
Yifan Li	e2c214d81f	[TensorRT EP] TRT 8.6 minor version update (#16475 ) ### Description * Minor version update: TRT 8.6.0.12->8.6.1.6 * CI pipeline ymls/dockerfiles are updated * cgmanifest.json/deps.txt/download-deps.yml are updated; Win trt binaries uploaded to [win img 307029](https://aiinfra.visualstudio.com/AI%20Infra%20Management/_build/results?buildId=307029&view=results) * Re-enable unit tests which were failed in 8.6.0 and re-gained support in 8.6.1	2023-06-26 10:44:27 -07:00
PeixuanZuo	7e211f0e03	[ROCm] Move mount data step into docker container (#16471 ) Some CI jobs may interrupted unexpectedly and didn't execute umount data step. The data left in host device will cause `device or resource busy` and make subsequent CI jobs fail. Move the mount data step into docker container, the host machine will not be occupied when CI jobs exit incorrectly.	2023-06-26 10:25:06 +08:00
Rachel Guo	04dbdc96bf	[js/rn] Fix React Native CI pipeline E2E test (#16447 ) ### Description <!-- Describe your changes. --> Based on this kindly provided quick fix: https://github.com/microsoft/onnxruntime/pull/16411 See more description in the above linked pr about bumping AGP version, etc. Also fixed import header file path in detox e2e test. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> Good build: https://dev.azure.com/onnxruntime/onnxruntime/_build/results?buildId=1041757&view=logs&j=de302ec2-2305-57e0-e8c6-cd89c569f2a3&t=9894c870-b8ce-548d-51ff-8f44d21a4117&l=18	2023-06-22 14:33:49 -07:00
Yi Zhang	8e8840f1de	Enable Web CI on Linux (#16419 ) ### Description 1. Enable Web ci on Linux ### Motivation and Context 1. speed up web ci, the duration can be reduced from 160 minutes to 130 minutes, a time saving of 20% could be be achieved. The total computation time is 455 minutes now. Moved to Linux, it could be reduced to 336 minutes. 2. It's the first step to enable compilation cache for emscripten 3. per Yulong's request, build_web stages are still using windows pool ![image](https://github.com/microsoft/onnxruntime/assets/16190118/c9496408-74bd-45ea-b4ae-a4dd2a574d17) https://dev.azure.com/onnxruntime/onnxruntime/_build/results?buildId=1038382&view=results	2023-06-22 15:42:58 +08:00
yf711	0ad0d6ebbf	Unblock Linux MultiGPU TensorRT CI (#16446 ) ### Description Revert docker base image to nvidia/cuda:11.8.0-cudnn8-devel-ubuntu20.04@sha256:b754c43fe9d62e88862d168c4ab9282618a376dbc54871467870366cacfa456e ### Motivation and Context The default img env of nvidia/cuda:11.8.0-cudnn8-devel-ubuntu20.04 has minor upgrade, which make Linux MultiGPU TensorRT CI (NV12 instance with Maxwell GPU) fail on three CApiTestGlobalThreadPoolsWithProvider tests (these three tests have higher error which are above the tolerance) That minor upgrade includes cudnn 8.7.0->8.9.0, which might be a factor that make maxwell GPU generator higher error. CIs with T4 GPU are not affected.	2023-06-21 17:15:39 -07:00
Rachel Guo	961fa7274a	[NNAPI doc] add reducemean to supported op list (#16414 ) ### Description <!-- Describe your changes. --> As title. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->	2023-06-21 00:29:20 -07:00
Rachel Guo	b4b126ffb0	Set onnxruntime-c local pod path environment variable for react native e2e tests on ci (#16431 ) ### Description <!-- Describe your changes. --> Set onnxruntime-c local pod path environment variable for react native e2e tests on react-native-ci.yml ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> Previously the E2E test project is not properly consuming a local built onnxruntime-c version pod. https://github.com/microsoft/onnxruntime/pull/16411#issuecomment-1598512816	2023-06-21 00:27:36 -07:00
PeixuanZuo	470d6c1cce	[ROCm] Delete unused file to fix Component Governance Alert (#16407 ) Delete unused file to fix Component Governance Alert	2023-06-19 11:28:32 -07:00
PeixuanZuo	1418d8728c	[ROCm] Fix CI Pipeline (#16409 ) 1. add `set -ex` before commands. 2. update ccache.	2023-06-19 15:22:13 +08:00
Yi Zhang	8b9eab093b	keep symlinks in maven package (#16376 ) ### Description 1. Keep symlink in the package. 2. keep the artifact package format ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->	2023-06-19 09:41:39 +08:00
Changming Sun	188d5f5398	Fix Linux Multi GPU build pipeline (#16368 ) ### Description The build pipeline runs on Azure NV12 machines that will be deprecated soon because the SKU is too old. So this PR will move the pipeline to a Windows machine with two A10 GPUs.	2023-06-15 16:24:46 -07:00
Yi Zhang	3e99e43a1d	extend Final AAR testing timeout limit (#16340 ) ### Description <!-- Describe your changes. --> ### Motivation and Context improve nuget pipeline stability	2023-06-15 17:27:45 +08:00
PeixuanZuo	097346be9d	[ROCm] Add clean step for ROCm CI pipeline (#16336 ) 1. Add clean step for ROCm CI pipeline 2. Fix error "device or resource busy" bug by setting umount dataset step as `always()` step.	2023-06-15 13:44:12 +08:00
Baiju Meswani	5eec24837f	Fix for AMD GPU pipeline (#16357 )	2023-06-14 20:36:16 -07:00
Changming Sun	dbc7a195b1	Update win-ci-pipeline.yml: enable xnnpack tests (#16244 ) 1. Enable xnnpack test 2. Change TSA database name from onnxruntime_master to onnxruntime_main. This is a leftover of renaming the "master" branch to "main" 3. Add two static analysis jobs for WinML and DML 4. Rename the machine pool "aiinfra-dml-winbuild" to "onnxruntime-Win2019-GPU-dml-A10", so that the internal and public ADO instances use the same machine pool name. 5. Move Windows GPU CI build pipeline from "onnxruntime-Win2022-GPU-T4" to "onnxruntime-Win2022-GPU-A10" machine pool, because we do not have enough T4 GPUs.	2023-06-14 19:12:42 -07:00
Baiju Meswani	8a3de16d14	Temporary fix to make the training pipeline green (#16353 )	2023-06-14 13:11:35 -07:00
Edward Chen	4f23577cb5	[React Native] Publish E2E test logs on build failure too. (#16327 ) ### Description <!-- Describe your changes. --> Publish E2E test logs on build failure too. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> Get more information about intermittent test failures.	2023-06-12 17:56:46 -07:00
JiCheng	eed02a3f78	Xnnpack QDQ test (#16281 ) ### Description A few QDQ tests failed on XNNPACK EP. The reason should be the range of input_data doesn't fit for scale and zero_point. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->	2023-06-12 14:00:42 +08:00
Yulong Wang	f274bbb0c8	[js] add API that allows to get package version (#16207 ) ### Description Add an API for users to get version of current package. example usage: ```js import { env } from 'onnxruntime-node'; console.log(env.versions.node); // output "1.16.0" ``` ```js import { env } from 'onnxruntime-web'; console.log(env.versions.web); // output "1.16.0" console.log(env.versions.common); // output "1.16.0" console.log(env.versions.node); // output "undefined" ``` #16156	2023-06-09 16:18:53 -07:00
Yi Zhang	3b5a8352c1	CodeSign Mac packages in nuget pipeline (#16291 ) ### Description 1. Updated Mac package workflow for easily debugging. 2. Changed Archive type from tgz to zip since zip is supported by ESRP. 3. .../dylib.dSYM/Contents/Resources/DWARF/libonnxruntime.1.16.0.dylib is a debug symbol file, so it couldn't be signed. ### Motivation and Context It‘s required from VS code. Mac binaries in nuget should be signed	2023-06-10 06:35:47 +08:00
Edward Chen	b668a6da96	Treat Objective-C static analysis warnings as errors (#16293 ) - Update Objective-C static analysis check to fail on warnings. - Address warning. - Clean up build definition.	2023-06-09 08:51:49 -07:00
Vrajang Parikh	67f4a4fd16	Objective-C binding for ORT training (#16127 ) ### Description Implement Objective-C binding for `ORTCheckPoint`. Additionally, - Modify `onnxruntime_objectivec.cmake` to only include training header and sources when training flag is enabled - Enable objective-c binding for `orttraining-mac-ci-pipeline` ### Motivation and Context This PR is part of implementing Objective-C bindings for training API. It implements objective-c binding for ORTCheckPoint class. The objective-C API closely resembles the C++ API. Note: The test for saving checkpoint is skipped as it requires use of training session. It will be added when the objective-c binding for `ORTTrainingSession` is added.	2023-06-07 14:01:30 -07:00
Edward Chen	1261d0b8ba	Fix some build issues on MacOS with Xcode 14.3. (#15878 ) - Fix flatbuffers flatc warning, unused-but-set-variable. - Address `-Wshorten-64-to-32` warnings (fix in our code, allow in dependencies' code). - Update CI builds to use Xcode 14.3. - Update minimum iOS version to 12.0. - Update Mac hosted agents to MacOS 13 where possible.	2023-06-07 12:07:11 -07:00
PeixuanZuo	a95f8ae53c	[ROCm] Update ROCm/MIGraphX CI pipeline (#16215 ) MIGraphX CI - Change docker container user name to `onnxruntimedev` ROCm CI - Build docker image every job instead of using prebuild image. - Every job create a container with only one GPU with command `docker run -it --device=/dev/kfd --device=/dev/dri/renderDxxx` - Remove tests that are unstable or use outdated interfaces. - Enable training ortmodule test.	2023-06-05 10:28:10 +08:00
Changming Sun	6b5b79872b	Avoid taking dependency on dl.fedoraproject.org (#16202 ) ### Description 1. Avoid taking dependency on dl.fedoraproject.org The website is not very stable. Our build pipelines often fail to fetch packages from there. 2. Update manylinux to the latest version	2023-06-02 07:41:46 -07:00
Changming Sun	5bfa1183d1	Add a Memory Profiling build job in post merge pipeline (#16172 ) ### Description 1. Add a Memory Profiling build job 2. Remove no absl build job since the feature will be removed 3. Simplify post-merge-jobs.yml by unifying the pool names ### Motivation and Context To catch build errors in #16124	2023-06-01 13:00:44 -07:00
Yi Zhang	e0199cfbd9	extend mac packaging timeout limit (#16173 ) ### Description ### Motivation and Context MacOS_py_wheels are often failed due to timeout	2023-05-31 18:31:28 +08:00
Baiju Meswani	7edc4b105d	Copy missing training header files to the package archive (#16119 )	2023-05-30 16:45:40 -07:00
Sunghoon	bf05d4ec26	Fix nightly ort CI pipeline (#16162 ) This PR changes [night ort CI pipeline](https://dev.azure.com/onnxruntime/onnxruntime/_build?definitionId=198) to pick up the latest night ACPT image, which was changed from torch 2.0.0.dev to torch 2.1.0.dev.	2023-05-30 14:00:34 -07:00
Xavier Dupré	e726151b5c	Introduce float 8 types (#14731 ) ### Description The PR implements FloatE4M3FN, FloatE5M2, FloatE4MEFNUZ, FloatE5M2FNUZ as described in PR https://github.com/onnx/onnx/pull/4805. It uses CUDA API to cast float/half to float8 if CUDA>=11.8, a custom implementation if CUDA<11.8. * It implements, Cast, QuantizeLinear, DequantizeLinear for all types on CPU, only for types FloatE4M3FN, FloatE5M2 on CUDA. * It extends the supported types for control flow operator, Shape, Reshape, Identity, If, Loop, Scan, Reshape * It implements Equal(19). * Cast, QuantizeLinear, DequantizeLinear operators now support a parameter `saturate` only valid for float 8 types. It is true by default. In that case, any value out of range is converted into the maximum float 8 value. If false, it is infinite. * QuantizeLinear, DequantizeLinear now supports multiple scales on CUDA (and ROCm by extension), scale = 1D tensor with one scale per channel ### Motivation and Context Supports latest onnx version. Fixes [AB#15395](https://aiinfra.visualstudio.com/6a833879-cd9b-44a4-a9de-adc2d818f13c/_workitems/edit/15395) --------- Co-authored-by: Xavier Dupre <xadupre@microsoft.com@orttrainingdev8.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net> Co-authored-by: Randy Shuai <rashuai@microsoft.com> Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com> Co-authored-by: Scott McKay <Scott.McKay@microsoft.com>	2023-05-30 13:25:58 -07:00
Yi Zhang	31fc25d2c2	[Fix] Check if CUDA is downloaded in AGENT_TEMPDIRECTORY (#16142 ) ### Description supplement of #15915 ### Motivation and Context fix nuget pipeline exception in the stage of Final_Jar_Testing_Windows_GPU ``` JUnit Jupiter:ProviderOptionsTest:testCUDAOptions() MethodSource [className = 'ai.onnxruntime.providers.ProviderOptionsTest', methodName = 'testCUDAOptions', methodParameterTypes = ''] => ai.onnxruntime.OrtException: Error code - ORT_RUNTIME_EXCEPTION - message: D:\a\_work\1\s\onnxruntime\core\session\provider_bridge_ort.cc:1131 onnxruntime::ProviderLibrary::Get [ONNXRuntimeError] : 1 : FAIL : LoadLibrary failed with error 126 "" when trying to load "C:\Users\cloudtest\AppData\Local\Temp\onnxruntime-java17193857285260738736\onnxruntime_providers_cuda.dll" ``` ### Verification https://dev.azure.com/aiinfra/Lotus/_build/results?buildId=313476&view=results	2023-05-30 13:14:08 +08:00
Yi Zhang	73584f9360	More fixes on nuget pipeline (#16091 ) ### Description 1. parameters couldn't using string to comprare, change it to boolean. 2. Windows_CI_GPU_DML_DEV_arm64 on the pool onnxruntime-Win-CPU-2022 failed to pass prefast step, change the pool to aiinfra-dml-winbuild. 3. skipped test_zfnet512, it's failed in Nuget_Test_Win_Training_CPU Todo Only Final_Jar_Testing_Windows_GPU failed now. https://dev.azure.com/aiinfra/Lotus/_build/results?buildId=313042&view=logs&s=d66543d5-16de-5a48-6ecb-a36e21ff8d4d&j=d9489789-5e39-5a05-13ab-9aaf7b4d386f	2023-05-27 08:59:12 +08:00
Changming Sun	60bb07307b	Fix the TRT GPU build job in python packaging pipeline (#16073 ) 1. Cherry-pick #16054 back to the main branch 2. Replace onnxruntime-gpu-winbuild-t4 with onnxruntime-Win2022-GPU-T4. The later one has VS2022. --------- Co-authored-by: Patrice Vignola <vignola.patrice@gmail.com>	2023-05-25 00:09:08 -07:00
Yi Zhang	76fd9aa745	[Fix] Some pipelines have to be using VS2019 (#16034 ) ### Description ### Motivation and Context Fix nuget and python package pipeline. 1. ARM 32 build isn't supported by VS2022 officially. https://developercommunity.visualstudio.com/t/Compilation-Error-with-VS2022-ARM/10285309 2. onnxruntime-gpu-winbuild-T4 and onnxruntime-gpu-winbuild-tensorrt8-T4 haven't VS 2022	2023-05-25 09:55:35 +08:00
yf711	84f1af7ff5	ort build flag fix (#16072 ) ### Description * Sync and clean build flag `--use_tensorrt_builtin_parser` from existing CI config as this becomes default flag * cuda version update	2023-05-24 12:32:10 -07:00
Guenther Schmuelling	20857c4ff2	workaround test failure in ci (#16070 ) don't run wasm proxy test on debug build to unblock ci. Needs some longer debugging.	2023-05-24 21:01:06 +08:00
Shukant Pal	f316bc57c4	[CoreML EP] Implement Unary & Reduce operators (#15532 ) ### Description This change is a follow-up to #15327. It adds Unary operators (Sqrt, Reciprocal) and Reduce operators (ReduceSum, ReduceMean). I've tried to follow existing patterns in the code :-) ### Motivation and Context This reduces fragmentation across EPs when using CoreML on macOS, thereby speeding up execution. --------- Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>	2023-05-24 18:16:59 +10:00
RandySheriffH	d35361bf9d	Fix python pipeline for AzureEP without using root (#16023 ) Fix python pipeline for AzureEP without using root, this is for 1.15. --------- Co-authored-by: Randy Shuai <rashuai@microsoft.com>	2023-05-22 16:38:47 -07:00
Hector Li	4324d2173b	[QNN EP] Enable Qnn context cache to save model initialization time (#15815 ) ### Description Enable Qnn Context cache feature to save model initialization time Provider options: qnn_context_cache_enable\|1 to enable the cache feature qnn_context_cache_path to set the cache path. It is set to model_file.onnx.bin by default. ### Motivation and Context Model initialization time takes long because the cost of conversion from Onnx model to Qnn model. Qnn have feature to serialize the Qnn context to file, then next time user can load it from the cache context and execute the graph to save the cost. --------- Co-authored-by: Adrian Lizarraga <adlizarraga@microsoft.com>	2023-05-19 10:52:17 -07:00
RandySheriffH	4dfb89b3ad	Implement mutex-free spin lock for task queue (#14834 ) Implemented "lock-free" spinlock to save CPU usage on context switching. The change has been tested on queene service of Ads team, the lock-free version of ort (40 threads) saves CPU usage on gen8 (128 logical processors on 8 numa nodes) windows by nearly half, from 65% to 35%. For 32 cores, the curve is flat: Anubis, 32 vCPU, windows, hugging face models, 95 percentile E2E latency in ms: model \| mutex(ms) \| mutex-free --- \| --- \| --- alvert_base_v2 \| 34.21 \| 34.09 bert_large_uncased \| 116.27\| 117.84 bart_base \| 72.06 \| 71.99 distilgpt2 \| 25.43 \| 25.02 vit_base_patch16_224 \| 37.33 \| 37.76 Anubis, 32 vCPU win, Linux, 1st party models, 95 percentile E2E latency in ms: model \| mutex(ms) \| mutex-free --- \| --- \| --- deepthink_v2 \| 24.35 \| 22.95 bing_feeds \| 36.96 \| 36.48 deep_writes \| 14.46 \| 14.32 keypoints \| 9.34 \| 7.69 model11 \| 1.71 \| 1.66 model12 \| 1.82 \| 1.44 model2 \| 4.21 \| 3.95 model6 \| 1.08 \| 1.05 agiencoder \| 0.99 \| 0.93 geminet_transformer \| 5.32 \| 5.24 --------- Co-authored-by: Randy Shuai <rashuai@microsoft.com>	2023-05-19 10:12:10 -07:00
PeixuanZuo	d78bbf5ef2	[ROCm] remove ROCm5.2.3, ROCm5.3, ROCm5.4 from pipeline (#16004 ) remove ROCm5.2.3, ROCm5.3, ROCm5.4 from pipeline.	2023-05-19 10:29:01 +08:00
Edward Chen	6d46007028	Add explicit 'set +x' before printing a vso[] command to avoid output getting parsed again with a trailing quote. (#15986 ) Here's the motivating issue: https://github.com/microsoft/azure-pipelines-tasks/issues/10331 Noticed some problems in other repos so also updating usages in ORT. We may be fine now without it, but this change adds some safeguard against future additions of 'set -x' for debugging.	2023-05-17 19:30:28 -07:00
Changming Sun	d98763473a	Change CUDA pipelines to download CUDA SDK in every build job (#15915 ) ### Description Change CUDA pipelines to download CUDA SDK in every build job ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->	2023-05-17 17:31:51 -07:00
Yi Zhang	6d43d51eb0	[Fix] No test result report while not using ctest (#15976 ) ### Description 1. Set gtest output while ctest is set to empty. 2. onnx_src in _deps shouldn't be removed because onnx_test_pytorch_converted and onnx_test_pytorch_converted need to read data from onnx/backend/test/data/.. ### Motivation and Context Test result report is important to find the flaky tests. ### To do Tests are not inconsistent. If ctest_path is empty, onnx_test_pytorch_converted and onnx_test_pytorch_converted will not be executed, if it's not, onnxruntime_mlas_test will not be executed. `270c09a37f/tools/ci_build/build.py (L1743-L1753)`	2023-05-17 08:31:16 -07:00
Jian Chen	2881d849d4	Update Win-CPU-2021 to onnxruntime-Win-CPU-2022 (#15967 ) ### Description After this PR there are following pool need to be updated. old\|new\|note ---\|---\|--- onnxruntime-Win2019-GPU-dml-A10\|tbd\| onnxruntime-Win2019-GPU-T4\|onnxruntime-Win2022-GPU-T4\| onnxruntime-Win2019-GPU-training-T4\|onnxruntime-Win2022-GPU-T4\|ame as the above because we do not have many T4 GPUs onnxruntime-tensorrt8-winbuild-T4\|tbd\| aiinfra-dml-winbuild\|tbd\| ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->	2023-05-17 08:29:27 -07:00
Jian Chen	780442b9f6	Change windows machine pools to use VS2022  (#15806 ) ### Description <!-- Describe your changes. --> Old pool \| New pool \| Notes -- \| -- \| -- onnxruntime-Win-CPU-2019 \| onnxruntime-Win-CPU-2022 \| onnxruntime-Win2019-CPU-training \| onnxruntime-Win2022-CPU-training-AMD \| onnxruntime-Win2019-CPU-training-AMD \| onnxruntime-Win2022-CPU-training-AMD \| Same as the above onnxruntime-Win2019-GPU-dml-A10 \| Need be created \| You need to create a new image for it first onnxruntime-Win2019-GPU-T4 \| onnxruntime-Win2022-GPU-T4 \| onnxruntime-Win2019-GPU-training-T4 \| onnxruntime-Win2022-GPU-T4 \| Same as the above because we do not have many T4 GPUs onnxruntime-tensorrt8-winbuild-T4\| TBD\|TBD Win-CPU-2021\|onnxruntime-Win-CPU-2022\| will do it in next PR Win-CPU-2019\|onnxruntime-Win2022-Intel-CPU'\| Intel CPU needed for win-ci-pipeline.yml -> `stage: x64_release_dnnl` <br class="Apple-interchange-newline"> ### Motivation and Context With vs2022 we can take the advantage of 64bit compiler. It also with better c++20 support	2023-05-16 10:34:34 -07:00
RandySheriffH	7faad53632	Set default option for package name and build arg options (#15958 ) Set default value for parameters in nuget-zip pipeline, and only apply the configurations when they are not "NONE". --------- Co-authored-by: Randy Shuai <rashuai@microsoft.com>	2023-05-16 09:07:38 -07:00
yf711	825d691617	Unify cuda & trt version on few CIs (#15943 ) ### Description The cuda & trt version of some CIs didn't sync with the majority. Unifying cuda version as 11.8 and trt version as 8.6 on these CIs	2023-05-15 09:54:30 -07:00
Rachel Guo	18133ddadb	[doc] add LeakyRelu to coreml supported ops (#15944 ) ### Description <!-- Describe your changes. --> ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->	2023-05-15 09:46:30 -07:00
Adrian Lizarraga	5542e70dd1	[QNN EP] Update default QNN SDK version to 2.10 for QNN NuGet pipeline (#15899 ) ### Description Updates the default QNN SDK version to 2.10 for the QNN NuGet pipeline. ### Motivation and Context Ensures that the daily QNN NuGet pipeline builds ORT using the latest QNN SDK by default.	2023-05-15 09:17:42 -07:00
PeixuanZuo	af6cb2af87	[ROCm] update ROCm/MIGraphX CI to ROCm5.5 (#15905 ) update ROCm/MIGraphX CI to ROC5.5. TODO: two PR to fix failure on orttraining/orttraining/test/python/orttraining_test_ortmodule_api.py - test_gradient_correctness_minmax/test_gradient_correctness_argmax_unfold/test_gradient_correctness_argmax_diagonal (https://github.com/microsoft/onnxruntime/pull/15903) - test_ortmodule_attribute_name_collision_warning (https://github.com/microsoft/onnxruntime/pull/15884)	2023-05-15 10:28:15 +08:00
Yi Zhang	b20d5e85d5	Update Cuda to 11.8 in 2 Linux GPU workflows. (#15925 ) ### Description use template variable for cuda version ### Motivation and Context	2023-05-14 12:51:25 +08:00
RandySheriffH	7c4e8267e7	Implement openAI endpoint invoker for nuget (#15797 ) Implement openAI audio endpoint, and enable nuget packaging. --------- Co-authored-by: Randy Shuai <rashuai@microsoft.com>	2023-05-11 22:04:02 -07:00
Yi Zhang	0e7ae13e74	Run Linux GPU tests in docker container (#15872 ) ### Description Run Linux GPU tests in docker container ### Motivation and Context	2023-05-12 06:29:22 +08:00
Jian Chen	1a73d61829	Update eigen to 3.4 and remove the eigen from git submodule (#15875 ) ### Description Update eigen to 3.4 and remove the eigen from git submodule ### Motivation and Context We need to have eigen 3.4 for c++20	2023-05-11 11:56:59 -07:00
Changming Sun	7c58d013aa	Remove Ubuntu 18.04 usages (#15781 ) ### Description Remove Ubuntu 18.04 usages because it will be EOL this month. ### Motivation and Context	2023-05-11 11:44:00 -07:00
Yulong Wang	756cf3a76f	increase web CI timeout (#15876 ) ### Description The CI is extremely slow on downloading source code (~1MB/sec) so the web CI went timeout. This is blocking the PR/checks. Increase the timeout temporarily.	2023-05-11 11:17:46 -07:00
liqun Fu	ac9ae9f7c5	update onnx release 1.14 for docker files (#15680 ) ### Description this is for ort 1.15 release to work with onnx 1.14 It shall be merged after onnx 1.14 release and before ort 1.15 release. ### Motivation and Context --------- Signed-off-by: Liqun Fu <liqfu@microsoft.com>	2023-05-10 13:15:56 -07:00
Nat Kershaw (MSFT)	36c9ae0f58	Fix release version suffix for RC builds (#15865 )	2023-05-09 23:06:08 -07:00
Jian Chen	34cb293c6b	Remove unused ADO YML pipeline template (#15857 ) ### Description Remove unused ADO YML pipeline template ### Motivation and Context Clean up and reduce our codebase.	2023-05-09 09:15:04 -07:00
Yulong Wang	0457fd0b40	upgrade emsdk to 3.1.37 (#15817 ) ### Description upgrade emsdk to 3.1.37 WIP branch to debug the mystery memory issue in web assembly multi-thread build.	2023-05-08 16:49:47 -07:00
Yi Zhang	045c623415	Make Nuget workflow easy to debug (#15808 ) ### Description Fix the bug in #15693 ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->	2023-05-08 20:53:08 +08:00

1 2 3 4 5 ...

1688 commits