onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-06-13 01:09:22 +00:00

Author	SHA1	Message	Date
Yulong Wang	45ff957973	1.17.3 cherry-picks for ORT Web changes (#19926 ) ### Description This PR is a preview of cherry-picks for ort-web to `rel-1.17.3` based on `rel-1.17.2`. <details> <summary>Changes of ort-web to cherry-pick</summary> The following commits are from main branch. `o` stands for pick, and `x` stands for skip. ``` o `2e0a388c36` [js/webgpu] Add HardSigmoid support (#19215) o `d226e40856` [js/webgpu] set query type in onRunStart (#19202) o `61610ff986` [js/webgpu] Add FusedConv clip test case (#18900) o `a33b5bd1fa` [JS/WebGPU] Added Uniforms to SkipLayerNorm. (#18788) o `591f90c0b9` [js/webgpu] Fix issue of timestamp query (#19258) o `7252c6e747` [WebNN EP] Support WebNN async API with Asyncify (#19145) o `5b06505073` [js/webgpu] Fix Tanh explosion (#19201) o `656ca66186` [js/webgpu] Support uniforms for conv, conv transpose, conv grouped (#18753) o `a3f0e2422b` [js/webgpu] Support f16 uniform (#19098) o `9e69606360` fix f16 for attention, enable slice and flatten for more types (#19262) o `624b4e2063` [js/webgpu] Remove enableShapesUniforms (#19279) o `90883a366a` [js/webgpu] Add hardSigmoid activation for fusedConv (#19233) o `85cef0af8c` [js/webgpu] Support capture and replay for jsep (#18989) o `d73131cf0f` [js/webgpu] Use DataType as uniform cpu type (#19281) o `dd1f6ccc45` [js/webgpu] resolve codescan alert (#19343) o `3a2ab1963a` [js/webgpu] Refactor createTensorShapeVariables (#18883) o `efc17e79de` [js/webgpu] Fix the undefined push error (#19366) x `50806a7dd5` [js/web] support external data in npm test (#19377) o `ccbe264a39` [js/webgpu] Add LeakyRelu activation for fusedConv (#19369) o `5ff27ef02a` [js/webgpu] support customop FastGelu (#19392) x `03be65e064` [js/web] fix types exports in package.json (#19458) o `06269a3952` [js/webgpu] allow uint8 tensors for webgpu (#19545) o `dfeda9019c` [JS/WebGPU] Add MatMulNBits (#19446) o `1b48054e1b` [js/webgpu] Create Split indices helpers by rank, not by shape (#19554) o `3fe2c137ee` [js] small fix to workaround formatter (#19400) x `70567a4b3a` [js/web] use ApiTensor insteadof onnxjs Tensor in TensorResultValidator (#19358) o `6e04e36e3f` [js/common] upgrade tsc in common from 4.9.5 to 5.2.2 (#19317) o `58f4921686` [js] changes to allow Float16Array if any polyfill is available (#19305) o `57d6819212` [js/web] Fix fused-conv is not included in npm test (#19581) o `ebd220b073` Misspelling in README.md (#19433) o `38c3432393` Bump ip from 1.1.8 to 1.1.9 in /js/react_native (#19582) o `fe82fccf1a` [js/webgpu] Fix Conv2DTransposeMatMul f16 compilation failure (#19596) o `76a2a487a1` Bump ip from 1.1.8 to 1.1.9 in /js/react_native/e2e (#19583) o `29b1106033` [node] Switch to setImmediate to avoid starving the Node.js event loop (#19610) o `ae3d73c981` [JS/WebGPU] Fix Split and Where to handle corner cases. (#19613) o `aec2389ad0` [js/webgpu] allows a ProgramInfo's RunData to use zero sized output (#19614) o `bb43a0f133` [js/webgpu] minor fixes to make tinyllama work (#19564) o `0edb035808` [js/web] fix suite test list for zero sized tensor (#19638) o `3cb81cdde2` [js/common] move 'env.wasm.trace' to 'env.trace' (#19617) o `e30618d055` [js/webgpu] use Headless for webgpu test by default (#19702) o `f06164ef8b` [js/web] transfer input buffer back to caller thread (#19677) x `a788514027` [js/web] dump debug logs for karma for diagnose purpose (#19785) o `24b72d2613` [JS/WebGPU] Preserve zero size input tensor dims. (#19737) o `4538d31a8b` [js/webgpu] expose a few properties in WebGPU API (#19857) o `53de2d8cb0` [js/webgpu] Enable GroupedConvVectorize path (#19791) o `ed250b88c3` [JS/WebGPU] Optimize MatMulNBits (#19852) x `e771a763c3` [js/test] align web test runner flags with ort.env (#19790) o `79e50aeef3` [js/web] rewrite backend resolve to allow multiple EPs (#19735) o `acb0df2280` Fix #19931 broken Get Started link of "ONNX Runtime JavaScript API" page (#19932) o `b29849a287` [js/common] fix typedoc warnings (#19933) o `afdab62f53` Bump follow-redirects from 1.15.4 to 1.15.6 in /js/web (#19949) o `28ad6c3955` Bump follow-redirects from 1.15.4 to 1.15.6 in /js/node (#19951) o `7e0d424934` accumulate in fp32 for Reduce* (#19868) o `4c6a6a37f7` [js/webgpu] Fix NAN caused by un-initialized buffer in instance-norm (#19387) o `01c7aaf6aa` [js/webgpu] allow setting env.webgpu.adapter (#19940) o `c45cff60cf` [js/webgpu] fix maxpool / fp16 (#19981) ``` </details> <details> <summary>Cherry-pick commandlines</summary> ```sh git cherry-pick `2e0a388c36` git cherry-pick `d226e40856` git cherry-pick `61610ff986` git cherry-pick `a33b5bd1fa` git cherry-pick `591f90c0b9` git cherry-pick `7252c6e747` git cherry-pick `5b06505073` git cherry-pick `656ca66186` git cherry-pick `a3f0e2422b` git cherry-pick `9e69606360` git cherry-pick `624b4e2063` git cherry-pick `90883a366a` git cherry-pick `85cef0af8c` #<<<<< Note: conflicts git cherry-pick `d73131cf0f` git cherry-pick `dd1f6ccc45` git cherry-pick `3a2ab1963a` git cherry-pick `efc17e79de` git cherry-pick `ccbe264a39` git cherry-pick `5ff27ef02a` git cherry-pick `06269a3952` git cherry-pick `dfeda9019c` git cherry-pick `1b48054e1b` git cherry-pick `3fe2c137ee` git cherry-pick `6e04e36e3f` git cherry-pick `58f4921686` git cherry-pick `57d6819212` git cherry-pick `ebd220b073` git cherry-pick `38c3432393` git cherry-pick `fe82fccf1a` git cherry-pick `76a2a487a1` git cherry-pick `29b1106033` git cherry-pick `ae3d73c981` git cherry-pick `aec2389ad0` git cherry-pick `bb43a0f133` git cherry-pick `0edb035808` git cherry-pick `3cb81cdde2` git cherry-pick `e30618d055` git cherry-pick `f06164ef8b` git cherry-pick `24b72d2613` git cherry-pick `4538d31a8b` git cherry-pick `53de2d8cb0` git cherry-pick `ed250b88c3` git cherry-pick `79e50aeef3` git cherry-pick `acb0df2280` git cherry-pick `b29849a287` git cherry-pick `afdab62f53` git cherry-pick `28ad6c3955` git cherry-pick `7e0d424934` git cherry-pick `4c6a6a37f7` git cherry-pick `01c7aaf6aa` git cherry-pick `c45cff60cf` ``` </details> <details> <summary>Cherry-pick conflicts</summary> - `85cef0af8c` #18989 this change is for enabling graph capture feature for JSEP, and it is done after ROCM EP enabled graph capture feature. However, the ROCM EP graph capture feature is not cherry-picked in rel-1.17.2. </details> --------- Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: Jiajia Qin <jiajia.qin@intel.com> Co-authored-by: Xu Xing <xing.xu@intel.com> Co-authored-by: satyajandhyala <satya.k.jandhyala@gmail.com> Co-authored-by: Yang Gu <yang.gu@intel.com> Co-authored-by: Wanming Lin <wanming.lin@intel.com> Co-authored-by: Jiajie Hu <jiajie.hu@intel.com> Co-authored-by: Guenther Schmuelling <guschmue@microsoft.com> Co-authored-by: Matttttt <18152455+martholomew@users.noreply.github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Segev Finer <segev208@gmail.com> Co-authored-by: Belem Zhang <belem.zhang@intel.com>	2024-03-29 13:13:39 -07:00
Rachel Guo	046d06ff26	Cherry-pick for 1.17.3 (#20013 ) ### Description <!-- Describe your changes. --> Web prs are not included yet. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> --------- Co-authored-by: Yufeng Li <liyufeng1987@gmail.com> Co-authored-by: Maximilian Müller <44298237+gedoensmax@users.noreply.github.com> Co-authored-by: Yi Zhang <zhanyi@microsoft.com> Co-authored-by: Your Name <your@email.com> Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com> Co-authored-by: enximi <70036307+enximi@users.noreply.github.com> Co-authored-by: George Wu <jywu@microsoft.com> Co-authored-by: Markus Tavenrath <mtavenrath@users.noreply.github.com> Co-authored-by: Tianlei Wu <tlwu@microsoft.com> Co-authored-by: rachguo <rachguo@rachguos-Mini.attlocal.net> Co-authored-by: Adam Pocock <adam.pocock@oracle.com> Co-authored-by: aciddelgado <139922440+aciddelgado@users.noreply.github.com> Co-authored-by: kunal-vaishnavi <115581922+kunal-vaishnavi@users.noreply.github.com> Co-authored-by: Changming Sun <chasun@microsoft.com> Co-authored-by: Chi Lo <54722500+chilo-ms@users.noreply.github.com>	2024-03-29 13:10:13 -07:00
Rachel Guo	6bc6adc658	Update version number to 1.17.2 (#19701 ) ### Description <!-- Describe your changes. --> As title. Follow up pr for source code release 1.17.2 ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> --------- Co-authored-by: rachguo <rachguo@rachguos-Mini.attlocal.net> Co-authored-by: Changming Sun <chasun@microsoft.com>	2024-03-01 13:51:00 -08:00
Rachel Guo	75968b9eca	Cherry-pick for 1.17.1 patch release (#19477 ) ### Description <!-- Describe your changes. --> As title. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> --------- Co-authored-by: petermcaughan <peter.mcaughan@gmail.com> Co-authored-by: Peter McAughan <petermca@microsoft.com> Co-authored-by: Adrian Lizarraga <adlizarraga@microsoft.com> Co-authored-by: Patrice Vignola <vignola.patrice@gmail.com> Co-authored-by: ivberg <ivberg@microsoft.com> Co-authored-by: Yulong Wang <7679871+fs-eire@users.noreply.github.com> Co-authored-by: Baiju Meswani <bmeswani@microsoft.com> Co-authored-by: Preetha Veeramalai <preetha.veeramalai@intel.com> Co-authored-by: fxmarty <9808326+fxmarty@users.noreply.github.com> Co-authored-by: Sheil Kumar <smk2007@gmail.com> Co-authored-by: Sheil Kumar <sheilk@microsoft.com> Co-authored-by: Prathik Rao <prathik.rao@gmail.com> Co-authored-by: Shubham Bhokare <32080845+shubhambhokare1@users.noreply.github.com> Co-authored-by: kunal-vaishnavi <115581922+kunal-vaishnavi@users.noreply.github.com> Co-authored-by: Jian Chen <cjian@microsoft.com> Co-authored-by: Xavier Dupré <xadupre@users.noreply.github.com> Co-authored-by: satyajandhyala <satya.k.jandhyala@gmail.com>	2024-02-21 12:51:37 -08:00
Rachel Guo	5f0b62cde5	[ORT 1.17.0 Release] Cherry-pick Final Round (#19327 ) ### Description <!-- Describe your changes. --> Cherry-pick Final Round ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> --------- Co-authored-by: Adrian Lizarraga <adlizarraga@microsoft.com> Co-authored-by: Changming Sun <chasun@microsoft.com> Co-authored-by: Chi Lo <54722500+chilo-ms@users.noreply.github.com> Co-authored-by: rachguo <rachguo@rachguos-Mini.attlocal.net> Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com> Co-authored-by: kunal-vaishnavi <115581922+kunal-vaishnavi@users.noreply.github.com> Co-authored-by: aciddelgado <139922440+aciddelgado@users.noreply.github.com> Co-authored-by: Yufeng Li <liyufeng1987@gmail.com>	2024-01-30 16:51:05 -08:00
Rachel Guo	3fd94a8cc7	[ORT 1.17.0 Release] Cherry pick 1st round (#19243 ) ### Description <!-- Describe your changes. --> [ORT 1.17.0 Release] Cherry pick 1st round PR authors please take a look, and let me know if there are any questions about the changes or approve accordingly. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> --------- Co-authored-by: wejoncy <wejoncy@163.com> Co-authored-by: Xavier Dupré <xadupre@users.noreply.github.com> Co-authored-by: Yulong Wang <7679871+fs-eire@users.noreply.github.com> Co-authored-by: Hector Li <hecli@microsoft.com> Co-authored-by: luoyu-intel <yu.luo@intel.com> Co-authored-by: kunal-vaishnavi <115581922+kunal-vaishnavi@users.noreply.github.com> Co-authored-by: Chi Lo <54722500+chilo-ms@users.noreply.github.com> Co-authored-by: Ye Wang <52801275+wangyems@users.noreply.github.com> Co-authored-by: Adrian Lizarraga <adlizarraga@microsoft.com> Co-authored-by: snadampal <87143774+snadampal@users.noreply.github.com> Co-authored-by: Tianlei Wu <tlwu@microsoft.com> Co-authored-by: Heflin Stephen Raj <heflinstephen03@gmail.com> Co-authored-by: Yifan Li <109183385+yf711@users.noreply.github.com> Co-authored-by: Yufeng Li <liyufeng1987@gmail.com> Co-authored-by: Changming Sun <chasun@microsoft.com>	2024-01-26 20:11:48 -08:00
Adrian Lizarraga	daafe63ecc	cherry pick qnn sdk 2.18 updates into release branch (#19197 ) cherry picked from commit `28a16c223c` https://github.com/microsoft/onnxruntime/pull/19129	2024-01-19 17:04:47 -08:00
Rachel Guo	a63b71eadb	Cherry-pick "Fix buildJava from Zip-Nuget-Java-Nodejs Packaging Pipeline (#19187 )" (#19194 ) ### Description Cherry-pick "Fix buildJava from Zip-Nuget-Java-Nodejs Packaging Pipeline (#19187)"	2024-01-18 13:44:48 -08:00
Changming Sun	e2e488d6f8	Revert "iOS packaging pipeline stability" (#19135 ) Reverts microsoft/onnxruntime#19097 because it broken Android CI pipeline.	2024-01-16 09:18:35 -08:00
Jian Chen	c92f72ebeb	Merge Linux Nuget GPU pipeline with zip-nuget (#19120 ) ### Description <!-- Describe your changes. --> ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->	2024-01-16 08:59:03 -08:00
pengwa	1150b1f81e	ORTModule memory improvement (#18924 ) ## Dependency https://github.com/microsoft/onnxruntime/pull/19007 ## ORTModule memory efficient gradient management Previously I have tried to solve the coarsed-grained gradient accumulation/update problem in ORTModule with https://github.com/microsoft/onnxruntime/pull/8979, while that resolution somehow is not fully validated with DDP or there is user hooks on the gradient accumulation on torch parameter. This PR is addressing the problem in the similar approach as PR 8979, e.g. trigger gradient accumulation once ORT computed the grad, but instead of use a AccumulateGrad op, this time with a ONNX operator PythonOp, internally it will call param.backward(grad), which will help handle all related hooks correctly. ## Design Check the details from https://microsoftapc-my.sharepoint.com/:p:/g/personal/pengwa_microsoft_com/EaaBq4EzsFhOmsDEXCG7Ba4Bb9bwd0O2sFV_JXJ4jBLYLA?e=7Sz2g8&nav=eyJzSWQiOjI3MSwiY0lkIjozMjE4NzI1NDIzfQ ## Convergence Validation: ![image](https://github.com/microsoft/onnxruntime/assets/10530022/ccf3a213-e815-4b23-b759-165033b2d9fe) differences are on mostly 0.000x, sometimes 0.00x, which may comes from the different order gradient apply happens before or after this change (on deepspeed zero stage 2) ## TODO Consolidate the logic with Stage3's similar logic.	2024-01-16 08:57:37 +08:00
Yi Zhang	922a2f00e3	Extend timeout in Nuget-CUDA-Packaging-Pipeline (#19138 ) ### Description <!-- Describe your changes. --> ### Motivation and Context Linux_GPU_x64 job in the pipeline has been canceled due to timeout since 0112.	2024-01-15 14:37:22 +08:00
Jian Chen	c3ce9df80c	Disabling python3.12 on training python packaging pipleines (#19123 )	2024-01-14 14:51:00 -08:00
Jian Chen	76797127d6	Always download cuda and trt libraries from Azure blob (#19118 ) ### Description This way, we will not need to update the windows images constantly and allow more flexibility to choose the cuda version in the future.	2024-01-14 11:37:26 -08:00
Yulong Wang	f917dde717	[web] remove xnnpack from web backends (#19116 ) ### Description XNNPACK is already disabled in web assembly build. This change removes the xnnpack backend registration in JS.	2024-01-13 23:04:02 -08:00
Edward Chen	e1e45901e2	iOS packaging pipeline stability (#19097 ) - Remove protoc build step which sometimes times out. Download protoc instead. - Use macOS-12 image in the set variables stage. It seems more stable.	2024-01-13 19:27:44 -08:00
Changming Sun	5558912d7b	Disable ccache in Windows CPU CI pipeline (#19131 ) ### Description Disable ccache for all the jobs in in Windows CPU CI pipeline. Before disabling it, the build has a warning that: "MSIL .netmodule or module compiled with /GL found; restarting link with /LTCG; add /LTCG to the link command line to improve linker performance" After disabling it, the warning is gone and the build doesn't use /GL or /LTCG. Cache itself should not cause this difference. ### Motivation and Context	2024-01-13 18:40:43 -08:00
Adrian Lizarraga	65893ef382	Add --parallel to QNN EP NuGet pipeline build command (#19126 ) ### Description Add --parallel to QNN EP NuGet pipeline build command ### Motivation and Context Improve build times for pipeline.	2024-01-13 02:38:40 -08:00
Jian Chen	78e796bb27	Fixing issue where unzip package froim 'onnxruntime-win-x64-gpu' was also uploaded. (#19096 ) ### Description Fixing issue where unzip package froim 'onnxruntime-win-x64-gpu' was also uploaded. For example, https://aiinfra.visualstudio.com/Lotus/_build/results?buildId=396440&view=artifacts&pathAsName=false&type=publishedArtifacts	2024-01-12 22:30:43 -08:00
Jian Chen	e5eacc6d11	Fix cuda-packaging-pipeline.yml (#19115 ) ### Description <!-- Describe your changes. --> ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->	2024-01-12 19:09:25 -08:00
Guenther Schmuelling	96dbac6e4b	update to emsdk-3.1.51 (#18844 )	2024-01-12 16:04:33 -08:00
Caroline Zhu	4dbaa73738	[js/web/training] added end-to-end tests (#18700 ) ## Summary * following inference's [set-up for end-to-end tests](https://github.com/microsoft/onnxruntime/tree/main/js/web/test/e2e), created an end-to-end test runner for training * this test runner copies testdata from the [trainingapi folder](https://github.com/microsoft/onnxruntime/tree/main/onnxruntime/test/testdata/training_api) * then runs two tests (training session with evalModel & optimizer model, and training session with the minimum options), and tests if the ORT-web training package encompasses inference * these tests check * createTrainingSession * runTrainStep * runOptimizerStep if applicable * the parameters methods (getParametersSize, loadParametersBuffer, and getContiguousParameters) ## TL;DR * [`js/web/test/training/e2e/run.js`](https://github.com/microsoft/onnxruntime/compare/main...carzh:onnxruntime:carzh/training-e2e-runner?expand=1#diff-c1359c4d401f9ba69e937814219cefe5fd11b151a6ffd084c641af3c82e8216c) is responsible for setting up and running the end to end tests * [`js/web/test/training/e2e/common.js`](https://github.com/microsoft/onnxruntime/compare/main...carzh:onnxruntime:carzh/training-e2e-runner?expand=1#diff-ee5452491b7b2563d175d13d81d10f2323b12b18589aa4c5798962a8b904a4a8) contains the test function definitions (`testInferenceFunction`, `testTrainingFunctionMin`, `testTrainingFunctionAll`) ## Flow * entrypoint: user runs the following command in the terminal: `npm run test:training:e2e` * [`js/web/package.json`](https://github.com/microsoft/onnxruntime/compare/main...carzh:onnxruntime:carzh/training-e2e-runner?expand=1#diff-79275844e75c3c410bb3a71c7f59b2b633e5a3e975c804ffc47220025084da28) was modified to include an npm script that will run `run.js` which will run the end to end tests * [`js/web/test/training/e2e/run.js`](https://github.com/microsoft/onnxruntime/compare/main...carzh:onnxruntime:carzh/training-e2e-runner?expand=1#diff-c1359c4d401f9ba69e937814219cefe5fd11b151a6ffd084c641af3c82e8216c) is responsible for * detecting and installing local tarball packages of ORT-web * copying training data to the `js/web/training/e2e/data` folder * starting two Karma processes. Karma is a test runner framework that simulates testing in the browser. * In this case, the tests happen in Chrome. We can configure the tests to run in Edge and other browsers in the future. * one of these karma processes is self-hosted, meaning it pulls the ORT-web package from local * the other karma process is not self-hosted, meaning it pulls the ORT-web package from another source. In this case, we start an http server that serves the ORT-web binaries. * [`js/web/test/training/e2e/simple-http-server.js`](https://github.com/microsoft/onnxruntime/compare/main...carzh:onnxruntime:carzh/training-e2e-runner?expand=1#diff-f798ab485f3ec26c299fe5b2923574c9e4b090200ba20d490bbf6c183286993c) is responsible for starting the HTTP server and serving the ORT binary files. This code almost identical to the same code in the inference E2E tests. * [`js/web/test/training/e2e/karma.conf.js`](https://github.com/microsoft/onnxruntime/compare/main...carzh:onnxruntime:carzh/training-e2e-runner?expand=1#diff-436cfe8f670c768a04895bd4a1874a5e033f85e0e2d84941c62ff1f7c30a9f28) Karma configuration file that specifies what happens when a karma process is started. The config specifies Mocha as the testing framework, which will go through all the loaded files and run any tests that exist * [`js/web/test/training/e2e/browser-test-wasm.js`](https://github.com/microsoft/onnxruntime/compare/main...carzh:onnxruntime:carzh/training-e2e-runner?expand=1#diff-13b6155e106dddc7b531ef671186e69b2aadb8a0f4b2f3001db0991567d78221) File that contains the tests that Mocha will pick up on and run. * The test functions (such as testInference and testTrainingFunctionAll) are defined in [`js/web/test/training/e2e/common.js`](https://github.com/microsoft/onnxruntime/compare/main...carzh:onnxruntime:carzh/training-e2e-runner?expand=1#diff-ee5452491b7b2563d175d13d81d10f2323b12b18589aa4c5798962a8b904a4a8). ## Notes * I followed the [tests for training core](`b023de0bfc/orttraining/orttraining/test/training_api/core/training_api_tests.cc`) where they randomly generated input for the training session * E2E tests are triggered by running `npm run test:training:e2e` -- suggestions for alternative script names are appreciated!!! ## Motivation and Context - adding training bindings for web	2024-01-12 13:33:33 -08:00
Changming Sun	55b046e97e	Remove enable_mac_silicon settings (#19108 ) ### Description Remove enable_mac_silicon settings from two packaging pipelines. ### Motivation and Context Now we build universal2 packages instead.	2024-01-12 11:01:39 -08:00
Changming Sun	0e8d4c3d21	Enable Address Sanitizer in CI (#19073 ) ### Description 1. Add two build jobs for enabling Address Sanitizer in CI. One for Windows CPU, One for Linux CPU. 2. Set default compiler flags/linker flags in build.py for normal Windows/Linux/MacOS build. This can help control compiler flags in a more centralized way. 3. All Windows binaries in our official packages will be built with "/PROFILE" flag. Symbols of onnxruntime.dll can be found at [Microsoft public symbol server](https://learn.microsoft.com/en-us/windows-hardware/drivers/debugger/microsoft-public-symbols). Limitations: 1. On Linux Address Sanitizer ignores RPATH settings in ELF binaries. Therefore once Address Sanitizer is enabled, before running tests we need to manually set LD_LIBRARY_PATH properly otherwise libonnxruntime.so may not be able to find custom ops and shared EPs. 4. On Linux we also need to set LD_PRELOAD before running some tests(if the main executable, like python, is not built with address sanitizer. On Windows we do not need to. 5. On Windows before running python tests we should manually copy address sanitizer DLL to the onnxruntime/capi directory, because python 3.8 and above has enabled "Safe DLL Search Mode" that wouldn't use the information provided by PATH env. 6. On Linux Address Sanitizer found a lot of memory leaks from our python binding code. Therefore right now we cannot enable Address Sanitizer when building ONNX Runtime with python binding. 7. Address Sanitizer itself uses a lot of memory address space and delays memory deallocations, which is easy to cause OOM issues in 32-bit applications. We cannot run all the tests in onnxruntime_test_all in 32-bit mode with Address Sanitizer due to this reason. However, we still can run individual tests in such a way. We just cannot run all of them in one process. ### Motivation and Context To catch memory issues.	2024-01-12 07:24:40 -08:00
Changming Sun	285606108a	Set pythonInterpreter in set-python-manylinux-variables-step.yml (#19105 ) ### Description Set pythonInterpreter in set-python-manylinux-variables-step.yml. To fix a build error: ``` Starting: Set Python manylinux variables ============================================================================== Task : Python script Description : Run a Python file or inline script Version : 0.231.1 Author : Microsoft Corporation Help : https://docs.microsoft.com/azure/devops/pipelines/tasks/utility/python-script ============================================================================== ##[error]Parameter 'toolPath' cannot be null or empty. Finishing: Set Python manylinux variables ``` The error was because today I deleted a bunch of software from the VM image. The task might fail if no Python versions are found in $(Agent.ToolsDirectory).	2024-01-12 07:22:02 -08:00
Jian Chen	53497702a6	Fix Nuget CUDA Packaging pipeline (#19054 ) ### Description <!-- Describe your changes. --> ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> --------- Co-authored-by: Yi Zhang <zhanyi@microsoft.com>	2024-01-11 11:59:21 -08:00
Jian Chen	2eb3db6bf0	Adding python3.12 support to ORT (#18814 ) ### Description Adding python3.12 support to ORT ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->	2024-01-11 08:34:28 -08:00
Baiju Meswani	730df1bfa2	Increase MacOS pipeline timeout (#19072 )	2024-01-09 18:35:21 -08:00
Ashwini Khade	897a4163d7	Update transformer version for training CIs (#19046 ) ### Description Updating version to resolve security vulnerability.	2024-01-09 12:00:34 -08:00
Changming Sun	ab897a4a40	Remove Windows ARM32 from nuget packaging pipelines (#19049 ) ### Description 1. Remove Windows ARM32 from nuget packaging pipelines 2. Add missing component-governance-component-detection-steps.yml to some build jobs. ### Motivation and Context Stop supporting Windows ARM32 to align with [Windows's support policy](https://learn.microsoft.com/en-us/windows/arm/arm32-to-arm64). Users who need this feature still can build the DLLs from source. However, later on we will remove that support too.	2024-01-09 07:45:03 -08:00
Adrian Lizarraga	52e5601449	[QNN Nuget Pipeline] Build with ML ops and detect ORT version (#19024 ) ### Description - Removes `--disable_ml_ops` build flag - Automatically detects ORT version from VERSION file via `templates/set-version-number-variables-step.yml`. We will no longer need to create a commit to update ORT versions. ### Motivation and Context - A new unit test caused failures in the QNN Nuget pipeline because it did not enable ml ops. - Automate ORT version specification	2024-01-08 12:44:12 -08:00
Yi Zhang	e8ac97c8d8	Move Windows GPU training job to A10 (#19041 ) ### Description 1. Update sm to 86 ### Motivation and Context We have more A10 quota then T4 and Nvidia AXX could be partitioned	2024-01-08 09:19:58 -08:00
PeixuanZuo	efdcefcf8c	[ROCm] fix security warning (#19017 ) fix security warning	2024-01-05 10:05:34 -08:00
Changming Sun	e155c66b4a	Change all macOS python packages to use universal2 (#19013 ) ### Description Change all macOS python packages to use universal2, to reduce the number of packages we have. ### Motivation and Context According to [wikipedia](https://en.wikipedia.org/wiki/MacOS_Big_Sur), macOS 11 is the first macOS version that supports universal 2. And it is the min macOS version we support. So we no longer need to maintain separate binaries for different CPU archs.	2024-01-04 17:44:49 -08:00
Adrian Lizarraga	02b1ff5fa2	[QNN EP] Support multithreaded inference of a single session (#18981 ) ### Description - Add mutex to protect QNN API calls for executing a graph and extracting the corresponding profile data. - Ensures QNN EP's execute function does not store unnecessary state (i.e., input and output buffer pointers do not need to be stored as class members.) ### Motivation and Context Allow calling `session.Run()` from multiple threads when using QNN EP.	2024-01-04 13:32:48 -08:00
PeixuanZuo	7a454acd61	[ROCm] Update CI/Packaging pipeline to ROCm6.0 (#18985 ) Update CI/Packaing pipeline to ROCm6.0	2024-01-03 17:25:15 +08:00
Yi Zhang	c97e3f4821	[Fix] exception in Fuzz Test pipeline (#18984 ) ### Description <!-- Describe your changes. --> ### Motivation and Context The file path is not correct.	2024-01-03 14:53:31 +08:00
Yifan Li	3993d43048	[EP Perf] Fix missing Azure cli & use onnx zoo model inside image (#18917 ) ### Description * Fix [missing Azure CLI issue](https://aiinfra.visualstudio.com/Lotus/_build/results?buildId=392612&view=logs&j=b6bfa4e2-8141-507f-8ca1-59b3f929fa71&t=d0fed32c-7043-5439-8bf2-dd69d21beb5b&l=12). * Now, once CI fails to run `az --version`, it would auto-reinstall the azure cli dependency * Use existing onnx zoo model inside image during memtesting * to avoid test failure when onnx model zoo is restructuring * Display more detail info of valgrind when memtesting * Clear invalid dep of existing AddressSanitizer test case ### Validate * Before the fix, Azure CLI is missing: https://aiinfra.visualstudio.com/Lotus/_build/results?buildId=392994&view=logs&j=b6bfa4e2-8141-507f-8ca1-59b3f929fa71&t=d0fed32c-7043-5439-8bf2-dd69d21beb5b&l=10 * After the fix: https://aiinfra.visualstudio.com/Lotus/_build/results?buildId=392619&view=logs&j=b6bfa4e2-8141-507f-8ca1-59b3f929fa71&t=d0fed32c-7043-5439-8bf2-dd69d21beb5b	2024-01-01 17:14:39 -08:00
Yi Zhang	3f03c12986	Split Onnxruntime Nuget GPU package (#18819 ) ### Description 1. Update donwload-artifacts to flex-downloadartifacts to make it eaiser to debug. 2. Move the native files into Gpu.Windows and Gpu-linux packages. Onnxruntime-Gpu has dependency on them. 3. update the package validation as well 4. Add 2 stages to run E2E test for GPU.Windows and GPU.Linux for example: ![image](https://github.com/microsoft/onnxruntime/assets/16190118/35c6730b-8080-4f52-a17c-b9c61f41b6bb) ### Motivation and Context Single Onnxruntime.Gpu Package size has already excceded the Nuget size limit. We split the package into some smaller packages to make them can be published. For compatibility, the user can install or upgrade Onnxruntime.Gpu, which will install Gpu.Windows and Gpu.Linux automatically. And the user can only install Gpu.Windows and Gpu.Linux directly. ### Test Link 1. In ORT_NIGHTLY 2. Install the preview version in nuget-int. (nuget source: https://apiint.nugettest.org/v3/index.json) --------- Co-authored-by: Scott McKay <skottmckay@gmail.com>	2023-12-22 16:57:16 +08:00
Changming Sun	3d8f229d39	Add ARM64EC build jobs (#18870 ) ### Description Add ARM64EC build jobs in post merge pipeline to validate if our code is compatible with Windows ARM64EC.	2023-12-21 16:31:38 -08:00
Yifan Li	54e471a054	[EP Perf] Display percentage of cuda/trt ops in cuda/trt ep on EP Perf Dashboard (#18868 ) ### Description Display percentage of cuda/trt ops in cuda/trt ep on EP Perf Dashboard: ![image](https://github.com/microsoft/onnxruntime/assets/109183385/bafba098-1338-46fa-b10a-ca19eff2a746) Check [here](https://msit.powerbi.com/groups/d1ae6355-afd0-4c40-b78e-676a86cab1e2/reports/82101bbb-dad2-4f24-9ddf-a37f0d41509a/ReportSectionda402bdf6824e505a614?experience=power-bi) to preview on ep perf dashboard ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> - brief overview of op metrics towards various models - easy to identify models which haven't reached 100% ops on cuda/trt ep.	2023-12-20 22:11:47 -08:00
Hector Li	8931854528	Move some QNN EP provider options to session options (#18877 ) Move QNN EP provider options to session options ### Description Need to use session option to support multi-partition for context cache feature. To smooth the transaction, move the provider options to session options first. This is the first step for PR: PR https://github.com/microsoft/onnxruntime/pull/18865	2023-12-20 00:13:38 -08:00
Scott McKay	666fcbde4d	Add LeakyRelu to list of NNAPI operators (#18880 ) ### Description <!-- Describe your changes. --> Add LeakyRelu to the list as support was added a while ago. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->	2023-12-20 14:44:31 +10:00
Changming Sun	535a2403dd	Update Nuget publishing jobs (#18851 ) ### Description 1. Add a CodeSign validation task before the binaries are published, to make sure all DLL files are signed. 2. Auto-trigger the CUDA 12 pipeline's publishing job.	2023-12-19 16:54:46 -08:00
Ashwini Khade	4dff154f51	Fix nightly pipeline failure (#18867 ) ### Description Fixes a failure in the ortmodule nightly pipeline. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->	2023-12-19 09:18:00 -08:00
Jian Chen	6d7519ede8	Adding new pipeline for python cuda testing (#18718 ) ### Description <!-- Describe your changes. --> ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->	2023-12-18 18:13:03 -08:00
Changming Sun	ad476d5a1f	Change Nuget packaging pipeline's build TRT job to download CUDA SDK on-the-fly (#18847 ) ### Description Change Nuget packaging pipeline's build TRT job to download CUDA SDK on-the-fly, so that we do not need to put a CUDA SDK in the build machine's image.	2023-12-15 17:44:02 -08:00
Changming Sun	fc9ecb59db	Add Windows ARM build jobs to post merge pipeline (#18832 ) ### Description Add Windows ARM build jobs to post merge pipeline to valid our code is still compatible with these build settings.	2023-12-15 08:47:52 -08:00
Changming Sun	cbad4fe49b	Update absl and googletest (#18827 ) ### Description Update absl and googletest to their latest version to include some cmake changes: 1. A googletest's cmake change that will allow using external absl and re2. 2. Nullability enhancements that will allow our clang-based static analysis detecting many kinds of null pointer errors. ### Motivation and Context To fix a C4744 link warning in our Windows pipelines. ``` LINK : warning C4744: 'static char const absl::lts_20230802::base_internal::FastTypeTag<bool>::dummy_var' has different type in 'd:\a\_work\_temp\abseil_cpp\abseil-cpp-20230802.0\absl\flags\parse.cc' and 'd:\a\_work\1\b\relwithdebinfo\_deps\googletest-src\googletest\src\gtest-all.cc': 'signed char' and 'unsigned char' [D:\a\_work\1\b\RelWithDebInfo\onnxruntime_mlas_test.vcxproj] LINK : warning C4744: 'static char const absl::lts_20230802::base_internal::FastTypeTag<class std::basic_string<char,struct std::char_traits<char>,class std::allocator<char> > >::dummy_var' has different type in 'd:\a\_work\_temp\abseil_cpp\abseil-cpp-20230802.0\absl\flags\parse.cc' and 'd:\a\_work\1\b\relwithdebinfo\_deps\googletest-src\googletest\src\gtest-all.cc': 'signed char' and 'unsigned char' [D:\a\_work\1\b\RelWithDebInfo\onnxruntime_mlas_test.vcxproj] LINK : warning C4744: 'static char const absl::lts_20230802::base_internal::FastTypeTag<class std::basic_string<char,struct std::char_traits<char>,class std::allocator<char> > >::dummy_var' has different type in 'd:\a\_work\_temp\abseil_cpp\abseil-cpp-20230802.0\absl\flags\internal\usage.cc' and 'd:\a\_work\1\b\relwithdebinfo\_deps\googletest-src\googletest\src\gtest-all.cc': 'signed char' and 'unsigned char' [D:\a\_work\1\b\RelWithDebInfo\onnxruntime_mlas_test.vcxproj] LINK : warning C4744: 'static char const absl::lts_20230802::base_internal::FastTypeTag<bool>::dummy_var' has different type in 'd:\a\_work\_temp\abseil_cpp\abseil-cpp-20230802.0\absl\flags\internal\flag.cc' and 'd:\a\_work\1\b\relwithdebinfo\_deps\googletest-src\googletest\src\gtest-all.cc': 'signed char' and 'unsigned char' [D:\a\_work\1\b\RelWithDebInfo\onnxruntime_mlas_test.vcxproj] LINK : warning C4744: 'static char const absl::lts_20230802::base_internal::FastTypeTag<class std::basic_string<char,struct std::char_traits<char>,class std::allocator<char> > >::dummy_var' has different type in 'd:\a\_work\_temp\abseil_cpp\abseil-cpp-20230802.0\absl\flags\internal\flag.cc' and 'd:\a\_work\1\b\relwithdebinfo\_deps\googletest-src\googletest\src\gtest-all.cc': 'signed char' and 'unsigned char' [D:\a\_work\1\b\RelWithDebInfo\onnxruntime_mlas_test.vcxproj] LINK : warning C4744: 'static char const absl::lts_20230802::base_internal::FastTypeTag<int>::dummy_var' has different type in 'd:\a\_work\_temp\abseil_cpp\abseil-cpp-20230802.0\absl\flags\internal\flag.cc' and 'd:\a\_work\1\b\relwithdebinfo\_deps\googletest-src\googletest\src\gtest-all.cc': 'signed char' and 'unsigned char' [D:\a\_work\1\b\RelWithDebInfo\onnxruntime_mlas_test.vcxproj] ```	2023-12-14 16:15:07 -08:00
Changming Sun	b129f425fc	Fix test model URL issue (#18823 ) ### Description ONNX model zoo changed their dir structure. So some our pipelines are failing. In prevent such things happening again, we'd better to read the test data for a cache from local disk instead of downloading it remotely every time.	2023-12-14 13:06:08 -08:00

1 2 3 4 5 ...

1772 commits