onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-08 17:17:15 +00:00

Author	SHA1	Message	Date
Tianlei Wu	fbff99a432	Change Jave Test Threshold (#19508 ) ### Description Increase the threshold to 1e-5 to avoid test failed in CUDA when difference is slightly larger than 1e-6. May because TF32 is used in those CUDA tests. ### Motivation and Context https://dev.azure.com/onnxruntime/onnxruntime/_build/results?buildId=1291322&view=logs&j=f2f63060-d9d6-52d0-adee-b97db5a9ab91&t=28e21ca6-87a4-5e1e-0441-72b5e8326f2d ProviderOptionsTest > testCUDAOptions() FAILED org.opentest4j.AssertionFailedError: array contents differ at index [103], expected: <0.0102678> but was: <0.010266338> at app//org.junit.jupiter.api.AssertionFailureBuilder.build(AssertionFailureBuilder.java:151) at app//org.junit.jupiter.api.AssertionFailureBuilder.buildAndThrow(AssertionFailureBuilder.java:132) at app//org.junit.jupiter.api.AssertArrayEquals.failArraysNotEqual(AssertArrayEquals.java:440) at app//org.junit.jupiter.api.AssertArrayEquals.assertArrayEquals(AssertArrayEquals.java:290) at app//org.junit.jupiter.api.AssertArrayEquals.assertArrayEquals(AssertArrayEquals.java:123) at app//org.junit.jupiter.api.AssertArrayEquals.assertArrayEquals(AssertArrayEquals.java:119) at app//org.junit.jupiter.api.Assertions.assertArrayEquals(Assertions.java:1360) at app//ai.onnxruntime.providers.ProviderOptionsTest.runProvider(ProviderOptionsTest.java:99) at app//ai.onnxruntime.providers.ProviderOptionsTest.testCUDAOptions(ProviderOptionsTest.java:43) https://dev.azure.com/onnxruntime/onnxruntime/_build/results?buildId=1293200&view=logs&jobId=f2f63060-d9d6-52d0-adee-b97db5a9ab91&j=f2f63060-d9d6-52d0-adee-b97db5a9ab91&t=28e21ca6-87a4-5e1e-0441-72b5e8326f2d InferenceTest > testCUDA() FAILED org.opentest4j.AssertionFailedError: array contents differ at index [103], expected: <0.0102678> but was: <0.010266337> at app//org.junit.jupiter.api.AssertionFailureBuilder.build(AssertionFailureBuilder.java:151) at app//org.junit.jupiter.api.AssertionFailureBuilder.buildAndThrow(AssertionFailureBuilder.java:132) at app//org.junit.jupiter.api.AssertArrayEquals.failArraysNotEqual(AssertArrayEquals.java:440) at app//org.junit.jupiter.api.AssertArrayEquals.assertArrayEquals(AssertArrayEquals.java:290) at app//org.junit.jupiter.api.AssertArrayEquals.assertArrayEquals(AssertArrayEquals.java:123) at app//org.junit.jupiter.api.AssertArrayEquals.assertArrayEquals(AssertArrayEquals.java:119) at app//org.junit.jupiter.api.Assertions.assertArrayEquals(Assertions.java:1360) at app//ai.onnxruntime.InferenceTest.runProvider(InferenceTest.java:676) at app//ai.onnxruntime.InferenceTest.testCUDA(InferenceTest.java:615)	2024-02-14 10:08:46 -08:00
Ye Wang	f53d2c2465	Phi2 script fixes (#19500 ) ### Description <!-- Describe your changes. --> This PR is intended to support Phi2 passes in Olive. Merge it before https://github.com/microsoft/Olive/pull/938 ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->	2024-02-14 10:08:11 -08:00
Prathik Rao	544407038d	SimplifiedLayerNormalization Fusion BFloat16 support for Llama-v2 on A100 (#18898 ) ### Description <!-- Describe your changes. --> Adds bfloat16 as a supported dtype for SimplifiedLayerNormFusion which will provide speedup for Llama-v2 on A100 using bfloat16 numerical format. _layernorm_optimized_training.onnx exported in bfloat16 vs. float16:_ ![image](https://github.com/microsoft/onnxruntime/assets/31260940/8c0a5f0f-5fcb-4637-bcd9-f34272ec0284) ### Repro Instructions ```python from torch import nn from onnxruntime.training.ortmodule import ORTModule, DebugOptions, LogLevel import torch dtype = torch.bfloat16 # dtype = torch.float16 class Net(nn.Module): def __init__(self): super().__init__() self.fc = nn.Linear(784, 10, dtype=dtype) self.layernorm = nn.LayerNorm([784], dtype=dtype) def forward(self, x): x = x.view(x.shape[0], -1) x = self.layernorm(x) x = self.fc(x) return x model = Net() model = ORTModule(model, DebugOptions(save_onnx=True, onnx_prefix='layernorm', log_level=LogLevel.INFO)) model.to("cuda") images = torch.randn((8, 28, 28), dtype=dtype).to("cuda") output = model(images) ``` ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> ONNX Runtime integration with Llama-v2 family of LLMs. --------- Co-authored-by: Prathik Rao <prathikrao@microsoft.com@orttrainingdev8.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>	2024-02-14 10:05:16 -08:00
dependabot[bot]	18f76bd25d	Bump gradle/wrapper-validation-action from 1 to 2 (#19412 ) Bumps [gradle/wrapper-validation-action](https://github.com/gradle/wrapper-validation-action) from 1 to 2. <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/gradle/wrapper-validation-action/releases">gradle/wrapper-validation-action's releases</a>.</em></p> <blockquote> <h2>v2.0.0</h2> <h2>What's Changed</h2> <p>The version of the Node.js runtime was updated to 20, and the majority of dependencies were updated to the latest versions. From now on, the <code>wrapper-validation-action</code> will require a Node.js 20 runtime environment.</p> <p>There are no functional changes in this release. This release is tagged with the <code>v2</code> version label.</p> <ul> <li>[NEW] Update Node.js runtime to version 20 (<a href="https://redirect.github.com/gradle/wrapper-validation-action/issues/170">#170</a>)</li> </ul> <h2>v2.0.0-rc.1</h2> <p>This is a release candidate for <code>v2.0.0</code>. It is also available under the <code>v2</code> version label.</p> <h2>What's Changed</h2> <p>The version of the Node.js runtime was updated to 20, and the majority of dependencies were updated to the latest versions. From now on, the <code>wrapper-validation-action</code> will require a Node.js 20 runtime environment.</p> <p>There are no functional changes in this release.</p> <ul> <li>[NEW] Update Node.js runtime to version 20 (<a href="https://redirect.github.com/gradle/wrapper-validation-action/issues/170">#170</a>)</li> </ul> <h2>v1.1.0</h2> <p>The action now adds the path of the failed wrapper Jar as a <code>failed-wrapper</code> Step output parameter. This makes the value available for reporting in later Steps/Jobs.</p> <h2>v1.0.6</h2> <h1>Gradle Wrapper Validation</h1> <ul> <li>Security vulnerability: <a href="`959bfac6da`">Bump json5 from 1.0.1 to 1.0.2</a></li> <li>Security vulnerability: <a href="`ffa46e5c87`">Bump qs from 6.10.1 to 6.11.0</a></li> </ul> <h2>v1.0.5</h2> <h1>Gradle Wrapper Validation</h1> <ul> <li>Update dependencies for Node 16 (<a href="https://redirect.github.com/gradle/wrapper-validation-action/issues/53">#53</a>)</li> <li>Update dependencies with security vulnerabilities (<a href="https://redirect.github.com/gradle/wrapper-validation-action/issues/67">#67</a>)</li> <li>Update various other dependencies (<a href="https://redirect.github.com/gradle/wrapper-validation-action/issues/45">#45</a>, <a href="https://redirect.github.com/gradle/wrapper-validation-action/issues/47">#47</a>, <a href="https://redirect.github.com/gradle/wrapper-validation-action/issues/48">#48</a>, <a href="https://redirect.github.com/gradle/wrapper-validation-action/issues/54">#54</a>)</li> </ul> <h2>v1.0.4</h2> <h1>Gradle Wrapper Validation</h1> <ul> <li>Retry connections to the server on failure (<a href="https://redirect.github.com/gradle/wrapper-validation-action/issues/39">#39</a>)</li> <li>Update dependencies (<a href="https://redirect.github.com/gradle/wrapper-validation-action/issues/38">#38</a>, <a href="https://redirect.github.com/gradle/wrapper-validation-action/issues/37">#37</a>, <a href="https://redirect.github.com/gradle/wrapper-validation-action/issues/36">#36</a>, <a href="https://redirect.github.com/gradle/wrapper-validation-action/issues/34">#34</a>, <a href="https://redirect.github.com/gradle/wrapper-validation-action/issues/31">#31</a>, <a href="https://redirect.github.com/gradle/wrapper-validation-action/issues/30">#30</a>, <a href="https://redirect.github.com/gradle/wrapper-validation-action/issues/29">#29</a>)</li> </ul> <h2>v1.0.3</h2> <h1>Gradle Wrapper Validation</h1> <p>Update <code>minimist</code> version to <code>1.2.5</code></p> <h2>v1.0.2</h2> <!-- raw HTML omitted --> </blockquote> <p>... (truncated)</p> </details> <details> <summary>Commits</summary> <ul> <li><a href="`27152f6fa0`"><code>27152f6</code></a> Update to Node 20 (<a href="https://redirect.github.com/gradle/wrapper-validation-action/issues/170">#170</a>)</li> <li><a href="`d8758a98d1`"><code>d8758a9</code></a> Build output</li> <li><a href="`e916071cca`"><code>e916071</code></a> Update NPM dependencies</li> <li><a href="`d9359e465a`"><code>d9359e4</code></a> Add asdf config file</li> <li><a href="`77d43de170`"><code>77d43de</code></a> Update upload-artifact version</li> <li><a href="`2f8436d9bb`"><code>2f8436d</code></a> Use setup-node@v4 instead of pinning to a revision</li> <li><a href="`bfa0fe410a`"><code>bfa0fe4</code></a> Consistently use npm cache for workflows</li> <li><a href="`8be8473276`"><code>8be8473</code></a> Update workflows and action to NodeJS 20</li> <li><a href="`c8fad9e3f8`"><code>c8fad9e</code></a> Bump <code>@babel/traverse</code> from 7.14.7 to 7.23.2</li> <li><a href="`342dbebe72`"><code>342dbeb</code></a> Update README to use <code>actions/checkout@v4</code></li> <li>See full diff in <a href="https://github.com/gradle/wrapper-validation-action/compare/v1...v2">compare view</a></li> </ul> </details> <br /> [![Dependabot compatibility score](https://dependabot-badges.githubapp.com/badges/compatibility_score?dependency-name=gradle/wrapper-validation-action&package-manager=github_actions&previous-version=1&new-version=2)](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores) Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting `@dependabot rebase`. [//]: # (dependabot-automerge-start) [//]: # (dependabot-automerge-end) --- <details> <summary>Dependabot commands and options</summary> <br /> You can trigger Dependabot actions by commenting on this PR: - `@dependabot rebase` will rebase this PR - `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it - `@dependabot merge` will merge this PR after your CI passes on it - `@dependabot squash and merge` will squash and merge this PR after your CI passes on it - `@dependabot cancel merge` will cancel a previously requested merge and block automerging - `@dependabot reopen` will reopen this PR if it is closed - `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually - `@dependabot show <dependency name> ignore conditions` will show all of the ignore conditions of the specified dependency - `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself) </details> Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-02-13 15:59:24 -08:00
dependabot[bot]	f048fb5b14	Bump nuget/setup-nuget from 1 to 2 (#19411 ) Bumps [nuget/setup-nuget](https://github.com/nuget/setup-nuget) from 1 to 2. <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/nuget/setup-nuget/releases">nuget/setup-nuget's releases</a>.</em></p> <blockquote> <h2>v2.0.0</h2> <h2>What's Changed</h2> <ul> <li>build(deps): bump semver from 7.3.8 to 7.5.2 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/49">NuGet/setup-nuget#49</a></li> <li>build(deps-dev): bump word-wrap from 1.2.3 to 1.2.5 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/51">NuGet/setup-nuget#51</a></li> <li>build(deps-dev): bump <code>@babel/traverse</code> from 7.23.0 to 7.23.2 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/57">NuGet/setup-nuget#57</a></li> <li>Update to use Node.js 20 by <a href="https://github.com/frederikprijck"><code>@frederikprijck</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/59">NuGet/setup-nuget#59</a></li> <li>build(deps-dev): bump prettier from 2.8.7 to 3.0.3 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/60">NuGet/setup-nuget#60</a></li> <li>build(deps-dev): bump <code>@types/node</code> from 18.18.0 to 20.8.9 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/62">NuGet/setup-nuget#62</a></li> <li>build(deps-dev): bump <code>@vercel/ncc</code> from 0.36.1 to 0.38.1 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/61">NuGet/setup-nuget#61</a></li> <li>build(deps-dev): bump eslint-plugin-jest from 27.4.0 to 27.6.0 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/64">NuGet/setup-nuget#64</a></li> <li>build(deps-dev): bump nock from 13.3.3 to 13.3.6 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/63">NuGet/setup-nuget#63</a></li> <li>build(deps-dev): bump eslint from 8.50.0 to 8.52.0 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/65">NuGet/setup-nuget#65</a></li> <li>build(deps-dev): bump <code>@typescript-eslint/parser</code> from 5.62.0 to 6.9.1 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/70">NuGet/setup-nuget#70</a></li> <li>build(deps-dev): bump eslint-plugin-github from 4.10.0 to 4.10.1 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/68">NuGet/setup-nuget#68</a></li> <li>build(deps-dev): bump <code>@types/jest</code> from 29.5.5 to 29.5.7 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/69">NuGet/setup-nuget#69</a></li> <li>build(deps-dev): bump eslint from 8.52.0 to 8.53.0 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/73">NuGet/setup-nuget#73</a></li> <li>build(deps-dev): bump <code>@types/node</code> from 20.8.9 to 20.8.10 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/71">NuGet/setup-nuget#71</a></li> <li>build(deps-dev): bump nock from 13.3.6 to 13.3.8 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/72">NuGet/setup-nuget#72</a></li> <li>build(deps-dev): bump prettier from 3.0.3 to 3.1.0 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/74">NuGet/setup-nuget#74</a></li> <li>build(deps-dev): bump <code>@types/jest</code> from 29.5.7 to 29.5.8 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/76">NuGet/setup-nuget#76</a></li> <li>build(deps-dev): bump <code>@typescript-eslint/parser</code> from 6.9.1 to 6.10.0 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/77">NuGet/setup-nuget#77</a></li> <li>build(deps-dev): bump <code>@types/node</code> from 20.8.10 to 20.9.0 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/75">NuGet/setup-nuget#75</a></li> <li>build(deps-dev): bump eslint from 8.53.0 to 8.54.0 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/80">NuGet/setup-nuget#80</a></li> <li>build(deps-dev): bump <code>@types/node</code> from 20.9.0 to 20.9.2 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/79">NuGet/setup-nuget#79</a></li> <li>build(deps-dev): bump <code>@typescript-eslint/parser</code> from 6.10.0 to 6.12.0 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/81">NuGet/setup-nuget#81</a></li> <li>build(deps-dev): bump <code>@types/jest</code> from 29.5.8 to 29.5.10 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/83">NuGet/setup-nuget#83</a></li> <li>build(deps-dev): bump typescript from 5.2.2 to 5.3.2 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/82">NuGet/setup-nuget#82</a></li> <li>build(deps-dev): bump nock from 13.3.8 to 13.4.0 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/88">NuGet/setup-nuget#88</a></li> <li>build(deps-dev): bump <code>@types/node</code> from 20.9.2 to 20.10.3 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/86">NuGet/setup-nuget#86</a></li> <li>build(deps-dev): bump eslint from 8.54.0 to 8.55.0 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/85">NuGet/setup-nuget#85</a></li> <li>build(deps-dev): bump <code>@typescript-eslint/parser</code> from 6.12.0 to 6.13.2 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/89">NuGet/setup-nuget#89</a></li> <li>build(deps-dev): bump <code>@types/jest</code> from 29.5.10 to 29.5.11 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/93">NuGet/setup-nuget#93</a></li> <li>build(deps-dev): bump prettier from 3.1.0 to 3.1.1 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/91">NuGet/setup-nuget#91</a></li> <li>build(deps-dev): bump typescript from 5.3.2 to 5.3.3 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/92">NuGet/setup-nuget#92</a></li> <li>build(deps-dev): bump <code>@types/node</code> from 20.10.3 to 20.10.4 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/90">NuGet/setup-nuget#90</a></li> <li>build(deps-dev): bump eslint from 8.55.0 to 8.56.0 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/94">NuGet/setup-nuget#94</a></li> <li>build(deps-dev): bump <code>@typescript-eslint/parser</code> from 6.13.2 to 6.19.0 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/107">NuGet/setup-nuget#107</a></li> <li>build(deps-dev): bump eslint-plugin-jest from 27.6.0 to 27.6.3 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/106">NuGet/setup-nuget#106</a></li> <li>build(deps-dev): bump <code>@types/node</code> from 20.10.4 to 20.11.5 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/110">NuGet/setup-nuget#110</a></li> <li>build(deps-dev): bump prettier from 3.1.1 to 3.2.4 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/109">NuGet/setup-nuget#109</a></li> <li>build(deps-dev): bump <code>@types/node</code> from 20.11.5 to 20.11.10 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/116">NuGet/setup-nuget#116</a></li> <li>build(deps-dev): bump nock from 13.4.0 to 13.5.1 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/115">NuGet/setup-nuget#115</a></li> <li>build(deps-dev): bump ts-jest from 29.1.1 to 29.1.2 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/113">NuGet/setup-nuget#113</a></li> <li>build(deps-dev): bump <code>@typescript-eslint/parser</code> from 6.19.0 to 6.20.0 by <a href="https://github.com/dependabot"><code>@dependabot</code></a> in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/117">NuGet/setup-nuget#117</a></li> </ul> <h2>New Contributors</h2> <ul> <li><a href="https://github.com/frederikprijck"><code>@frederikprijck</code></a> made their first contribution in <a href="https://redirect.github.com/NuGet/setup-nuget/pull/59">NuGet/setup-nuget#59</a></li> </ul> <p><strong>Full Changelog</strong>: <a href="https://github.com/NuGet/setup-nuget/compare/v1.2.0...v1.3.0">https://github.com/NuGet/setup-nuget/compare/v1.2.0...v1.3.0</a></p> <!-- raw HTML omitted --> </blockquote> <p>... (truncated)</p> </details> <details> <summary>Commits</summary> <ul> <li><a href="`a21f25cd39`"><code>a21f25c</code></a> Update dist for release (<a href="https://redirect.github.com/nuget/setup-nuget/issues/118">#118</a>)</li> <li><a href="`5166d73a43`"><code>5166d73</code></a> build(deps-dev): bump <code>@typescript-eslint/parser</code> from 6.19.0 to 6.20.0 (<a href="https://redirect.github.com/nuget/setup-nuget/issues/117">#117</a>)</li> <li><a href="`b915545882`"><code>b915545</code></a> build(deps-dev): bump ts-jest from 29.1.1 to 29.1.2 (<a href="https://redirect.github.com/nuget/setup-nuget/issues/113">#113</a>)</li> <li><a href="`00081d4dbe`"><code>00081d4</code></a> build(deps-dev): bump nock from 13.4.0 to 13.5.1 (<a href="https://redirect.github.com/nuget/setup-nuget/issues/115">#115</a>)</li> <li><a href="`e44f8a5711`"><code>e44f8a5</code></a> build(deps-dev): bump <code>@types/node</code> from 20.11.5 to 20.11.10 (<a href="https://redirect.github.com/nuget/setup-nuget/issues/116">#116</a>)</li> <li><a href="`f685ada866`"><code>f685ada</code></a> build(deps-dev): bump prettier from 3.1.1 to 3.2.4 (<a href="https://redirect.github.com/nuget/setup-nuget/issues/109">#109</a>)</li> <li><a href="`aee2c690f4`"><code>aee2c69</code></a> build(deps-dev): bump <code>@types/node</code> from 20.10.4 to 20.11.5 (<a href="https://redirect.github.com/nuget/setup-nuget/issues/110">#110</a>)</li> <li><a href="`2bd1cef324`"><code>2bd1cef</code></a> build(deps-dev): bump eslint-plugin-jest from 27.6.0 to 27.6.3 (<a href="https://redirect.github.com/nuget/setup-nuget/issues/106">#106</a>)</li> <li><a href="`c5ed90cfc8`"><code>c5ed90c</code></a> build(deps-dev): bump <code>@typescript-eslint/parser</code> from 6.13.2 to 6.19.0 (<a href="https://redirect.github.com/nuget/setup-nuget/issues/107">#107</a>)</li> <li><a href="`34040aa462`"><code>34040aa</code></a> build(deps-dev): bump eslint from 8.55.0 to 8.56.0 (<a href="https://redirect.github.com/nuget/setup-nuget/issues/94">#94</a>)</li> <li>Additional commits viewable in <a href="https://github.com/nuget/setup-nuget/compare/v1...v2">compare view</a></li> </ul> </details> <br /> [![Dependabot compatibility score](https://dependabot-badges.githubapp.com/badges/compatibility_score?dependency-name=nuget/setup-nuget&package-manager=github_actions&previous-version=1&new-version=2)](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores) Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting `@dependabot rebase`. [//]: # (dependabot-automerge-start) [//]: # (dependabot-automerge-end) --- <details> <summary>Dependabot commands and options</summary> <br /> You can trigger Dependabot actions by commenting on this PR: - `@dependabot rebase` will rebase this PR - `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it - `@dependabot merge` will merge this PR after your CI passes on it - `@dependabot squash and merge` will squash and merge this PR after your CI passes on it - `@dependabot cancel merge` will cancel a previously requested merge and block automerging - `@dependabot reopen` will reopen this PR if it is closed - `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually - `@dependabot show <dependency name> ignore conditions` will show all of the ignore conditions of the specified dependency - `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself) </details> Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-02-13 15:59:15 -08:00
fxmarty	1e10cdb2b9	Fix subgraph quantization regression in onnxruntime 1.17 (#19421 ) As per title, fixes https://github.com/microsoft/onnxruntime/issues/19418 ONNX Runtime 1.17 broke the quantization of ONNX models with subgraphs where initializers are placed on the top-level graph, while different subgraphs use the same initializer.	2024-02-13 15:49:19 -08:00
Yifan Li	5c7e6b2e2a	[EP Perf] Add CI option to enable TRT-OSS parser (#19448 ) ### Description <!-- Describe your changes. --> * Introducing CI option to enable TRT-OSS parser, during ep perf testing: ![image](https://github.com/microsoft/onnxruntime/assets/109183385/a9ba6393-6b94-4b8f-8ca4-ba7bc7954504) By default, open-sourced onnx-tensorrt parser listed under [cmake/deps.txt](https://github.com/microsoft/onnxruntime/blob/main/cmake/deps.txt#L39-L40) will be used if enabling this option. ### To verify this option and check the difference during ORT image build: If this option is enabled: <img width="649" alt="image" src="https://github.com/microsoft/onnxruntime/assets/109183385/3b778583-451e-4617-ba8c-c064442e60fd"> If this option is not enabled (by default): <img width="683" alt="image" src="https://github.com/microsoft/onnxruntime/assets/109183385/cd8383ba-eff4-4536-94ab-a1424bb858ab"> * update default usage of cmake/trt version to the latest ### Motivation and Context Make it easier to test oss parser and find potential gap between tensorrt builtin/oss parser. Schedule runs with oss parser will be set after this PR gets merged	2024-02-12 23:04:08 -08:00
George Wu	5e70c6b3a6	allow protobuf lite build for TRT EP (#19498 ) allow protobuf-lite builds with TensorRT EP as long as it's built with the trt built-in parser and not the oss-parser. This is because trt built-in parser statically links protobuf so there aren't any conflicts for protobuf-lite.	2024-02-12 22:53:04 -08:00
Adrian Lizarraga	4dfba53bfb	[QNN EP] Build x64 python wheel for QNN EP (#19499 ) ### Description Adds a job to the python packaging pipeline that builds x64 python wheels for QNN EP. ### Motivation and Context Necessary to create a cached QNN model on Windows x64, which is done by creating a properly configured onnxruntime session with QNN EP.	2024-02-12 20:54:04 -08:00
Patrice Vignola	61e07a46e1	[DML EP] Support split hidden size for RotaryEmbedding (#18852 ) RotaryEmbedding now supports the `[batchSize, numHeads, sequenceLength, headSize]` format for its input, which is used in Mistral.	2024-02-12 19:36:08 -08:00
Hector Li	a622710fe1	Add option to skip session run in perf_test tool (#19501 ) Enable a option to exit after session creation so that user can measure session creation time to measure impact of enabling any initialization optimizations.	2024-02-12 19:11:40 -08:00
snadampal	7fa6f4fca4	add arm64 bfloat16 fastmath mode option for transformers benchmarking script (#19294 ) Add arm64 bfloat16 fastmath mode option for transformers benchmarking script. ### Motivation and Context onnxruntime now supports bfloat16 fastmath gemm kernels for arm64 platforms with bfloat16 instruction support. This PR updates benchmark scripts to test that mode.	2024-02-12 15:20:36 -08:00
Preetha Veeramalai	90e2e8561f	Ovep 1.17.1 (#19482 ) ### Description Handle bugs for API backward compatability. Update to consume the onnx model path rather the onnx serialised model to OV compile_model API	2024-02-12 12:31:08 -08:00
Changming Sun	9cb97ee507	Disable CPU EP's allocator's arena when address sanitizer is enabled (#19485 ) ### Description Disable CPU EP's allocator's arena when address sanitizer is enabled, because it masks problems. For example, the code in onnxruntime/test/quantization/quantization_test.cc has a memory leak problem: it allocated a buffer but didn't free it, but most memory leak check tool cannot detect that because the buffer was from an arena and the arena was finally freed. ### Motivation and Context Provider better memory leak check coverage.	2024-02-12 09:39:49 -08:00
Baiju Meswani	c831031ad5	Remove cuda gencode 90 to reduce onnxruntime-training package size (#19486 )	2024-02-12 09:24:36 -08:00
Fangrui Song	d00adb7989	Align bins_space_ storage (#17552 ) Otherwise, `new (BinFromIndex(b)) Bin(this, bin_size);` in bfc_arena.cc would cause a -fsanitize=alignment (part of -fsanitize=undefined) failure like runtime error: constructor call on misaligned address 0xXXX for type 'Bin', which requires 8 byte alignment	2024-02-11 19:18:26 -08:00
Patrice Vignola	1182b5509b	Disable streams for the DML EP (#19481 ) There's currently a bug in the allocation planner when reusing buffers and more than one streams are used that make it possible (although rarely) to reach a reference count of 0 for a buffer that is still being used. Since DML doesn't benefit from multiple streams, disabling it is the safest option for now. This is a high priority issue that we need to fix for 1.17.1 since it breaks stable diffusion. Identifying the perfect fix and fixing the underlying issue would be too risky for a patch release, especially given the limited time that we have. https://github.com/microsoft/onnxruntime/issues/19480	2024-02-10 00:34:34 -08:00
Dmitri Smirnov	0e984ef0d1	Small fixes in Resize CPU antialias (#19476 ) ### Description Add a comment, pass NCHW = true when setting upsample_antialias ### Motivation and Context Small bugs.	2024-02-09 15:27:04 -08:00
Shubham Bhokare	90cf03767d	Support ONNX export of OpenAi Whisper model (#17316 ) Build from source and run the command below Example, converting whisper-base ` python -m onnxruntime.transformers.models.whisper.convert_to_onnx -m openai/whisper-base --model_impl openai -e -o -w --chain_model --output ./demo`	2024-02-09 15:26:39 -05:00
Changming Sun	1007d8f3d1	Revert "Revert NeuralSpeed code for x64 MatMulNBits (#19382 )" (#19474 ) This reverts commit `0d10c7f3c1`.	2024-02-09 09:24:54 -08:00
Justin Chu	3d2ddf96e3	Bump ruff linter to 0.2.1 (#19471 ) ### Motivation and Context Include new lint rules	2024-02-08 16:08:27 -08:00
Yulong Wang	03be65e064	[js/web] fix types exports in package.json (#19458 ) ### Description Since TypeScript v4.7, types need to specify inside "exports" field when it is available. This PR appends types just before each "default" (which is required by spec to be the last item). Fixes #19403.	2024-02-08 15:56:48 -08:00
ivberg	148f54c6ea	Add capturestate / rundown ETW support logging for session and provider options (#19397 ) ### Description Add capturestate / rundown ETW support logging for session and provider options. ### Motivation and Context Follow-up to #16259 and #18882 This is very useful when you have longer running ONNX sessions which will be the case for a lot of AI workloads. That means ETW tracing may start minutes or hours after a process & session has been established. When a trace is captured, you would want to know the state of ONNX at that time. The state for ONNX is session and config options so that they show up in the trace. Tested with xperf and ORT xperf -start ort -on 3a26b1ff-7484-7484-7484-15261f42614d xperf -capturestate ort 3a26b1ff-7484-7484-7484-15261f42614d <--- Run this after session has been up for some time xperf -stop ort -d .\ort.etl <- Trace will now also have rundown events Also these will show if you use WPR [CaptureStateOnSave ](https://learn.microsoft.com/en-us/windows-hardware/test/wpt/capturestateonsave)	2024-02-08 11:28:05 -08:00
Scott McKay	3b1b18347c	Check for invalid combination of python + minimal build in build.py (#19463 ) ### Description <!-- Describe your changes. --> Python bindings aren't supported in a minimal build. Check in build.py so user gets a better error message. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> #19422	2024-02-08 09:08:41 -08:00
Ye Wang	19952c5b35	Add script to convert phi2 to ort-vllm compatible (#19429 ) ### Description <!-- Describe your changes. --> 1. add option to export onnx compatiable with ort_vllm. This makes sure that onnx model only leverages on paged attn from vllm. It's intended to use internally so not mentioned in readme. 2. add details in ORT installation(https://github.com/microsoft/onnxruntime/pull/19338#discussion_r1476906190) ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> --------- Co-authored-by: wejoncy <wejoncy@163.com>	2024-02-07 17:03:06 -08:00
luoyu-intel	0d10c7f3c1	Revert NeuralSpeed code for x64 MatMulNBits (#19382 ) ### Description <!-- Describe your changes. --> Revert PR#19016 https://github.com/microsoft/onnxruntime/pull/19016 Revert PR#17669 https://github.com/microsoft/onnxruntime/pull/17669	2024-02-07 13:04:37 -08:00
Jian Chen	75f06319d6	Change binet to bin (#19424 ) ### Description This pull request includes a small change to the `Dockerfile.manylinux2_28_cuda` file in the `tools/ci_build/github/linux/docker` directory. The change corrects the `PREPEND_PATH` argument from `/usr/local/cuda/binet` to `/usr/local/cuda/bin`, ensuring the correct path to CUDA binaries is set.	2024-02-07 09:51:02 -08:00
Scott McKay	36d223676b	Use GraphViewer.IsConstantInitializer in NNAPI EP. (#19401 ) ### Description <!-- Describe your changes. --> An overridable initializer should not have a fixed value included in an NNAPI model as it could be changed at runtime. The current check doesn't include validating that the initializer is constant. I was updating GetClipMinMax as part of adding CoreML EP ML Program support, and in order to make both CoreML and NNAPI do the more correct thing of using IsConstantInitializer this set of changes was required. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> Make NNAPI and CoreML EPs more correct. --------- Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>	2024-02-07 14:01:51 +10:00
Patrice Vignola	302d4be7d9	[DML EP] Fix external data unpacking (#19415 ) ### Description This change `55a669409a` didn't take into account external data when unpacking initializer, and therefore crashes when trying to unpack them.	2024-02-06 17:10:55 -08:00
Maximilian Müller	91b2e660fe	[Build] fix: missing nvcc flags when compiling with unittests (#19308 ) When configured using the following CMake ops Clion is not able to configure due to checking with `nvcc ... --dryrun tmp.cu`: ``` cmake -G Ninja -Donnxruntime_USE_TENSORRT="ON" -Donnxruntime_USE_CUDA="ON" -Donnxruntime_USE_CUDA_NHWC_OPS="ON" -DCMAKE_CUDA_ARCHITECTURES="native" -Donnxruntime_NVCC_THREADS=1 -Donnxruntime_ENABLE_NVTX_PROFILE="ON" -Donnxruntime_USE_TENSORRT_BUILTIN_PARSER="ON" -DCMAKE_CUDA_COMPILER_LAUNCHER="ccache" -Donnxruntime_BUILD_UNIT_TESTS="ON" -Donnxruntime_USE_TRITON_KERNEL=OFF -Donnxruntime_USE_FLASH_ATTENTION=OFF ``` Without building the unittests everything works fine. I believe my changes only follow the logic that is actually desired. If `NVCC_HAS_STRICT_ALIASING` is set to false it should not be possible to add this as a CUDA flag. Same is true for `HAS_NOERROR` as seen in `adjust_global_compile_flags.cmake`	2024-02-06 17:01:26 -08:00
Edward Chen	df5c6718bd	Remove iOS simulator max runtime version limit. (#19396 )	2024-02-06 14:54:06 -08:00
Tianlei Wu	bedf0eee73	[CUDA] Add use_tf32 provider option (for FP32 GEMM) (#19357 ) [TF32](https://blogs.nvidia.com/blog/tensorfloat-32-precision-format/) could help boost performance on GPU of SM >= 80. Sometime, user observes accuracy loss, or need disable TF32 for testing purpose. To disable TF32, it is also possible to set environment variable `NVIDIA_TF32_OVERRIDE = 0`. However, sometime we do not want to use environment variable to avoid impacting other applications, or want to have finer control (like one session using TF32, and another session not). This provider option could help. Here we add a provider option `use_tf32`. When `use_tf32 = 0`, we will disable TF32 for float MatMul/GEMM in cublas. It applies to MatMulNBits, Attention, LongformerAttention, PackedAttention, PackedMultiHeadAttention operators when float GEMM is used internally in the operator. Note that it will not impact other data type, like fp8 gemm could still use TF32 in accumulation. Previously, cublasGemmStridedBatchedHelper does not use TF32 in inference. Here we enabled TF32 by default, so we might observe speed up for FP32 transformers models on SM >= 80. There is another PR that enables the option for cuDNN Conv later. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> https://github.com/microsoft/onnxruntime/issues/15407 https://github.com/microsoft/onnxruntime/issues/19288	2024-02-06 13:31:33 -08:00
Tianlei Wu	c4b49fb7bf	[CUDA] remove CUBLAS_TENSOR_OP_MATH mode (#19431 ) This pull request replaces `CUBLAS_TENSOR_OP_MATH` with `CUBLAS_DEFAULT_MATH`. The changes affect several files, including test cases and a Python script for AMD hipify process. ### Motivation and Context CUBLAS_TENSOR_OP_MATH mode is deprecated: https://docs.nvidia.com/cuda/cublas/index.html#cublasmath-t On CUDA versions prior to 11, users are required to set the math mode to CUBLAS_TENSOR_OP_MATH manually to be able to use tensor cores for FP16. On CUDA 11 and CUDA 12, this is no longer required. Since latest ORT only supports CUDA >= 11 so it is safe to remove CUBLAS_TENSOR_OP_MATH from our code base.	2024-02-06 12:48:39 -08:00
dependabot[bot]	61b0e04b03	Bump github/issue-labeler from 3.3 to 3.4 (#19410 ) Bumps [github/issue-labeler](https://github.com/github/issue-labeler) from 3.3 to 3.4. <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/github/issue-labeler/releases">github/issue-labeler's releases</a>.</em></p> <blockquote> <h2>v3.4</h2> <h2>What's Changed</h2> <ul> <li>Fix warning by update node version by <a href="https://github.com/renanfranca"><code>@renanfranca</code></a> in <a href="https://redirect.github.com/github/issue-labeler/pull/82">github/issue-labeler#82</a></li> </ul> <h2>New Contributors</h2> <ul> <li><a href="https://github.com/renanfranca"><code>@renanfranca</code></a> made their first contribution in <a href="https://redirect.github.com/github/issue-labeler/pull/82">github/issue-labeler#82</a></li> </ul> <p><strong>Full Changelog</strong>: <a href="https://github.com/github/issue-labeler/compare/v3.3...v3.4">https://github.com/github/issue-labeler/compare/v3.3...v3.4</a></p> </blockquote> </details> <details> <summary>Commits</summary> <ul> <li><a href="`c1b0f9f52a`"><code>c1b0f9f</code></a> v3.4 release</li> <li><a href="`50f02baa95`"><code>50f02ba</code></a> Fix warning by update node version (<a href="https://redirect.github.com/github/issue-labeler/issues/82">#82</a>)</li> <li><a href="`a8d4f1b8e8`"><code>a8d4f1b</code></a> Update README.md</li> <li>See full diff in <a href="https://github.com/github/issue-labeler/compare/v3.3...v3.4">compare view</a></li> </ul> </details> <br /> [![Dependabot compatibility score](https://dependabot-badges.githubapp.com/badges/compatibility_score?dependency-name=github/issue-labeler&package-manager=github_actions&previous-version=3.3&new-version=3.4)](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores) Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting `@dependabot rebase`. [//]: # (dependabot-automerge-start) [//]: # (dependabot-automerge-end) --- <details> <summary>Dependabot commands and options</summary> <br /> You can trigger Dependabot actions by commenting on this PR: - `@dependabot rebase` will rebase this PR - `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it - `@dependabot merge` will merge this PR after your CI passes on it - `@dependabot squash and merge` will squash and merge this PR after your CI passes on it - `@dependabot cancel merge` will cancel a previously requested merge and block automerging - `@dependabot reopen` will reopen this PR if it is closed - `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually - `@dependabot show <dependency name> ignore conditions` will show all of the ignore conditions of the specified dependency - `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself) </details> Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-02-06 11:02:24 -08:00
Yongxin Wang	cfe93a1eab	fix type annotation for sess_options (#19393 ) ### Description Changed the type annotation of sess_options in InferenceSession's `__init__` method ### Motivation and Context sess_options is one `SessionOptions`, not a sequence of it. It is passed directly into `C.InferenceSession`, and from the definition of [`C.InferenceSession`](`efc17e79de/onnxruntime/python/onnxruntime_pybind_state.cc (L1790)`), we can see that it is not a sequence: ```cpp py::class_<PyInferenceSession>(m, "InferenceSession", R"pbdoc(This is the main class used to run a model.)pbdoc") // In Python3, a Python bytes object will be passed to C++ functions that accept std::string or char* // without any conversion. So this init method can be used for model file path (string) and model content (bytes) .def(py::init([](const PySessionOptions& so, const std::string arg, bool is_arg_file_name, bool load_config_from_model = false) { ```	2024-02-06 10:58:46 -08:00
Yulong Wang	5ff27ef02a	[js/webgpu] support customop FastGelu (#19392 ) ### Description Support WebGPU custom operator FastGelu.	2024-02-06 09:07:31 -08:00
Yulong Wang	a4cfdc1c28	update comments for nodejs binding artifact preparation. (#19425 ) ### Description document update as a following-up for #19274	2024-02-05 22:58:35 -08:00
Tianlei Wu	23996bbbbe	[CUDA][ROCm][Training] Fix cuda/rocm provider info hash (#19398 ) When I test a new provider option, the training pipeline failed. I found that training uses hash code of provider info to try get provider instance. If a provider option is not used in hashing, the provider instance fetched from cache might have different configuration for that option. Here I fix the hashing to use all provider options (except the default Arena config that cannot be set from python API since training is used with PyTorch in most cases). Fixed a few obvious typo in the touched files. Add regression test cases.	2024-02-05 14:35:57 -08:00
Jian Chen	06a84c8a0d	Enable DML on Windows and CUDA on Linux for Node.js binding (#19274 ) This pull request includes modifications to the `c-api-cpu.yml` Azure Pipelines configuration file. The changes mainly revolve around the Node.js packaging stage and the handling of Node.js artifacts. The most significant changes include renaming the Node.js packaging stage, adding a new dependency to the stage, changing artifact names, adding a new script to list Node.js artifacts, and updating the source folder for copying NuGet binaries. Changes in Node.js packaging: * [`tools/ci_build/github/azure-pipelines/templates/c-api-cpu.yml`](diffhunk://#diff-00815920cc190d10fdebceac0c3a4b8a59e408684ae38177dfe7f96cae276c59L503-R508): Renamed the Node.js packaging stage from `Nodejs_Packaging_CPU` to `Nodejs_Packaging` and added `Windows_CI_GPU_DML_Dev` as a new dependency to the stage. Changes in handling of Node.js artifacts: * [`tools/ci_build/github/azure-pipelines/templates/c-api-cpu.yml`](diffhunk://#diff-00815920cc190d10fdebceac0c3a4b8a59e408684ae38177dfe7f96cae276c59L568-R569): Changed the artifact name from `drop-onnxruntime-nodejs-win-x64` to `drop-onnxruntime-nodejs-win-x64-dml` in the task to download pipeline artifacts for Windows x64. * [`tools/ci_build/github/azure-pipelines/templates/c-api-cpu.yml`](diffhunk://#diff-00815920cc190d10fdebceac0c3a4b8a59e408684ae38177dfe7f96cae276c59R595-R598): Added a new script to list Node.js artifacts from the directory `$(Build.BinariesDirectory)/nodejs-artifacts/win32/x64/`. * [`tools/ci_build/github/azure-pipelines/templates/c-api-cpu.yml`](diffhunk://#diff-00815920cc190d10fdebceac0c3a4b8a59e408684ae38177dfe7f96cae276c59L635-R640): Updated the source folder from `$(Build.BinariesDirectory)\RelWithDebInfo\RelWithDebInfo\nuget-artifacts\onnxruntime-win-x64\lib` to `$(Build.BinariesDirectory)\nodejs-artifacts\win32\x64` in the task to copy NuGet binaries to the directory `$(Build.SourcesDirectory)\js\node\bin\napi-v3\win32\x64`. --------- Co-authored-by: Yulong Wang <7679871+fs-eire@users.noreply.github.com>	2024-02-05 14:33:58 -08:00
aciddelgado	d2d9b5b5f9	fix output shape inference packed gqa (#19374 ) ### Description fix output shape inference packed gqa	2024-02-05 13:53:39 -08:00
Prathik Rao	d120104dcd	add ATen support for bicubic interpolation (#19380 ) ### Description <!-- Describe your changes. --> Add ATen fallback support for bicubic interpolation algorithm. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> Required for facebook/dinov2 model architecture as part of ONNX Runtime integration with AML Vision models.	2024-02-05 13:11:37 -08:00
Ye Wang	aaf32fb1b1	phi2 conversion/optimization script (#19338 ) ### Description <!-- Describe your changes. --> This PR adds onnx conversion script for dynamo exported phi2, optimization script, and inference example script A readme file is added as documentation. https://github.com/microsoft/onnxruntime/tree/wangye/phi2_doc/onnxruntime/python/tools/transformers/models/phi2#readme ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> --------- Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>	2024-02-05 10:15:16 -08:00
dependabot[bot]	e6d3518db9	Bump gradle/gradle-build-action from 2 to 3 (#19297 ) Bumps [gradle/gradle-build-action](https://github.com/gradle/gradle-build-action) from 2 to 3. <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/gradle/gradle-build-action/releases">gradle/gradle-build-action's releases</a>.</em></p> <blockquote> <h2>v3.0.0-rc.1</h2> <p>First release candidate of <code>gradle/gradle-build-action@v3.0.0</code>. This release candidate will the first release available under the <code>v3</code> version tag.</p> <blockquote> <p>[!IMPORTANT] As of <code>v3</code> this action has been superceded by <code>gradle/actions/setup-gradle</code>. Any workflow that uses <code>gradle/gradle-build-action@v3</code> will transparently delegate to <code>gradle/actions/setup-gradle@v3</code>.</p> <p>Users are encouraged to update their workflows, replacing:</p> <pre><code>uses: gradle/gradle-build-action@v3 </code></pre> <p>with</p> <pre><code>uses: gradle/actions/setup-gradle@v3 </code></pre> <p>See the <a href="https://github.com/gradle/actions/tree/main/setup-gradle">setup-gradle documentation</a> for up-to-date documentation for <code>gradle/actons/setup-gradle</code>.</p> </blockquote> <h2>Changes from <code>gradle-build-action@v2</code></h2> <p>This release brings some useful and much requested features, including:</p> <ul> <li>save and restore the Gradle configuration-cache data</li> <li>add the Job summary content as a PR comment</li> <li>easily publish Build Scans® to the free <a href="https://scans.gradle.com">Gradle Build Scan service</a></li> <li>compatibility with Node 20</li> </ul> <p>The only major breaking change from <code>gradle-build-action@v2.12.0</code> is the update to require a Node 20 runtime environment. Aside from that change, this release should generally serve as a drop-in replacement for <code>gradle-build-action@v2</code>.</p> <h3>Changelog</h3> <ul> <li>[NEW] - Run with NodeJs 20.x (<a href="https://redirect.github.com/gradle/gradle-build-action/issues/946">gradle/gradle-build-action#946</a>)</li> <li>[NEW] - Support for save & restore of configuration-cache data (<a href="https://redirect.github.com/gradle/gradle-build-action/issues/966">gradle/gradle-build-action#966</a>)</li> <li>[NEW] - Support for automatic adding PR comment with Job Summary content (<a href="https://redirect.github.com/gradle/gradle-build-action/issues/1020">gradle/gradle-build-action#1020</a>)</li> <li>[NEW] - Make it easy to publish a Build Scan® to <a href="https://scans.gradle.com">https://scans.gradle.com</a> (<a href="https://redirect.github.com/gradle/gradle-build-action/issues/1044">gradle/gradle-build-action#1044</a>)</li> <li>[NEW] - Added <code>dependency-graph-continue-on-failure</code> input, which can be set to <code>false</code> to force the Job to fail when dependency graph submission fails (<a href="https://redirect.github.com/gradle/gradle-build-action/issues/1036">gradle/gradle-build-action#1036</a>). Failure modes include: <ul> <li>Fail build step if version of Gradle being executed is not supported for dependency-graph generation (<a href="https://redirect.github.com/gradle/gradle-build-action/issues/1034">gradle/gradle-build-action#1034</a>)</li> <li>Fail job if permissions are insufficient to submit dependency graph via Dependency Submission API (<a href="https://redirect.github.com/gradle/gradle-build-action/issues/997">gradle/gradle-build-action#997</a>)</li> </ul> </li> <li>[NEW] - Add <code>dependency-graph: clear</code> option to clear any dependency-graph previously submitted by the job</li> <li>[FIX] Allow cache entries to be reused by jobs with the same ID in different workflows (<a href="https://redirect.github.com/gradle/gradle-build-action/issues/1017">gradle/gradle-build-action#1017</a>) <ul> <li>Workflow name remains part of the cache key, but cache entries generated by the same job id in a different workflow may be restored</li> </ul> </li> <li>[FIX] Register pre-installed JDKs in Maven toolchains.xml file (<a href="https://redirect.github.com/gradle/gradle-build-action/issues/1024">gradle/gradle-build-action#1024</a>) <ul> <li>This allows pre-installed JDKs to be auto-detected by Gradle Toolchain support on Windows</li> </ul> </li> <li>[FIX] - Update the Gradle Enterprise injection configuration for product rename to Develocity (<a href="https://redirect.github.com/gradle/gradle-build-action/issues/995">gradle/gradle-build-action#995</a>)</li> <li>[FIX] - Avoid submitting an empty dependency graph when state is loaded from configuration-cache</li> <li>[DEPRECATION] - Deprecation of the arguments parameter (<a href="https://redirect.github.com/gradle/gradle-build-action/issues/996">gradle/gradle-build-action#996</a>)</li> <li>[BREAKING CHANGE] - Remove the <code>gradle-executable</code> input parameter. Use a separate workflow Step to execute a Gradle from a custom location.</li> </ul> <!-- raw HTML omitted --> </blockquote> <p>... (truncated)</p> </details> <details> <summary>Commits</summary> <ul> <li><a href="`4a8703fa34`"><code>4a8703f</code></a> Delegate to 'setup-gradle@v3.0.0-rc.1'</li> <li><a href="`4a39eedb8c`"><code>4a39eed</code></a> Mention setup-gradle in README</li> <li><a href="`272883a7ba`"><code>272883a</code></a> Remove all action sources: these have been migrated to 'gradle/actions'</li> <li><a href="`2a8bfcf231`"><code>2a8bfcf</code></a> Delegate action implementation to gradle/actions/setup-gradle</li> <li><a href="`e1ada08a9a`"><code>e1ada08</code></a> Bump the github-actions group with 1 update (<a href="https://redirect.github.com/gradle/gradle-build-action/issues/1047">#1047</a>)</li> <li><a href="`a8e3e5e2b4`"><code>a8e3e5e</code></a> Apply dependency version updates</li> <li><a href="`2be01ca1c6`"><code>2be01ca</code></a> Build outputs</li> <li><a href="`a00827eebb`"><code>a00827e</code></a> Bump the npm-dependencies group with 7 updates</li> <li><a href="`ad80850e98`"><code>ad80850</code></a> Bump the github-actions group with 2 updates</li> <li><a href="`bd6d0a74d4`"><code>bd6d0a7</code></a> Configure explicit java version for config-cache test</li> <li>Additional commits viewable in <a href="https://github.com/gradle/gradle-build-action/compare/v2...v3">compare view</a></li> </ul> </details> <br /> [![Dependabot compatibility score](https://dependabot-badges.githubapp.com/badges/compatibility_score?dependency-name=gradle/gradle-build-action&package-manager=github_actions&previous-version=2&new-version=3)](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores) Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting `@dependabot rebase`. [//]: # (dependabot-automerge-start) [//]: # (dependabot-automerge-end) --- <details> <summary>Dependabot commands and options</summary> <br /> You can trigger Dependabot actions by commenting on this PR: - `@dependabot rebase` will rebase this PR - `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it - `@dependabot merge` will merge this PR after your CI passes on it - `@dependabot squash and merge` will squash and merge this PR after your CI passes on it - `@dependabot cancel merge` will cancel a previously requested merge and block automerging - `@dependabot reopen` will reopen this PR if it is closed - `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually - `@dependabot show <dependency name> ignore conditions` will show all of the ignore conditions of the specified dependency - `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself) </details> Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-02-05 09:41:57 -08:00
Yi Zhang	435e19953e	Fix llama.covert_onnx to make it runnable in CI (#19372 ) ### Description 1. make parity_check use local model to avoid using hf token 2. del the model didn't work because it tried to del the object define out of the function scope. So it caused out of memory in A10. 3. In fact, 16G GPU memory (one T4) is enough. But the conversion process always be killed in T4 and it works on A10/24G. Standard_NC4as_T4_v3 has 28G CPU memory Standard_NV36ads_A10_v5 has 440G memory. It looks that the model conversion needs very huge memory. ### Motivation and Context Last time, I came across some issues in convert_to_onnx.py so I use the onnx model in https://github.com/microsoft/Llama-2-Onnx for testing. Now, these issues could be fixed. So I use onnx model generated by this repo and the CI can cover the model conversion.	2024-02-05 07:26:24 +08:00
PeixuanZuo	0cba56e0a0	[ROCm] Fix CI pipeline by fixing pytest version (#19407 ) Fix pytest version to 7.4.4, higher version will cause error `from onnxruntime.capi import onnxruntime_validation ModuleNotFoundError: No module named 'onnxruntime.capi'`	2024-02-04 16:37:36 +08:00
Scott McKay	debd1cab10	Add coremltools 7.1 as a dependency (#19389 ) ### Description <!-- Describe your changes. --> Setup usage of coremltools via dependencies instead of copying files. Pull in some changes from https://github.com/microsoft/onnxruntime/pull/19347 in preparation for supporting ML Program and enabling building the ML Model on all platforms to make development and testing of CoreML EP code easier. - Update to coremltools 7.1 - Add patch for changes required for cross platform build of ML Program related code - Generate coreml proto files on all platforms - mainly to test these changes work everywhere, as the proto files will be used on all platforms when #19347 is checked in - rename onnxruntime_coreml_proto target to coreml_proto as it contains purely coreml protobuf code with no ORT related chagnes ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> Improve setup.	2024-02-03 09:42:21 +10:00
Tianlei Wu	18c3acb198	update import in convert_generation.py (#19385 ) Fix for https://github.com/microsoft/onnxruntime/issues/19376 - Use absolute import instead of relative import for now. - Fix some typo	2024-02-02 10:16:37 -08:00
Jiajia Qin	ccbe264a39	[js/webgpu] Add LeakyRelu activation for fusedConv (#19369 ) ### Description This PR 1) adds LeakyRelu activation for fusedConv; 2) makes `vec4<f16>` value work with `float32` uniforms attributes. For example: `clamp(value, vec4<f16>(uniforms.clip_min), vec4<f16>(uniforms.clip_max)` will throw compilation errors since `uniforms.clip_min` and `uniforms.clip_min` are `f32` not `f16`. So we need to change it to `clamp(value, vec4<f16>(f16(uniforms.clip_min)), vec4<f16>(f16(uniforms.clip_max))` And above problem was introduced when we make activation attributes as uniforms instead of constant. BTW, after adding LeakyRelu, `realesrgan-t256` model can pass.	2024-02-02 09:06:38 -08:00
Yulong Wang	50806a7dd5	[js/web] support external data in npm test (#19377 ) ### Description support external data in npm test. This allows test runner to detect whether an external data is available in the test folder, and if it is, load it as external data automatically. this feature does not parse every model to figure out whether the model has external data. the following comments in code explained how to determine whether should parse the model file. ```js // for performance consideration, we do not parse every model. when we think it's likely to have external // data, we will parse it. We think it's "likely" when one of the following conditions is met: // 1. any file in the same folder has the similar file name as the model file // (e.g., model file is "model_abc.onnx", and there is a file "model_abc.pb" or "model_abc.onnx.data") // 2. the file size is larger than 1GB ```	2024-02-02 09:05:57 -08:00
Jiajia Qin	efc17e79de	[js/webgpu] Fix the undefined push error (#19366 ) ### Description This PR fixes below errors when enable webgpu profiling: ``` TypeError: Cannot read properties of undefined (reading 'push') ```	2024-02-02 02:04:06 -08:00

1 2 3 4 5 ...

10540 commits