onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-05-17 21:10:43 +00:00

Author	SHA1	Message	Date
Edward Chen	a3a1237270	Disable xcpretty filtering of xcodebuild output in iOS packaging pipeline. (#17429 )	2023-09-06 09:04:17 -07:00
Changming Sun	c6b0d185b4	Update cmake to 3.27 and upgrade Linux CUDA docker files from CentOS7 to UBI8 (#16856 ) ### Description 1. Update docker files and their build instructions. ARM64 and x86_64 can use the same docker file. 2. Upgrade Linux CUDA pipeline's base docker image from CentOS7 to UBI8 AB#18990	2023-09-05 18:12:10 -07:00
aciddelgado	44101e8771	Flash Attention v2 MHA (#17227 )

### Description
Integrate Flash Attention V2 into the PackedMultiHeadAttention, MultiHeadAttention and Attention operators.

The Flash Attention v2 source code is from https://github.com/Dao-AILab/flash-attention/tree/main/csrc/flash_attn/src. We made some changes to remove the dependency on Torch, then removed the backward and bfloat16-related code.

Add a benchmark script (see benchmark_mha.sh) to compare different attention kernels for the MultiHeadAttention operator.

Current limitations for Flash Attention in the PackedMultiHeadAttention, MultiHeadAttention and Attention operators:
* Relative Position Bias is not supported
* Different hidden size for Q and V is not supported
* Only float16 is supported
* Padding/attention mask is not supported
* For MultiHeadAttention, when there is past or present input, bias shall be provided to activate flash attention
* For Attention, past or present inputs will deactivate flash attention
* Causal is not supported

Some limitations (like attention mask and causal) might be removed later.

Currently, Flash Attention v2 only works on Linux. For Windows, we will enable it later with Cutlass 3.2.

Two environment variables can be used for testing purposes:
(1) `ORT_DISABLE_FLASH_ATTENTION` to disable flash attention. Default value is 0 (enable). Set it to "1" to disable it.
(2) `ORT_MIN_SEQ_LEN_FLASH_ATTENTION_PACKED_QKV`. Default value is "513", which means that we only enable flash attention when sequence length is larger than 512 for packed QKV format. Set it to "0" if you want to use flash attention v2 whenever possible.

### Speedup
The following results are from a Standard_ND96amsr_A100_v4 VM (A100-SXM4-80GB GPU) using benchmark_mha.sh. The metric is TFLOPs per second for the MultiHeadAttention operator.

There are 3 input formats:
* `Q,K,V` means separated inputs query, key and value of BxSxNH
* `Q,KV` means packed KV, where key is 5D: BxSxNx2xH
* `QKV` means packed QKV, where query is 5D: BxSxNx3xH

Note that flash attention cannot use the packed QKV format, so an extra Transpose is needed. We found that the TensorRT kernel is faster for sequence length <= 512 for packed QKV. The reason might be that no transpose is needed for the TensorRT kernel in this format.

We also notice that the TensorRT kernel is faster for a stable diffusion 512x512 image (see seq_len=4096, heads=8, head_dim=40 below), while flash attention v2 is faster for a 1024x1024 image (see seq_len=16384, heads=8, head_dim=40 below).

input format | batch size | sequence length | heads | head dim | flash_v2 (TFLOPs/s) | TensorRT (TFLOPs/s) | Memory Efficient Attention (TFLOPs/s)
-- | -- | -- | -- | -- | -- | -- | --
Q,K,V | 32 | 512 | 64 | 32 | 78.1 | 60.0 | 39.3
Q,K,V | 32 | 512 | 128 | 16 | 46.8 | 44.1 | 21.7
Q,K,V | 16 | 1024 | 64 | 32 | 99.0 | 72.8 | 44.3
Q,K,V | 16 | 1024 | 128 | 16 | 54.7 | 49.2 | 23.4
Q,K,V | 8 | 2048 | 64 | 32 | 113.8 | 81.2 | 47.8
Q,K,V | 8 | 2048 | 128 | 16 | 59.7 | 51.9 | 24.7
Q,K,V | 4 | 4096 | 64 | 32 | 122.5 | 85.6 | 49.7
Q,K,V | 4 | 4096 | 128 | 16 | 62.5 | 53.3 | 25.3
Q,K,V | 2 | 8192 | 64 | 32 | 127.4 | 87.5 | 50.7
Q,K,V | 2 | 8192 | 128 | 16 | 64.0 | 54.2 | 25.6
Q,K,V | 1 | 16384 | 64 | 32 | 129.5 | 91.0 | 51.2
Q,K,V | 1 | 16384 | 128 | 16 | 64.7 | 54.5 | 25.8
Q,K,V | 1 | 4096 | 8 | 40 | 51.0 | 43.6 | 36.8
Q,K,V | 1 | 4096 | 8 | 80 | 97.7 | 77.0 | 55.5
Q,K,V | 1 | 4096 | 8 | 160 | 120.0 | 39.7 | 57.8
Q,K,V | 4 | 4096 | 8 | 40 | 89.0 | 84.4 | 49.2
Q,K,V | 4 | 4096 | 8 | 80 | 133.0 | 92.2 | 63.2
Q,K,V | 4 | 4096 | 8 | 160 | 164.8 | 42.7 | 63.8
Q,K,V | 1 | 16384 | 8 | 40 | 96.9 | 91.3 | 52.1
Q,K,V | 1 | 16384 | 8 | 80 | 142.9 | 101.5 | 65.6
Q,K,V | 1 | 16384 | 8 | 160 | 177.4 | 44.2 | 65.7
Q,K,V | 128 | 128 | 12 | 64 | 29.0 | 26.9 | 25.7
Q,K,V | 64 | 128 | 12 | 64 | 23.1 | 10.8 | 21.3
Q,K,V | 128 | 384 | 12 | 64 | 83.5 | 60.8 | 55.7
Q,K,V | 64 | 384 | 12 | 64 | 72.6 | 40.5 | 52.8
Q,K,V | 128 | 512 | 12 | 64 | 98.9 | 77.9 | 62.1
Q,K,V | 64 | 512 | 12 | 64 | 94.7 | 75.6 | 60.4
Q,KV | 32 | 512 | 64 | 32 | 85.9 | 41.1 | 41.1
Q,KV | 32 | 512 | 128 | 16 | 47.1 | 21.6 | 21.6
Q,KV | 16 | 1024 | 64 | 32 | 104.4 | 45.8 | 45.8
Q,KV | 16 | 1024 | 128 | 16 | 54.7 | 23.6 | 23.6
Q,KV | 8 | 2048 | 64 | 32 | 116.8 | 48.5 | 48.5
Q,KV | 8 | 2048 | 128 | 16 | 59.8 | 24.7 | 24.7
Q,KV | 4 | 4096 | 64 | 32 | 124.2 | 50.1 | 50.1
Q,KV | 4 | 4096 | 128 | 16 | 62.6 | 25.3 | 25.3
Q,KV | 2 | 8192 | 64 | 32 | 128.5 | 50.8 | 50.9
Q,KV | 2 | 8192 | 128 | 16 | 64.1 | 25.6 | 25.6
Q,KV | 1 | 16384 | 64 | 32 | 129.4 | 51.2 | 51.2
Q,KV | 1 | 16384 | 128 | 16 | 64.8 | 25.8 | 25.8
Q,KV | 1 | 4096 | 8 | 40 | 67.5 | 37.7 | 37.5
Q,KV | 1 | 4096 | 8 | 80 | 101.3 | 56.7 | 56.6
Q,KV | 1 | 4096 | 8 | 160 | 124.0 | 58.6 | 58.6
Q,KV | 4 | 4096 | 8 | 40 | 90.8 | 49.8 | 49.8
Q,KV | 4 | 4096 | 8 | 80 | 135.6 | 63.8 | 63.8
Q,KV | 4 | 4096 | 8 | 160 | 166.3 | 64.5 | 64.5
Q,KV | 1 | 16384 | 8 | 40 | 97.5 | 52.3 | 52.3
Q,KV | 1 | 16384 | 8 | 80 | 143.5 | 65.9 | 65.8
Q,KV | 1 | 16384 | 8 | 160 | 178.4 | 65.9 | 65.8
Q,KV | 128 | 128 | 12 | 64 | 26.8 | 48.1 | 30.9
Q,KV | 64 | 128 | 12 | 64 | 28.0 | 38.9 | 25.0
Q,KV | 128 | 384 | 12 | 64 | 97.7 | 61.1 | 61.0
Q,KV | 64 | 384 | 12 | 64 | 89.5 | 57.8 | 57.9
Q,KV | 128 | 512 | 12 | 64 | 111.9 | 66.7 | 66.9
Q,KV | 64 | 512 | 12 | 64 | 107.2 | 64.9 | 64.8
QKV | 32 | 512 | 64 | 32 | 77.2 | 84.7 | 39.3
QKV | 32 | 512 | 128 | 16 | 43.4 | 53.1 | 20.9
QKV | 16 | 1024 | 64 | 32 | 98.8 | 87.4 | 44.6
QKV | 16 | 1024 | 128 | 16 | 52.0 | 54.1 | 23.2
QKV | 8 | 2048 | 64 | 32 | 113.1 | 89.0 | 47.9
QKV | 8 | 2048 | 128 | 16 | 58.2 | 54.6 | 24.5
QKV | 4 | 4096 | 64 | 32 | 120.6 | 89.7 | 49.7
QKV | 4 | 4096 | 128 | 16 | 61.7 | 54.6 | 25.2
QKV | 2 | 8192 | 64 | 32 | 125.9 | 89.5 | 50.7
QKV | 2 | 8192 | 128 | 16 | 63.6 | 54.8 | 25.5
QKV | 1 | 16384 | 64 | 32 | 128.5 | 92.0 | 51.2
QKV | 1 | 16384 | 128 | 16 | 64.6 | 54.8 | 25.7
QKV | 1 | 4096 | 8 | 40 | 60.2 | 69.8 | 38.1
QKV | 1 | 4096 | 8 | 80 | 101.6 | 75.2 | 56.7
QKV | 1 | 4096 | 8 | 160 | 130.2 | 41.2 | 58.4
QKV | 4 | 4096 | 8 | 40 | 90.6 | 91.0 | 49.5
QKV | 4 | 4096 | 8 | 80 | 133.6 | 98.1 | 62.8
QKV | 4 | 4096 | 8 | 160 | 165.3 | 43.7 | 63.9
QKV | 1 | 16384 | 8 | 40 | 97.2 | 92.8 | 52.1
QKV | 1 | 16384 | 8 | 80 | 143.0 | 103.1 | 65.6
QKV | 1 | 16384 | 8 | 160 | 177.6 | 44.5 | 65.7
QKV | 128 | 128 | 12 | 64 | 31.1 | 65.9 | 27.6
QKV | 64 | 128 | 12 | 64 | 26.1 | 49.8 | 23.5
QKV | 128 | 384 | 12 | 64 | 84.6 | 88.5 | 56.1
QKV | 64 | 384 | 12 | 64 | 79.1 | 80.3 | 53.5
QKV | 128 | 512 | 12 | 64 | 97.3 | 114.2 | 62.2
QKV | 64 | 512 | 12 | 64 | 95.9 | 110.7 | 60.6
QKV | 4 | 2048 | 32 | 128 | 125.26 | 44.72 | 78.15
QKV | 4 | 4096 | 32 | 128 | 141.62 | 46.29 | 85.84
QKV | 8 | 2048 | 32 | 128 | 127.40 | 45.49 | 78.75
QKV | 8 | 4096 | 32 | 128 | 144.24 | 46.60 | 86.95

### Known Issues
NVCC uses a huge amount of memory while compiling the flash attention CUDA kernel. A Linux build with CUDA might fail when the machine has limited memory and a large number of CPUs. The workaround is to use a build machine with more memory, or to use an argument like `--nvcc_threads 1` to limit nvcc threads in the build.

### Motivation and Context
Increases speed and efficiency of MHA or Packed MHA.

---------
Co-authored-by: Tianlei Wu <tlwu@microsoft.com>
Co-authored-by: tlwu@microsoft.com <tlwu@a100.crj0ad2y1kku1j4yxl4sj10o4e.gx.internal.cloudapp.net>	2023-08-31 13:52:21 -07:00
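The two environment variables described in the Flash Attention v2 commit above can be read as a simple gate for the packed QKV format. The following is a minimal sketch of that documented behaviour only, not ONNX Runtime's actual implementation: the function name `flash_attention_allowed_for_packed_qkv` and its `env` parameter are hypothetical, and the other limitations listed in the commit (float16 only, no mask, and so on) are not modelled.

```python
import os


def flash_attention_allowed_for_packed_qkv(seq_len, env=None):
    """Hypothetical helper mirroring the documented env-var semantics.

    Assumption: only the two variables named in the commit message are
    consulted; every other eligibility rule is ignored here.
    """
    env = os.environ if env is None else env

    # "1" disables flash attention entirely; the default "0" leaves it enabled.
    if env.get("ORT_DISABLE_FLASH_ATTENTION", "0") == "1":
        return False

    # Default "513": flash attention is used for packed QKV only when the
    # sequence length is larger than 512. "0" means "whenever possible".
    min_seq_len = int(env.get("ORT_MIN_SEQ_LEN_FLASH_ATTENTION_PACKED_QKV", "513"))
    return seq_len >= min_seq_len


if __name__ == "__main__":
    for seq_len in (128, 512, 513, 4096):
        print(seq_len, flash_attention_allowed_for_packed_qkv(seq_len, env={}))
```

For a real test run, the variables would presumably be set in the process environment before the session is created, for example `os.environ["ORT_MIN_SEQ_LEN_FLASH_ATTENTION_PACKED_QKV"] = "0"`.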
Rachel Guo	b54619509f	Refine build script for adding disable selected data types option (#17284 ) ### Description As title. ### Motivation and Context We now have multiple data types that we want to disable for minimal builds to reduce binary size, so it may be worth adding a build script argument for specifying them. Also, disabling fp16 for all minimal builds may be too restrictive. --------- Co-authored-by: rachguo <rachguo@rachguos-Mac-mini.local>	2023-08-31 13:32:55 -07:00
Yi Zhang	507a40e1e9	Add compiler cache in Linux GPU TensorRT CI. (#17348 ) ### Description Add the compiler cache to the Linux GPU TensorRT CI. This saves about 30 minutes on the GPU machine (52 minutes -> 24 minutes). PS: there are only whitespace differences in the Dockerfile.	2023-08-31 08:13:26 +08:00
Jian Chen	081c0692a4	Update nodejs version from 16 to 18.17.1 (#17351 ) ### Description Update nodejs version from 16 to 18.17.1. ### Motivation and Context Node.js 16 will reach EOL in September 2023.	2023-08-30 12:41:48 -07:00
Changming Sun	71da0824f3	Upgrade binskim and fix an error in nuget packaging pipeline (#17340 ) ### Description Upgrade binskim and fix an error in nuget packaging pipeline.	2023-08-30 07:52:06 -07:00
Jian Chen	922629aad8	Upgrade CentOS 7 to AlmaLinux 8 (#16907 ) ### Motivation and Context Get the latest gcc 12 by default. --------- Co-authored-by: Changming Sun <chasun@microsoft.com>	2023-08-29 21:05:36 -07:00
Yi Zhang	d4a61ac71f	PR triggers generated by code (#17247 ) ### Description 1. Refactor the trigger rules generation. 2. Skip all doc changes in PR pipelines. ### Motivation and Context Generate all trigger rules by running set-trigger-rules.py to reduce inconsistencies. It's easy to make mistakes when copying and pasting manually. For example, these 2 excludes are different. Why? `4e6cec4d09/tools/ci_build/github/azure-pipelines/linux-ci-pipeline.yml (L16-L18)` `4e6cec4d09/tools/ci_build/github/azure-pipelines/linux-gpu-ci-pipeline.yml (L27-L29)` ### Note All changes in workflow yamls are generated by code. Please review skip-js.yml, skip-docs.yml and set-trigger-rules.py. @fs-eire, please double check the filter rules in skip-js.yml and the skipped workflows `7023c2edff/tools/ci_build/set-trigger-rules.py (L14-L41)`	2023-08-30 05:57:03 +08:00
Yi Zhang	0e9e9b2a67	Fix one exception in post merge (#17327 )	2023-08-29 19:24:50 +08:00
cloudhan	bf8b1681f9	Build nuget pkg for ROCm (#16791 ) Add nuget pkg building and publishing for ROCm EP --------- Co-authored-by: Yi Zhang <zhanyi@microsoft.com>	2023-08-28 13:35:08 +08:00
Yifan Li	808215366d	Fix Multi GPU TensorRT tests (#17269 ) ### Description * Integrate `trt_multi_gpu` test stage in ORT post merge CI (Win-2xA10 vm) * Deprecate Linux MultiGPU TRT CI (This vm will be deprecated soon) * Add multi gpu support to existing C# test cases * Deprecate nonfunctional flag `--enable_multi_device_tests` ### Motivation and Context * Two reasons for replacing Linux MultiGPU TRT CI: * Flag `--enable_multi_device_tests` is not functional, so it cannot detect issues like #17036 * The Linux-2xM60 VM of this CI pool is about to be deprecated on 9/6/23, so this test needs to be enabled in another dual-GPU VM pool.	2023-08-25 20:30:45 -07:00
Arthur Islamov	c262879214	Added DML and CUDA provider support in onnxruntime-node (#16050 ) ### Description I've added changes to support CUDA and DML (only on Windows, on other platforms it will throw an error) ### Motivation and Context It fixes this feature request https://github.com/microsoft/onnxruntime/issues/14127 which is tracked here https://github.com/microsoft/onnxruntime/issues/14529 I was working on StableDiffusion implementation for node.js and it is very slow on CPU, so GPU support is essential. Here is a working demo with a patched and precompiled version https://github.com/dakenf/stable-diffusion-nodejs ---------	2023-08-25 16:57:06 -07:00
Yi Zhang	9cd33e07b4	Re-add tests in Windows GPU Reduced Ops workflow (#17294 ) ### Description Add a single test step to the Windows GPU Reduced Ops workflow. ### Motivation and Context The old workflow ran building and testing in one command. In PR #17263, the test step was removed by mistake, so this PR re-adds it. How to consolidate the test step is under consideration.	2023-08-25 15:56:59 +08:00
Yi Zhang	756eda2cc4	Windows CI build steps template (#17263 ) ### Description 1. New Windows CI build steps template. 2. Remove useless variables. ### Motivation and Context 1. Make it easier to apply the build cache to all Windows CIs. 2. Developers on other teams only need to take care of build options. ### Comparison Before: `9f21f694cf/tools/ci_build/github/azure-pipelines/win-gpu-tensorrt-ci-pipeline.yml (L19-L82)` After: `b4c1f2261b/tools/ci_build/github/azure-pipelines/win-gpu-tensorrt-ci-pipeline.yml (L35-L54)`	2023-08-25 05:58:49 +08:00
Jian Chen	33415b9da4	Removing 10.14 suffix from osx nuget package (#17277 )	2023-08-24 08:51:54 -07:00
cloudhan	87bef1f3f2	Move composable_kernel to deps.txt (#17245 )	2023-08-23 17:39:16 -07:00
Yi Zhang	61a79436e2	Common pre-build steps of Windows CI (#16970 ) ### Description Unify some pre-build common steps. ### Motivation and Context In the long run, other devs should only focus on build option and test commands. It would reduce mistakes and maintenance cost to use common template steps. There will be more PRs to achieve the goal.	2023-08-22 18:09:55 +08:00
cloudhan	4e6cec4d09	Update ck and enable test (#16383 ) Apply the fix in https://github.com/ROCmSoftwarePlatform/composable_kernel/issues/728 Introduce more kernel instances and allow the introduction of streamk and splitk.	2023-08-22 11:08:55 +08:00
Baiju Meswani	aae9a52e8b	Avoid pushing cpu package to https://download.onnxruntime.ai/ (#17238 )	2023-08-21 15:47:07 -07:00
Changming Sun	e2b6827a59	Add a CUDA 12.x pipeline and improve install_third_party_deps.ps1 (#17231 ) ### Description 1. Add a CUDA 12.x pipeline 2. Improve install_third_party_deps.ps1: avoid using Start-process. Directly call the command instead. ### Motivation and Context Since our official packages and all CI pipelines still use CUDA 11.x, we need extra pipelines to validate our source code level compatibility with CUDA 12.x. BTW for sure the prebuilt binaries in our release page are not compatible with CUDA 12.x. Do not report bugs for that. AB#15152	2023-08-21 13:04:36 -07:00
Chi Lo	9445539e2c	Update dependency for deps.txt (#17220 ) https://github.com/microsoft/onnxruntime/pull/17059 updates deps.txt and we also need to update cgmanifest.json and upload the files to Azure DevOps https://aiinfra.visualstudio.com/Lotus/_build/results?buildId=342803&view=results for testing	2023-08-19 00:43:25 -07:00
Edward Chen	d6cd41cfc1	[CoreML EP] Add Shape, Gather, and Slice ops (#17153 ) Add CoreML EP shape related ops: - Shape - Gather - Slice Add support for int64/int32 inputs in CoreML EP.	2023-08-18 22:34:34 -07:00
Yulong Wang	3426954525	disable browser stack tests (#17224 ) ### Description disable browser stack tests	2023-08-18 17:14:12 -07:00
Changming Sun	6db72165eb	Fix python packaging test pipeline (#17204 ) ### Description 1. Fix the python packaging test pipeline. There was an error in tools/ci_build/github/linux/run_python_tests.sh: it installed a released version of the onnxruntime python package from pypi.org to run the test, whereas it should pick the one from the current build. 2. Refactor the pipeline to allow choosing the cmake build type from the web UI when manually triggering a build. For now this feature is Linux only, because I don't want to change too much when we are about to cut a release branch. After that I will expand it to all platforms. This feature is useful for debugging pipeline issues. We may also consider having a nightly pipeline that runs all tests in Debug mode, which may catch extra bugs because in debug mode we can enforce range checks. Test run: https://aiinfra.visualstudio.com/Lotus/_build/results?buildId=342674&view=results ### Motivation and Context Currently the pipeline has a crash error. AB#18580	2023-08-18 14:51:26 -07:00
Adrian Lizarraga	6ee4be724b	Update LICENSE name in NuGet packaging pipelines (#17183 ) ### Description Updates NuGet packaging pipelines to use the correct license name. ### Motivation and Context The license name changed. See https://github.com/microsoft/onnxruntime/pull/17170 The QNN_Windows_Nuget and Zip-Nuget-* pipelines will not run without this update.	2023-08-17 22:22:19 -07:00
Changming Sun	0cccbcc47b	Move DML build job's Prefast task to a CPU machine pool (#17192 ) ### Description Move DML build job's Prefast task to a CPU machine pool which has larger memory. The current one runs out of memory in every run. ### Motivation and Context To fix the broken python packaging pipeline.	2023-08-17 13:16:29 -07:00
Jian Chen	e0022d061f	Set web-ci-pipeline.yml to be triggered only when related files are updated (#17148 ) - 'js/web' - 'js/node' - 'onnxruntime/core/providers/js'	2023-08-17 12:55:35 -07:00
Adrian Lizarraga	96b1ff610b	Add CI and PR validation triggers to QNN Windows x64 Pipeline yaml (#17178 ) ### Description Adds continuous integration and pull-request validation triggers directly to the yaml file for the Windows x64 QNN CI Pipeline. ### Motivation and Context There have been various unit test failures that break the QNN_Windows_Nuget pipeline, which builds QNN EP for Windows x64. This PR ensures that QNN EP is built and tested on a Windows x64 image for every pull request.	2023-08-16 11:44:54 -07:00
Jian Chen	8998b6811d	Fix NPM Packaging Pipeline (#17182 )	2023-08-15 22:56:38 -07:00
Adam Louly	c647e3e8ab	Run nightly pipeline tests from the commit id. (#17162 ) ### Description The onnxruntime-CI-nightly-ort-pipeline encounters occasional failures due to synchronization discrepancies between the ACPT nightly image and the repository. We are addressing this by executing tests using the commit ID associated with the ort build within the ACPT image. --------- Co-authored-by: Adam Louly <adamlouly@microsoft.com@orttrainingdev9.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>	2023-08-15 12:07:38 -07:00
Changming Sun	8e203efc69	Cleanup cmake file (#17154 ) ### Description 1. Clean up cmake files. Remove some unused code 2. Remove the "Semmle" task from tools/ci_build/github/azure-pipelines/templates/win-ci.yml. Semmle is deprecated and replaced by CodeQL.	2023-08-15 10:51:33 -07:00
Changming Sun	2a22325005	Explicitly set JDK version when building ORT java package (#17147 ) ### Description Explicitly set JDK version when building ORT java package. This is to fix an internal build error.	2023-08-15 10:36:05 -07:00
Adrian Lizarraga	b734db1924	[QNN EP] Fix CI build on Windows x64 pipelines (#17152 ) ### Description - Disables Resize tests that use nearest mode on QNN CPU. - Fixes indentation problems on yaml for win x64 qnn pipeline. ### Motivation and Context The QNN windows Nuget pipeline does not run due to failing unit tests on Windows x64. These tests should not be enabled until we determine the rounding behavior of QNN's ResizeNearestNeighbor operator.	2023-08-14 21:03:14 -07:00
Baiju Meswani	289600b47d	ONNX Runtime training cpu package name for ADO (#17109 )	2023-08-14 11:32:35 -07:00
PeixuanZuo	be2200c00b	[ROCm] fix python package pipeline (#17136 ) The ROCm python package pipeline failed because this PR (https://github.com/microsoft/onnxruntime/pull/16325) changed the onnx version to a commit, so we need to build onnx from source. A low protobuf version will cause build errors. This PR removes `cmake` and `protobuf` from the Dockerfile; these two will be installed by `install_os_deps.sh`.	2023-08-14 11:22:43 -07:00
Jian Chen	45f52987a2	Web CI Pipeline Isolation (#17005 )	2023-08-14 10:37:37 -07:00
Jian Chen	68ea9631af	Fix typo onnxruntimecpubuilpython (#17120 ) ### Description The correct name should be onnxruntimecpubuildpython Co-authored-by: Yulong Wang <7679871+fs-eire@users.noreply.github.com>	2023-08-14 08:34:43 -07:00
Changming Sun	4728f20f9a	Fix CI build (#17118 ) ### Description Some pipelines are failing because PR #16325 set the ONNX version to `rel-1.14.1`. That is a branch name, not a commit or tag name, which means that whenever the branch gets a new commit, we automatically pick it up and use it.	2023-08-11 10:56:38 -07:00
Edward Chen	e7e974b23f	Use double quotes so variable gets expanded. (#17105 )	2023-08-11 09:05:41 -07:00
Hector Li	344c41fdb9	[QNN EP] Update QNN to v2.13 (#17079 ) ### Description Update QNN SDK to v2.13, update some UTs accordingly	2023-08-10 20:47:55 -07:00
Yulong Wang	9cd4e5af68	[wasm] upgrade emsdk to 3.1.44 (#17069 ) ### Description This change upgrades emsdk to 3.1.44. Because the backend is upgraded to LLVM 16, we need to fix a lot of build failures caused by "-Wshorten-64-to-32". Most of the build failures come from the generated `onnx.pb.h`, and they can be fixed by including "core/graph/onnx_protobuf.h", which detects and ignores shorten-64-to-32 warnings.	2023-08-10 16:08:36 -07:00
Bowen Bao	6986981482	Bump ONNX version (#16325 ) ### Description Bump ONNX version to https://github.com/onnx/onnx/tree/rel-1.14.1 to include a fix for segfault when shape inferencing nested onnx functions. ### Motivation and Context Resolves #16170	2023-08-10 11:27:28 -07:00
PeixuanZuo	12837ba5c7	[ROCm] Update CI based on ubuntu 22.04 (#17076 ) - Update ROCm version to ROCm5.6 - Update CI based on ubuntu 22.04	2023-08-10 09:51:29 -07:00
RandySheriffH	a7542f48d6	Make AzureEP default for python and c# packaging (#17025 ) Make AzureEP default for python and c# packaging, with UT. --------- Co-authored-by: Randy Shuai <rashuai@microsoft.com>	2023-08-09 12:36:52 -07:00
Yulong Wang	56bced0581	[js/web] enable webgpu in browser unit test (#16310 ) ### Description enable webgpu in browser unit test. The CI pipeline uses Edge v113+ which enables WebGPU. === UPDATE on 08/07/2023: - add flags to Edge browser launch commandline so that Edge on CI agents can initialize WebGPU correctly. - ONLY enable webgpu on web release build. Other pipelines are using flag `-b=wasm,webgl,xnnpack` to specify the other 3 backends explicitly. - disable "Resize" related test failures. Once they are fixed the tests can be re-enabled. --------- Co-authored-by: Satya Jandhyala <satya.k.jandhyala@gmail.com>	2023-08-08 11:45:04 -07:00
Edward Chen	50719d2f8e	[iOS] Add script to get simulator device info. (#17012 ) Add script to get iOS simulator device info so we don't need to use hardcoded specifiers which may or may not refer to a valid simulator device. Add use-xcode-version step to a packaging pipeline so it uses a consistent version of Xcode.	2023-08-08 09:04:06 -07:00
Baiju Meswani	249917a093	Add mac and windows python packages for onnxruntime-training (#16993 )	2023-08-07 20:32:55 -07:00
Yifan Li	d6ce43db5e	[EP Perf] MemTest: Add Valgrind and fix addressSanitizer (#16930 ) ### Description 1. Add valgrind to existing ep_perf CI MemTest and parse ORT-TRT memLeak details 1. General Valgrind logs and logs related to ORT-TRT will be parsed in [CI artifacts](https://aiinfra.visualstudio.com/Lotus/_build/results?buildId=334122&view=artifacts&pathAsName=false&type=publishedArtifacts) 1. Logic: 1. Run valgrind with `onnxruntime-perf-test -e tensorrt` and export log to `valgrind.log` 2. Identify if any `definitely lost` memleak happened 1. For log paragraphs which show `definitely lost`, parse if they have keyword `TensorrtExecutionProvider`. 2. If so, extract these details to `ort_trt_memleak_detail.log`, and return `build failure` to EP Perf CI 3. Fix existing addressSanitizer and sync the squeezenet testcase with latest update from [ort-inference-example](https://github.com/microsoft/onnxruntime-inference-examples/blob/main/c_cxx/squeezenet/main.cpp) 1. Updates in short: Upgrade main.cpp to be using OrtTensorRTProviderOptionsV2 4. Reorder the 7-min-MemTest to be ahead of 9-hr-model-tests, and enable MemTest by default	2023-08-04 16:58:57 -07:00
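The MemTest logic in the commit above (find `definitely lost` paragraphs in `valgrind.log`, then keep those that mention `TensorrtExecutionProvider`) can be sketched roughly as follows. This is an illustrative sketch, not the script used by the EP Perf CI: the function names are hypothetical, and it assumes Valgrind's usual `==<pid>==` line prefix with a prefix-only line separating paragraphs.

```python
import re

# Assumption: every Valgrind log line starts with "==<pid>==".
PREFIX = re.compile(r"^==\d+==\s?")


def split_paragraphs(log_text):
    """Group log lines into paragraphs separated by prefix-only lines."""
    paragraphs, current = [], []
    for line in log_text.splitlines():
        body = PREFIX.sub("", line).rstrip()
        if body:
            current.append(body)
        elif current:
            paragraphs.append("\n".join(current))
            current = []
    if current:
        paragraphs.append("\n".join(current))
    return paragraphs


def find_trt_leaks(log_text, keyword="TensorrtExecutionProvider"):
    """Return 'definitely lost' paragraphs that mention the keyword."""
    return [
        paragraph
        for paragraph in split_paragraphs(log_text)
        if "definitely lost" in paragraph and keyword in paragraph
    ]


if __name__ == "__main__":
    # Hypothetical file names taken from the commit message.
    with open("valgrind.log", encoding="utf-8") as log_file:
        leaks = find_trt_leaks(log_file.read())
    if leaks:
        with open("ort_trt_memleak_detail.log", "w", encoding="utf-8") as out:
            out.write("\n\n".join(leaks))
        raise SystemExit(1)  # signal a build failure to the CI
```

A non-zero exit code is one plausible way to return a build failure to the pipeline; how the real CI reports it is not stated in the commit message.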
Yulong Wang	5af8774a0b	[build] do init and precheck first (#16961 ) ### Description This change allows Web CI to run some checks as the first step, so that if there are errors it won't launch the task to build web assembly, which is heavy. Checks include: - "npm ci" in /js, /js/common and /js/web. This implicitly includes: - typescript compiler in /js - typescript compiler in /js/common - webpack build in /js/common - typescript compiler in /js/web - ESLint on typescripts - clang-format formatter (.js, .ts, .cc, .h, .mm) - Prettier formatter (.json, .jsonc, .md) --------- Co-authored-by: Caroline Zhu <carolinezhu@microsoft.com@orttrainingdev7.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>	2023-08-04 16:44:45 -07:00


1599 commits