onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-05-16 21:00:14 +00:00

Author	SHA1	Message	Date
PeixuanZuo	e2526714e2	[ROCm] Move MIGraphX build step on CPU only machine (#16582 ) - Move MIGraphX build step on CPU only machine - Use ccache on build step - Not pass host uid into docker build process.	2023-07-05 13:55:28 +08:00
Wei-Sheng Chin	a0a5f57581	[DORT] Use new FX-to-ONNX exporter (#16450 ) The ONNX exporter in DORT have been moved to PyTorch as a formal feature. We therefore switch to consume the exporter from PyTorch instead of maintaining two duplicates.	2023-07-04 13:13:04 -07:00
pengwa	ac100ebb64	Fix orttraining-ortmodule-distributed CI (#16569 ) ### Fix orttraining-ortmodule-distributed CI https://pypi.org/project/pydantic/#history released version 2.0 1st July, Deepspeed has known issue on newer version of it (https://github.com/microsoft/DeepSpeed/issues/3280). So fix this by add similar check as DS did in https://github.com/microsoft/DeepSpeed/pull/3290	2023-07-03 13:18:59 +08:00
Yifan Li	e2c214d81f	[TensorRT EP] TRT 8.6 minor version update (#16475 ) ### Description * Minor version update: TRT 8.6.0.12->8.6.1.6 * CI pipeline ymls/dockerfiles are updated * cgmanifest.json/deps.txt/download-deps.yml are updated; Win trt binaries uploaded to [win img 307029](https://aiinfra.visualstudio.com/AI%20Infra%20Management/_build/results?buildId=307029&view=results) * Re-enable unit tests which were failed in 8.6.0 and re-gained support in 8.6.1	2023-06-26 10:44:27 -07:00
yf711	0ad0d6ebbf	Unblock Linux MultiGPU TensorRT CI (#16446 ) ### Description Revert docker base image to nvidia/cuda:11.8.0-cudnn8-devel-ubuntu20.04@sha256:b754c43fe9d62e88862d168c4ab9282618a376dbc54871467870366cacfa456e ### Motivation and Context The default img env of nvidia/cuda:11.8.0-cudnn8-devel-ubuntu20.04 has minor upgrade, which make Linux MultiGPU TensorRT CI (NV12 instance with Maxwell GPU) fail on three CApiTestGlobalThreadPoolsWithProvider tests (these three tests have higher error which are above the tolerance) That minor upgrade includes cudnn 8.7.0->8.9.0, which might be a factor that make maxwell GPU generator higher error. CIs with T4 GPU are not affected.	2023-06-21 17:15:39 -07:00
PeixuanZuo	a95f8ae53c	[ROCm] Update ROCm/MIGraphX CI pipeline (#16215 ) MIGraphX CI - Change docker container user name to `onnxruntimedev` ROCm CI - Build docker image every job instead of using prebuild image. - Every job create a container with only one GPU with command `docker run -it --device=/dev/kfd --device=/dev/dri/renderDxxx` - Remove tests that are unstable or use outdated interfaces. - Enable training ortmodule test.	2023-06-05 10:28:10 +08:00
Changming Sun	6b5b79872b	Avoid taking dependency on dl.fedoraproject.org (#16202 ) ### Description 1. Avoid taking dependency on dl.fedoraproject.org The website is not very stable. Our build pipelines often fail to fetch packages from there. 2. Update manylinux to the latest version	2023-06-02 07:41:46 -07:00
yf711	84f1af7ff5	ort build flag fix (#16072 ) ### Description * Sync and clean build flag `--use_tensorrt_builtin_parser` from existing CI config as this becomes default flag * cuda version update	2023-05-24 12:32:10 -07:00
RandySheriffH	d35361bf9d	Fix python pipeline for AzureEP without using root (#16023 ) Fix python pipeline for AzureEP without using root, this is for 1.15. --------- Co-authored-by: Randy Shuai <rashuai@microsoft.com>	2023-05-22 16:38:47 -07:00
PeixuanZuo	d78bbf5ef2	[ROCm] remove ROCm5.2.3, ROCm5.3, ROCm5.4 from pipeline (#16004 ) remove ROCm5.2.3, ROCm5.3, ROCm5.4 from pipeline.	2023-05-19 10:29:01 +08:00
PeixuanZuo	af6cb2af87	[ROCm] update ROCm/MIGraphX CI to ROCm5.5 (#15905 ) update ROCm/MIGraphX CI to ROC5.5. TODO: two PR to fix failure on orttraining/orttraining/test/python/orttraining_test_ortmodule_api.py - test_gradient_correctness_minmax/test_gradient_correctness_argmax_unfold/test_gradient_correctness_argmax_diagonal (https://github.com/microsoft/onnxruntime/pull/15903) - test_ortmodule_attribute_name_collision_warning (https://github.com/microsoft/onnxruntime/pull/15884)	2023-05-15 10:28:15 +08:00
Changming Sun	7c58d013aa	Remove Ubuntu 18.04 usages (#15781 ) ### Description Remove Ubuntu 18.04 usages because it will be EOL this month. ### Motivation and Context	2023-05-11 11:44:00 -07:00
liqun Fu	ac9ae9f7c5	update onnx release 1.14 for docker files (#15680 ) ### Description this is for ort 1.15 release to work with onnx 1.14 It shall be merged after onnx 1.14 release and before ort 1.15 release. ### Motivation and Context --------- Signed-off-by: Liqun Fu <liqfu@microsoft.com>	2023-05-10 13:15:56 -07:00
PeixuanZuo	41457885e0	[ROCm] add rocm5.5 to python package pipeline (#15820 ) add rocm5.5 to python packaging pipeline. https://aiinfra.visualstudio.com/Lotus/_build/results?buildId=306082&view=results TODO: Remove version 5.2.3, 5.3.2 and 5.4 in the next PR.	2023-05-06 10:21:15 +08:00
Changming Sun	5b826b1bc3	Update cmake version in Linux build (#15707 ) ### Description All our Windows build pipelines already uses cmake 3.26 except one pipeline: QNN ARM64. This PR does the same for Linux build pipelines. ### Motivation and Context This change is related to #15704 .	2023-04-27 20:02:33 -07:00
yf711	2e1f92a986	Fix EP Perf pipeline (#15507 ) ### Description * Update TensorRT 8.6 lib dependencies in dockerfile of TRT EP Perf pipeline * Avoid using `--allow_running_as_root` and build ORT with non-root user ### Motivation and Context To fix the build issue on EP perf pipeline Fixed [AB#14615]	2023-04-27 10:09:14 -07:00
yf711	28985c47b7	[TensorRT EP] Unleash opset16-17 onnx model tests (#15657 ) ### Description In 2021 we restricted onnx node test CI execution in range of opset 14-15 for ORT-TRT, which was the latest opset that TRT EP could support Update this range to opset 14-17 to improve the ORT-TRT unit test coverage, as [Nvidia announced that TRT 8.6 supported opset17](https://github.com/onnx/onnx-tensorrt/blob/main/docs/operators.md)	2023-04-26 11:44:19 -07:00
yf711	d701dcd027	Fix Linux MultiGPU TensorRT CI (#15697 ) ### Description * Reverting default TensorRT version to 8.5 as temporary fix * Apart from that, this PR temporarily leaves this CI as a place to validate user behavior that uses TRT 8.5 with latest ORT ### Context * This CI pool equips 2xTesla M60 GPUs, which are no longer supported by TensorRT 8.6. * Currently, other CIs are using single-T4 VM but there's no VM with 2xT4 or other suitable dualGPU in the range. * Once we decide which VM instance for this CI to migrate to, TRT8.6 can be enabled on this CI * According to [Nvidia](https://docs.nvidia.com/deeplearning/tensorrt/release-notes/index.html): * TensorRT 8.5.3 was the last release supporting NVIDIA Kepler (SM 3.x) and NVIDIA Maxwell (SM 5.x) devices. These devices are no longer supported in TensorRT 8.6. NVIDIA Pascal (SM 6.x) devices are deprecated in TensorRT 8.6.	2023-04-26 10:01:33 -07:00
Baiju Meswani	11b0a18de6	Add support for cuda 11.8 and python 3.11 for training (#15548 )	2023-04-20 12:56:45 -07:00
Yi Zhang	4e1f75810c	Add compilation cache in 2 Linux CPU pipelines and refactor the Linux build step with cache (#15484 ) ### Description 1. Add compilation cache in Linux CPU ARM and Linux Minimal Build. 2. Integrate 4 Linux CPU build step with cache into one. 3. install ccache from source code in Linux ARM64 image. ### Motivation and Context 1. Enable more build steps with compilation cache. 2. Make it easier to add cache. It could save 40 more minutes of compilation time in Linux ARM64. https://dev.azure.com/onnxruntime/onnxruntime/_build/results?buildId=959619&view=logs&j=1e0830bb-fd74-5d0a-5029-1c63b4266d7b&t=75260ed7-7566-5947-2095-566660191920	2023-04-14 23:56:59 +08:00
yf711	8cd5f3ad9c	[TensorRT EP] support TensorRT 8.6-EA (#15299 ) ### Description <!-- Describe your changes. --> * Integrate TRT 8.6EA on relevant Linux/Windows/pkg pipelines * Update onnx-tensorrt to 8.6 * Add new dockerfiles for TRT 8.6 and clean old ones * Update [CGManifest](https://github.com/microsoft/onnxruntime/tree/main/cgmanifests) files and ort build deps version * yml/script update * Enable built-in TRT parser option on TRT related pipelines by default * Exclude test TopKOperator.Top3ExplicitAxisInfinity out of TRT EP tests (8.6-EA has issue with topk operator)	2023-04-12 11:34:59 -07:00
Jian Chen	af28754e6f	Update python package pipeline to support 3.11 (#15311 ) ### Description Update python package pipeline to support 3.11 ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->	2023-04-04 10:55:32 -07:00
PeixuanZuo	d80859f63d	[ROCm] fix python packaging pipeline and add python10 (#15282 ) rocm python packaging pipeline failed because manylinux version and manylinux.patch update. 1. fix duplicate `epel-release` installation issue, ROCm pipeline install it at the begin of the dockerfile to install rocm libs. remove duplicate installation on install-runtime-packages.sh. ``` /var/tmp/yum-root-sMRl36/epel-release-latest-7.noarch.rpm: does not update installed package. Error: Nothing to do ``` 2. add python10 to fix error below. ``` + /opt/python/cp310-cp310/bin/python -m venv /opt/_internal/tools build_scripts/finalize.sh: line 40: /opt/python/cp310-cp310/bin/python: No such file or directory ``` 3. add python10 to rocm pipeline. pipeline link: https://aiinfra.visualstudio.com/Lotus/_build/results?buildId=294776&view=results	2023-03-31 10:25:21 +08:00
Changming Sun	15f7dca9fb	Update protobuf to 3.21.x (#15245 ) ### Description Fixed [AB#10092](https://aiinfra.visualstudio.com/6a833879-cd9b-44a4-a9de-adc2d818f13c/_workitems/edit/10092), [AB#11753](https://aiinfra.visualstudio.com/6a833879-cd9b-44a4-a9de-adc2d818f13c/_workitems/edit/11753), [AB#11759](https://aiinfra.visualstudio.com/6a833879-cd9b-44a4-a9de-adc2d818f13c/_workitems/edit/11759) ### Motivation and Context The one we use has a security issue in Java, though we don't use that version's protobuf java package.	2023-03-29 14:08:18 -07:00
Jian Chen	85948d6bc6	Cjian/windows update python3.11 (#15243 ) ### Description windows update python3.11 ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> --------- Co-authored-by: Ubuntu <chasun@chasunlinux.lw3b1xzoyrkuzm34swpscft0ff.dx.internal.cloudapp.net>	2023-03-28 22:15:47 -07:00
PeixuanZuo	62b2947ac1	[ROCm] remove python3.7 from python packaging pipeline (#15230 ) remove python3.7 from python packaging pipeline. https://aiinfra.visualstudio.com/Lotus/_build/results?buildId=289720&view=results	2023-03-28 10:37:04 +08:00
Jian Chen	792d411135	Update python 3.11 and remove 3.7 for Linux (#15214 ) ### Description Update python 3.11 and remove 3.7 ### Motivation and Context Update python 3.11 and remove 3.7 --------- Co-authored-by: Ubuntu <chasun@chasunlinux.lw3b1xzoyrkuzm34swpscft0ff.dx.internal.cloudapp.net>	2023-03-27 14:46:30 -07:00
Changming Sun	ffcfb1ec98	Remove protobuf submodule (#15190 ) ### Description Remove protobuf submodule as a follow-up of #13523 "Android CI Pipeline" and "Zip-Nuget-Java-Nodejs Packaging Pipeline" need to be tested. ### Motivation and Context It is related to [AB#11753](https://aiinfra.visualstudio.com/6a833879-cd9b-44a4-a9de-adc2d818f13c/_workitems/edit/11753) Fixed [AB#14027](https://aiinfra.visualstudio.com/6a833879-cd9b-44a4-a9de-adc2d818f13c/_workitems/edit/14027)	2023-03-27 10:35:49 -07:00
Edward Chen	bd142bfb04	Gradle clean up (#14973 ) - Use java/gradlew directly in .github/workflows/publish-java-apidocs.yml. - Remove use of deleted step from tools/ci_build/github/azure-pipelines/android-arm64-v8a-QNN-crosscompile-ci-pipeline.yml. - Remove Gradle installations and PATH updates from Dockerfiles and scripts. Now Gradle wrapper is used so a system Gradle installation is not needed.	2023-03-10 10:50:32 -08:00
zhijiang	80e25ad6ac	fix cg issue (#14372 ) ### Description tensorboard depends on rsa>=3.1.4, while rsa 4.5 has vuln issue, so pin it to higher version as suggested Fixed [AB#7352](https://aiinfra.visualstudio.com/6a833879-cd9b-44a4-a9de-adc2d818f13c/_workitems/edit/7352) ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->	2023-03-09 15:28:11 +08:00
Chun-Wei Chen	70a31e047a	Consume ONNX 1.13.1 in ONNX Runtime (#14812 ) ### Description <!-- Describe your changes. --> Consume ONNX 1.13.1 in ONNX Runtime. (ONNX 1.13.0 to ONNX 1.13.1) ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> ONNX 1.13.1 patch was just released yesterday. This PR is making ORT's ONNX submodule consistent with the latest released ONNX. Not sure whether this PR is really needed, but let me make it ready. Previous PR for testing ONNX 1.13.1rc2 : https://github.com/microsoft/onnxruntime/pull/14634. Fixed [AB#13174](https://aiinfra.visualstudio.com/6a833879-cd9b-44a4-a9de-adc2d818f13c/_workitems/edit/13174) .	2023-03-02 14:57:35 -08:00
Baiju Meswani	68a402e739	Add support for python 3.10 for onnxruntime-training cuda and cpu (#14100 )	2023-02-02 11:32:41 -08:00
Baiju Meswani	7954976e0a	Fix python packaging pipeline (#14533 ) fix onnx and protobuf inconsistencies in python packaging pipeline.	2023-02-02 13:11:18 +08:00
Yi Zhang	80f807c03d	upgrade protobuf to 3.20.2 and onnx to 1.13 (#14279 ) ### Description upgrade protobuf to 3.20.2, same as onnx 1.13.0 ### Motivation and Context Per component governance requirement and Fixes #14060 unused-parameter error occurs in 2 conditions. 1. compile protolbuf `onnxruntime_src/cmake/external/protobuf/src/google/protobuf/repeated_ptr_field.h:752:66: error: unused parameter ‘prototype’ [-Werror=unused-parameter]` 2. include onnx_pb.h ``` 2023-01-28T10:20:15.0410853Z FAILED: CMakeFiles/onnxruntime_pybind11_state.dir/onnxruntime_src/onnxruntime/python/onnxruntime_pybind_iobinding.cc.o ...... 2023-01-28T10:20:15.0466024Z from /build/Debug/_deps/onnx-src/onnx/onnx_pb.h:51, 2023-01-28T10:20:15.0466958Z from /onnxruntime_src/include/onnxruntime/core/framework/to_tensor_proto_element_type.h:10, .... 2023-01-28T10:20:15.0609678Z /build/Debug/_deps/onnx-build/onnx/onnx-operators-ml.pb.h:1178:25: required from here 2023-01-28T10:20:15.0610895Z /onnxruntime_src/cmake/external/protobuf/src/google/protobuf/repeated_ptr_field.h:752:66: error: unused parameter ‘prototype’ [-Werror=unused-parameter] 2023-01-28T10:20:15.0611707Z cc1plus: all warnings being treated as errors ``` https://dev.azure.com/onnxruntime/2a773b67-e88b-4c7f-9fc0-87d31fea8ef2/_apis/build/builds/874605/logs/22	2023-01-31 12:55:09 -08:00
Yi Zhang	6d60dc24fe	install shared deps script (#14234 ) ### Description Add a new install_shared_deps.sh ### Motivation and Context Azcopy, Ninja, Node.js and CCache are all needed, but they are copied everywhere.	2023-01-16 18:27:29 +08:00
sfatimar	7654cd50e8	Openvino ep 2022.3 v4.3 (#14210 ) ### Description Changes to incorporate OpenVINO EP 2022.3 ### Motivation and Context This change is required to incorportate OpenVINO EP 2022.3 - If it fixes an open issue, please link to the issue here. --> Co-authored-by: mohsinmx <mohsinx.mohammad@intel.com> Co-authored-by: Preetha Veeramalai <preetha.veeramalai@intel.com> Co-authored-by: Aravind <aravindx.gunda@intel.com> Co-authored-by: mayavijx <mayax.vijayan@intel.com> Co-authored-by: flexci <mohsinmx>	2023-01-11 16:31:26 -08:00
PeixuanZuo	33367fa2dc	[MIGraphX] update the MIGraphX version used in ORT to rocm-5.4.0 (#14184 ) ### Description Update the MIGraphX version used in ORT to rocm-5.4.0 ### Motivation and Context The previous branch migraphx_for_ort has stopped updating, it is too far away from the MIgraphX latest release branch. More discussion here: https://github.com/microsoft/onnxruntime/issues/14126#issuecomment-1373201049 Co-authored-by: peixuanzuo <peixuanzuo@linmif39a000004.zvflicr54joexhdgnhvmxrxygg.phxx.internal.cloudapp.net>	2023-01-10 13:40:25 +08:00
Baiju Meswani	c6ff5bac9d	Update torch in eager mode CI pipeline (#14094 )	2023-01-06 11:46:44 -08:00
Ashwini Khade	e5e3570ac5	fix cg issue (#14112 ) ### Description Update torch version to 1.13.1 to fix CG issue: https://dev.azure.com/aiinfra/ONNX%20Runtime/_workitems/edit/10666/	2023-01-04 09:07:13 -08:00
Baiju Meswani	0ff61f7b97	Update torch to 1.13.1 in CI and packaging pipelines for ort training (#14055 )	2023-01-03 20:03:33 -08:00
Baiju Meswani	b85878953f	Fix nightly ort training ci pipeline (#14007 )	2022-12-30 12:28:57 -08:00
PeixuanZuo	ab2dd8dfaf	[ROCm] Update ROCm and MigraphX CI to ROCm5.4 (#14011 ) Update ROCm and MigraphX CI to ROCm5.4 Run ortmodule_test with ROCm5.4 and all passed(https://dev.azure.com/onnxruntime/onnxruntime/_build/results?buildId=824742&view=logs&j=8292f886-7946-5da9-7977-04484c342eda&t=5de68eaa-cbdc-5be5-13d0-bb946f4ddb2d). Co-authored-by: peixuanzuo <peixuanzuo@linmif39a000004.zvflicr54joexhdgnhvmxrxygg.phxx.internal.cloudapp.net>	2022-12-22 10:01:05 +08:00
Chi Lo	ba89cae3bd	Update package pipelines to support TRT 8.5 (#13998 ) Update following package pipelines to support TRT 8.5 after https://github.com/microsoft/onnxruntime/pull/13867: - [Linux Multi GPU TensorRT CI Pipeline](https://aiinfra.visualstudio.com/Lotus/_build?definitionId=1016&_a=summary) - [Python packaging pipeline](https://aiinfra.visualstudio.com/Lotus/_build?definitionId=841&_a=summary) - [build-perf-test-binaries](https://aiinfra.visualstudio.com/Lotus/_build?definitionId=1130&_a=summary) - [Linux-GPU-EP-Perf](https://aiinfra.visualstudio.com/Lotus/_build?definitionId=841&_a=summary)	2022-12-16 15:01:50 -08:00
Yi Zhang	7d20d889d1	Use cache for compilation in container (#13960 ) ### Description For compilation in container, ADO Cache task doesn't work directly. The workaround is to mount the cache directory to the container, and let CCache in container to read/write cache data. In short, we just leverage ADO API to download/upload cache data. The Post-jobs works in stack-mode, So the PostBuildCleanUp Tasks should be defined first. Thus, The PostBuildCleanUp would be executed lastly. Else, Cache Task would fail to upload cache because the Agent Directory is cleaned.	2022-12-16 07:19:07 +08:00
Chi Lo	5b492cbae3	[TensorRT EP] support TensorRT 8.5 (#13867 ) Integrate TensorRT 8.5 - Update TensorRT EP to support TensorRT 8.5 - Update relevant CI pipelines - Disable known non-supported ops for TensorRT - Make timeout configurable. We observe more than [20 hours](https://aiinfra.visualstudio.com/Lotus/_build/results?buildId=256729&view=logs&j=71ce39d8-054f-502a-dcd0-e89fa9931f40) of running unit tests with TensorRT 8.5 in package pipelines. Because we can't use placeholder to significantly reduce testing time (c-api application test will deadlock) in package pipelines, we only run subsets of model tests and unit tests that are related to TRT (add new build flag--test_all_timeout and set it to 72000 seconds by package pipelines). Just to remember, we still run all the tests in TensorRT CI pipelines to have full test coverage. - include https://github.com/microsoft/onnxruntime/pull/13918 to fix onnx-tensorrt compile error. Co-authored-by: George Wu <jywu@microsoft.com>	2022-12-14 13:06:03 -08:00
Edward Chen	b4dd5dda12	Revert "Update protobuf version to 3.18.3 in tools/ci_build/github/linux/docker/scripts/requirements.txt." (#13963 ) Reverts microsoft/onnxruntime#13922	2022-12-13 18:15:06 -08:00
Edward Chen	b23395f977	Update protobuf version to 3.18.3 in tools/ci_build/github/linux/docker/scripts/requirements.txt. (#13922 ) ### Description <!-- Describe your changes. --> Update protobuf version to 3.18.3 in tools/ci_build/github/linux/docker/scripts/requirements.txt. ### Motivation and Context Address component governance alert CVE-2022-1941	2022-12-12 12:38:27 -08:00
Adrian Lizarraga	db9c677b63	[EP Perf Dashboard] Add TensorRT 8.5.1.1 dockerfile (#13843 ) ### Description - Adds a dockerfile for Ubuntu with TensorRT 8.5.1.1. - Adds option to run EP Perf pipeline with TensorRT 8.5 ### Motivation and Context Necessary to benchmark models with TensorRT 8.5	2022-12-09 14:33:52 -08:00
shalvamist	d22be84add	Pin packaging to version 21.3 to address training pipeline failures	2022-12-09 09:05:55 -08:00
PeixuanZuo	7694b695a9	[ROCm] Simplify ROCm manylinux dockerfile (#13873 ) ### Description <!-- Describe your changes. --> 1. Remove ROCm5.3 pipeline because it has rocblas bug, we don't need it. 2. We removed the dependency on centos docker image provided by AMD(https://hub.docker.com/r/rocm/dev-centos-7) and build ROCm centos base image by ourselves. The reference dockerfile(https://github.com/RadeonOpenCompute/ROCm-docker/blob/master/dev/Dockerfile-centos-7) is very redundant for our need. We simplified the ROCm manylinux dockerfile. 3. Different versions of rocm use the same dockerfile `Dockerfile.manylinux2014_rocm`. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> Co-authored-by: peixuanzuo <peixuanzuo@linmif39a000004.zvflicr54joexhdgnhvmxrxygg.phxx.internal.cloudapp.net>	2022-12-08 09:18:27 +08:00

1 2 3 4 5 ...

410 commits