onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-02 03:55:34 +00:00

Author	SHA1	Message	Date
PeixuanZuo	665fb346ab	[ROCm] set parallel=16 when build on ROCm CI (#13368 ) ### Description <!-- Describe your changes. --> ROCm CI build step takes more than one hour. Set parallel=16 when build on ROCm CI to reduce build time. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> Co-authored-by: peixuanzuo <peixuanzuo@linmif39a000004.zvflicr54joexhdgnhvmxrxygg.phxx.internal.cloudapp.net>	2022-10-20 11:36:00 +08:00
Adrian Lizarraga	418304743d	[EP-Perf-Dashboard] Update table schemas (#13327 ) Updates EP perf benchmarking scripts to upload new data with an improved table schema. In order to preserve compatibility with the current benchmarking pipeline, we still upload data that uses the old schema as well. These changes are required in order to improve data filtering capabilities and general UX in dashboards that visualize this data. Details: - EP names no longer hardcoded as columns for tables that store inference latency, session creation times, memory usage, and model/EP status. - Add explicit branch, commit ID, and commit date columns to all tables - Improvements to the docker image building scripts (simplify docker image build; support installing binary TensorRT packages) - Remove use of deprecated DataFrame.append in favor of pandas.concat.	2022-10-19 16:15:05 -07:00
Edward Chen	2fa18ea77e	[React Native CI] Record more info to debug E2E test (#13329 ) Record more info from the React Native CI E2E test. In particular, log the view hierarchy when exiting the test and dump logs from Android emulator to the build output.	2022-10-18 17:21:28 -07:00
Adam Louly	61ee5585b2	update the nightly build to use the latest ptca image. (#13309 ) ### Description updating the ptca image used in the nightly pipeline Co-authored-by: Adam Louly <adamlouly@microsoft.com@orttrainingdev7.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>	2022-10-17 14:12:03 -07:00
PeixuanZuo	b4853a978a	[ROCm] add rocm python package pipeline with --use_rocm_profiling (#13068 ) ### Description <!-- Describe your changes. --> ROCm developers always need to build onnxruntime whl with `--enable_rocm_profiling`. Add a ROCm dev python package pipeline which product .whl with build args `--enable_rocm_profiling`. The dev *whl need to upload to azure storage and can get from https://download.onnxruntime.ai/onnxruntime_nightly_rocm53.profiling.html ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->	2022-10-17 10:11:20 +08:00
Wei-Sheng Chin	dc324b1d90	[LazyTensor] Make LORT Build Again with Latest PyTorch (#13303 ) `python setup.py develop` doesn't install PyTorch as a normal package in site-packages anymore, and the user must stay at PyTorch's root directory to call `import torch`. This will break LORT tests because LORT tests contains `import torch` and are called outside PyTorch root directory. To make PyTorch a normal package again, this PR build PyTorch with `python setup.py install`.	2022-10-13 13:56:17 -07:00
PeixuanZuo	6895918b1c	[ROCm] Revert CI pipeline to ROCm5.2.3 (#13297 ) ### Description <!-- Describe your changes. --> Unit test with ROCm5.3 slower than ROCm5.2.3. Revert to ROCm5.2.3. We will update to ROCm5.3 when the issue resloved by AMD. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->	2022-10-12 10:47:33 -07:00
Edward Chen	9422438782	Objective-C static analysis - use different llvm path to try to find clang-tidy. (#13280 ) Use different llvm path to try to find clang-tidy. Sometimes the build fails because it can't find clang-tidy. Hopefully this path works better.	2022-10-12 10:16:26 -07:00
Yi Zhang	67bde18d0d	Update Win_GPU_CI trigger (#13290 ) ### Description supplement of #13248 Add PR trigger https://learn.microsoft.com/en-us/azure/devops/pipelines/repos/github?view=azure-devops&tabs=yaml#pr-triggers fix: master -> main Testted with #13289 #13292 NB: the real pipeline is always triggered if the workflow yaml changed even it's added in the path filter. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> Make sure the real pipeline not run in the backend.	2022-10-12 15:22:42 +08:00
PeixuanZuo	b2353fa737	[ROCm] Add ROCm5.3 to python package pipeline (#13249 ) ### Description <!-- Describe your changes. --> 1. Remove ROCm5.1.1 and ROCm5.2 from ROCm python package pipeline 2. Add ROCm5.3 to ROCm python package pipeline pipeline: https://aiinfra.visualstudio.com/Lotus/_build/results?buildId=237172&view=results ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->	2022-10-12 07:23:42 +08:00
Yi Zhang	6b499db7e1	increase ios pipeline timeout limit (#13268 ) ### Description <!-- Describe your changes. --> ### Motivation and Context The timeout issues increased	2022-10-11 14:07:04 +08:00
Yi Zhang	ea128cdb18	skip windows GPU check if changes only in doc (#13248 ) ### Description Use Path filter and fake workflow to skip windows GPU check if there's only changes in doc. Refs: https://docs.github.com/en/repositories/configuring-branches-and-merges-in-your-repository/defining-the-mergeability-of-pull-requests/troubleshooting-required-status-checks#handling-skipped-but-required-checks The fake github yaml is generated by code. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> ###verifications:### In this PR: since the win-gpu-ci-pipeline.yml and .github are updated, so the real Windows GPU workflows are always triggered. in #13256 To avoid update win-gpu-ci-pipleline.yml, I added the path filter in devops page. the fake win GPU workflows triggered, and the real workflows are skipped.	2022-10-11 13:51:44 +08:00
PeixuanZuo	4d25b9c8f0	[ROCm] Update ROCm and MIGraphX CI pipeline to ROCm5.3 (#13257 ) ### Description <!-- Describe your changes. --> 1. Update ROCm pipeline and MIGraphX pipeline to ROCm5.3 ROCm pipeline run ortmodule test one time and disable it : https://dev.azure.com/onnxruntime/onnxruntime/_build/results?buildId=777794&view=logs&j=48b14a85-ff1a-5ca4-53fa-8ea420d27feb&t=9c199f35-fc50-565d-6c65-5162c9bb1b04 2. Add `workspace: clean: all `. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->	2022-10-11 13:47:22 +08:00
Edward Chen	00146b2541	Add onnxruntime_BUILD_UNIT_TESTS=OFF definition to iOS package build options. (#13238 ) Add onnxruntime_BUILD_UNIT_TESTS=OFF definition to iOS package build options. The `--skip_tests` option is already specified.	2022-10-10 18:00:17 -07:00
Edward Chen	d411bd277e	Increase iOS packaging pipeline timeout. (#13233 ) Increase iOS packaging pipeline timeout to 300 minutes.	2022-10-07 14:49:16 -07:00
Jian Chen	6662ece4a1	increase timeout to 5 hours (#13226 ) ### Description Increase MacOS pipeline timeout to 5 hours ### Motivation and Context It blocks Release pipeline	2022-10-07 13:02:48 -04:00
cloudhan	51ac6617f5	Fix warnings and enable dev mode for ROCm CI (#13223 ) Fix warnings and enable dev mode for ROCm CI: * Fix ROCm headers complaining "This file is deprecated. Use the header file from ..." * Disable warning signed and unsigned compare for kernel explorer * Fix unused and nondiscard warnings * Enable dev mode for ROCm CI * Walkaround error "unknown warning option '-Wno-nonnull-compare'" in kernel explorer by using '-Wno-unknown-warning-option' to ignore the unknown option * Fix error "unused parameter 'mask'" * Fix warning "instantiation of variable 'onnxruntime::rocm::Consts<float>::One' required here, but no definition is available", etc. Fixed by using C++17's inline (implied by constexpr) static initialization. * Remove unused variable * Add the missing `override` specifier	2022-10-07 09:45:01 +08:00
Edward Chen	4e37464cc5	Add build configuration to binary size checks pipeline. (#13208 ) Add another build configuration to binary size checks pipeline. Enable additional configurations to be added more easily.	2022-10-05 12:39:19 -07:00
cloudhan	72076b1eb2	Update ROCm CI to use HIP LANGUAGE (#13214 ) Update for ROCm CI before reland tunable GEMM #12853. This PR also update composable kernel to use CMakes's HIP language support so that we can mix C/C++ compiler with HIP compiler instead of locking to hip-clang	2022-10-05 16:15:16 +08:00
Yulong Wang	82786baed1	[js/web] add 'xnnpack' to EP list (#12723 ) Description: This PR adds support for "XNNPACK EP" in ORTWeb and changes the behavior of how ORTWeb deals with "backends", or "EPs" in API. Background: Term "backend" is introduced in ONNX.js to representing a TypeScript type which implements a "backend" interface, which is a similar but different concept to ORT's EP (execution provider). There was 3 backends in ONNX.js: "cpu", "wasm" and "webgl". When ORT Web is launched, the concept is derived to help users to integrate smoothly. Technically, when "wasm" backend is used, users need to also specify "EP" in the session options. Considering it may get complicated and confused for users to figure out the difference between "backend" and "EP", the JS API hide the "backend" concept and made a mapping between names, backends and EPs: "webgl" (Name) <==> "onnxjsBackend" (Backend) "wasm" (Name) <==> "wasmBackend" (Backend) <==> "CPU" (EP) Details: The following changes are applied in this PR: 1. allow multi-registration for backends using the same name. This is for use scenarios where both "onnxruntime-node" and "onnxruntime-web" are consumed in a Node.js App ( so "cpu" will be registered twice in this scenario. ) 2. re-assign priority values to backends. I give 100 as base to "cpu" for node and react_native, and 10 as base to "cpu" in web. 3. add "cpu", "xnnpack" as new names of backends. 4. update onnxruntime wasm exported functions to support EP registration. 5. update implementations in ort web to handle execution providers in session options. 6. add '--use_xnnpack' as default build flag for ort-web	2022-10-03 10:38:45 -07:00
Baiju Meswani	0cf17b1921	Add linux debug training package to nightly pipeline (#13192 )	2022-10-01 06:58:43 -07:00
Yulong Wang	054464dce2	fix XNNPACK on WebAssembly SIMD (#13161 ) ### Description fix XNNPACK on WebAssembly SIMD. Flag "-msimd128" need to be applied to every source file when compiling WASM SIMD. Currently only a part of the source files are compiled with this flag so we get inconsistent result for `sizeof(xnn_f32_minmax_params)` because the type definition include a `#ifdef` for `__wasm_simd128__`. The inconsistency causes writing garbage data to a stack variable and eventually cause the crash. XNNPACK libraries are C libraries so need to apply the build flags not only to `CMAKE_CXX_FLAGS` but also to `CMAKE_C_FLAGS`.	2022-09-30 16:34:15 -07:00
Changming Sun	5f1bc8ff56	Add "--parallel" to the build flags of WASM pipeline (#13179 )	2022-09-30 06:54:39 -07:00
Yi Zhang	a862b0cad1	increase ios_CI_coreml stage timeout limit (#13157 ) ### Description As titile ### Motivation and Context Recently, it became more frequently that the workflow canceled due to timeout.	2022-09-30 14:45:14 +08:00
PeixuanZuo	3157cdb19a	[ROCm] Fix MIGraphX ciagent user Permissions issues (#13137 ) ### Description <!-- Describe your changes. --> fix migraphx ci pipeline failed problem. Disabled MIGraphX pipeline now. It will be Enabled when this PR merge. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->	2022-09-29 10:25:02 +08:00
Baiju Meswani	5182d6610d	Upgrade pytorch to 1.12.1 for training pipelines (#13128 )	2022-09-28 17:59:49 -07:00
sfatimar	c9a86fa27f	Openvino GPU Unit/Python Tests fix failure (#13122 ) ### Description We fix iGPU Unit and Python tests with this PR We add packaging pip pkg to build Many Linux DockerFile ### Motivation and Context This change is required to make sure iGPU Unit Test/Python Tests with OV are fixed - If it fixes an open issue, please link to the issue here. --> Co-authored-by: shamaksx <shamax.kshirsagar@intel.com> Co-authored-by: mayavijx <mayax.vijayan@intel.com> Co-authored-by: pratiksha <pratikshax.bapusaheb.vanse@intel.com> Co-authored-by: pratiksha <mohsinx.mohammad@intel.com> Co-authored-by: Sahar Fatima <sfatima.3001@gmail.com> Co-authored-by: Preetha Veeramalai <preetha.veeramalai@intel.com> Co-authored-by: nmaajidk <n.maajid.khan@intel.com> Co-authored-by: Mateusz Tabaka <mateusz.tabaka@intel.com>	2022-09-28 16:00:06 -07:00
Edward Chen	55ae71c160	Reduce Objective-C static analysis build time. (#13149 )	2022-09-28 15:49:48 -07:00
PeixuanZuo	5e4ebbd9d9	[ROCm] add MIGraphX ci pipeline (#11569 ) Description: Describe your changes. Add migraphx ci pipeline, test build and unit tests. This PR is based on #11492 Pipeline : https://dev.azure.com/onnxruntime/onnxruntime/_build/results?buildId=765711&view=results	2022-09-28 10:59:30 +08:00
Baiju Meswani	f99d00fa38	Add rel* branches to upload training packages to final storage (#13124 )	2022-09-27 17:20:17 -07:00
leqiao-1	43766ee36d	Fix OLive build pipeline (#13114 )	2022-09-27 10:19:58 -07:00
RandySheriffH	237ccc01c7	Remove one last nuphar reference (#13111 ) Remove one last nuphar reference.	2022-09-26 23:02:36 -07:00
RandySheriffH	77a066c700	Drop nuphar from java API (#13107 ) Drop nuphar from: - java API - tvm.cmake - run_build.sh	2022-09-26 17:06:08 -07:00
Edward Chen	b62ba0b5a7	Remove old enable_linux_gpu_tests parameter from template invocation. (#13102 ) Remove old enable_linux_gpu_tests parameter from template invocation in build-perf-test-binaries-pipeline.yml.	2022-09-26 16:27:40 -07:00
RandySheriffH	a83a9ed6b0	Remove miscellaneous nuphar configs (#13070 ) Remove a handful of nuphar related configurations after deprecation. Co-authored-by: Randy Shuai <rashuai@microsoft.com>	2022-09-26 13:41:28 -07:00
Changming Sun	7116825aef	Add CMAKE_CUDA_ARCHITECTURES list to python packaging pipeline (#13081 )	2022-09-26 10:22:43 -07:00
mayavijx	ade0d29174	Updated Dockerfile.ubuntu_openvino with OV 2022.2 official release (#13069 ) Updated Dockerfile.ubuntu_openvino to use OV 2022.2 official release which was using pre release only.	2022-09-26 00:15:52 -07:00
dependabot[bot]	6587a85f8f	Bump protobuf from 3.18.1 to 3.18.3 in /tools/ci_build/github/linux/tvm Bumps [protobuf](https://github.com/protocolbuffers/protobuf) from 3.18.1 to 3.18.3. - [Release notes](https://github.com/protocolbuffers/protobuf/releases) - [Changelog](https://github.com/protocolbuffers/protobuf/blob/main/generate_changelog.py) - [Commits](https://github.com/protocolbuffers/protobuf/compare/v3.18.1...v3.18.3) --- updated-dependencies: - dependency-name: protobuf dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com>	2022-09-24 21:12:16 -07:00
dependabot[bot]	c1ff4b468d	Bump protobuf in /tools/ci_build/github/linux/docker/scripts/manylinux Bumps [protobuf](https://github.com/protocolbuffers/protobuf) from 3.18.1 to 3.18.3. - [Release notes](https://github.com/protocolbuffers/protobuf/releases) - [Changelog](https://github.com/protocolbuffers/protobuf/blob/main/generate_changelog.py) - [Commits](https://github.com/protocolbuffers/protobuf/compare/v3.18.1...v3.18.3) --- updated-dependencies: - dependency-name: protobuf dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com>	2022-09-24 15:21:50 -07:00
dependabot[bot]	63c3b21902	Bump protobuf from 3.18.1 to 3.18.3 in /tools/ci_build/github/linux/docker/inference/x64/python/cpu/scripts (#13080 )	2022-09-23 22:15:36 -07:00
Changming Sun	9e21ffb649	Add license header to some files. (#13074 )	2022-09-23 18:46:02 -07:00
Baiju Meswani	8bb16ab900	Propagate environment variable to docker image (#13031 )	2022-09-23 11:23:49 -07:00
Changming Sun	eafd67b8fd	Update CUDA version to 11.6 and refactor python packaging pipeline (#13002 ) 1. Update CUDA version from 11.4 to 11.6. 2. Update Manylinux version 3. Upgrade GCC version from 10 to 11 for most x86_64 pipelines. CentOS 7 ARM64 doesn't have GCC 11 yet. 4. Refactor python packaging pipeline: a. Split Linux GPU build job to two parts, build and test, so that the build part doesn't need to use a GPU machine b. Make the Linux GPU build job and Linux CPU build job more similar: share the same bash script and yaml file. 5. Temporarily disable Attention_Mask1D_Fp16_B2_FusedNoPadding because it is causing one of our packaging pipeline to fail. I have created an ADO task for this.	2022-09-23 00:29:27 -07:00
Scott McKay	078ceab1db	Use full ORT package for onnxruntime-react-native. (#13037 ) Description: Use full ORT package for onnxruntime-react-native. Left the params required for the mobile build in comments so they're easily discovered if we need to create onnxruntime-react-native-mobile in the future. Motivation and Context Remove barrier to using ORT with react native as the mobile package that was being used supports a limited range of opsets/operators/types, and requires ORT format models. The full package will run any model.	2022-09-23 07:20:03 +10:00
sfatimar	cccbe90764	Openvino ep 2022.2 v4.2 (#13023 ) This changes are to align OV 2022.2 Release with ORT . Changes CPU FP16 Support, dGPU Support, RHEL Dockerfile, Ubuntu 20 Dockerfile Motivation and Context - This change is required to ensure ORT-OpenVINO Execution Provider is aligned with latest changes. - If it fixes an open issue, please link to the issue here. Co-authored-by: mayavijx <mayax.vijayan@intel.com> Co-authored-by: shamaksx <shamax.kshirsagar@intel.com> Co-authored-by: pratiksha <pratikshax.bapusaheb.vanse@intel.com> Co-authored-by: pratiksha <mohsinx.mohammad@intel.com> Co-authored-by: Sahar Fatima <sfatima.3001@gmail.com> Co-authored-by: Preetha Veeramalai <preetha.veeramalai@intel.com> Co-authored-by: nmaajidk <n.maajid.khan@intel.com> Co-authored-by: Mateusz Tabaka <mateusz.tabaka@intel.com> Co-authored-by: intel <intel@iotgecsp-nuc04.iind.intel.com>	2022-09-22 12:31:40 -07:00
Adrian Lizarraga	39e20686a0	[EP Perf Dashboard] Fix incorrect calls to trtexec with fp16 inputs (#13018 )	2022-09-21 10:31:45 -07:00
Yi Zhang	8356e3b9b0	Add onnx single node test data to tests (#12822 ) 1. add node test data to current model tests 2. support opset version to filter tests. 3. remove old filter based on onnx version. To avoid confusion, ONLY support opset version filter in onnxruntime_test_all 4. support read onnx test data from absolute path on Windows.	2022-09-21 10:02:57 -07:00
Changming Sun	b2b4f703a5	Move Linux GPU CI pipeline to T4 (#12996 ) Move Linux GPU CI pipeline to T4	2022-09-20 20:21:32 -07:00
Edward Chen	454f77cd94	Update kernel matching logic: decouple from op schemas and remove kernel def hashes (#12791 ) # Motivation Currently, ORT minimal builds use kernel def hashes to map from nodes to kernels to execute when loading the model. As the kernel def hashes must be known ahead of time, this works for statically registered kernels. This works well for the CPU EP. For this approach to work, the kernel def hashes must also be known at ORT format model conversion time, which means the EP with statically registered kernels must also be enabled then. This is not an issue for the always-available CPU EP. However, we do not want to require that any EP which statically registers kernels is always available too. Consequently, we explore another approach to match nodes to kernels that does not rely on kernel def hashes. An added benefit of this is the possibility of moving away from kernel def hashes completely, which would eliminate the maintenance burden of keeping the hashes stable. # Approach In a full build, ORT uses some information from the ONNX op schema to match a node to a kernel. We want to avoid including the ONNX op schema in a minimal build to reduce binary size. Essentially, we take the necessary information from the ONNX op schema and make it available in a minimal build. We decouple the ONNX op schema from the kernel matching logic. The kernel matching logic instead relies on per-op information which can either be obtained from the ONNX op schema or another source. This per-op information must be available in a minimal build when there are no ONNX op schemas. We put it in the ORT format model. Existing uses of kernel def hashes to look up kernels are replaced with the updated kernel matching logic. We no longer store kernel def hashes in the ORT format model’s session state and runtime optimization representations. We no longer keep the logic to generate and ensure stability of kernel def hashes.	2022-09-20 14:24:59 -07:00
Prathik Rao	8ea742b507	downgrade setuptools	2022-09-19 12:39:35 -07:00
Yi Zhang	08af88e3e2	Assign generate document job to CPU pool. (#12973 )	2022-09-15 10:42:12 -07:00
Changming Sun	626d94aa23	Refactor python packaging pipeline and nuget packaging pipeline (#12945 ) 1. Move the Linux ARM64 part of python packaging pipeline to a real ARM64 machine pool 2. Refactor the Linux CPU build jobs of python packaging pipeline to two parts: build and test. The test part will be exempted from Cyber EO compliance requirements as it won't affect the final bits we publish. This refactoring is to reduce dependencies in the build part. For example, this PR remove pytorch from the build dependencies. 3. Combine DML nuget packaging pipeline with "Zip-Nuget-Java-Nodejs Packaging Pipeline" as they all produce ORT nuget packages. Also, publish DML nuget packages and ORT GPU nuget packages to https://aiinfra.visualstudio.com/PublicPackages/_artifacts/feed/ORT-Nightly feed.	2022-09-13 14:50:31 -07:00
Yi Zhang	d8636c2be8	Add enable_onnx_tests in windows nuget test step (#12926 )	2022-09-12 10:08:24 -07:00
Dwayne Robinson	8e4eb24648	Update operator kernel table to include DML operators (#12887 ) * Fix bug in pybind get_all_operator_schema due to premature reference dropping * Add updated operator kernels markdown table * Update build.py to include documentation generation for DML operators too * Update GPU pipeline to include DML in the build to so operators can be generated. * Use a separate pipeline stage, feedback from Changming and Scott * Appease annoying Python linter * Add onnxruntime_BUILD_UNIT_TESTS=OFF and remove stale --use_dml in cuda stage	2022-09-09 10:21:25 -07:00
Changming Sun	ff52d6a6bf	Delete Dockerfile.ubuntu (#12888 ) The file was solely for Nuphar.	2022-09-08 10:26:40 -07:00
Changming Sun	a811c7629f	Remove "Build Python Documentation" from py-packaging-stage.yml (#12890 ) Remove "Build Python Documentation" from py-packaging-stage.yml because the task has been moved to Github actions by @natke in PR #10116 .	2022-09-08 09:56:54 -07:00
RandySheriffH	d3b684cd9e	Drop nuphar (#11555 ) * drop nuphar code and configs * refactor test case * format python * remove nuphar from training test * remove commented nuphar logics * restore llvm setting * drop nuphar ci * fix compile err * fix compile err Co-authored-by: Randy Shuai <rashuai@microsoft.com>	2022-09-07 15:11:18 -07:00
Yi Zhang	c571b99336	Refactor setup_test_data (#12818 ) * refactory setup_test_data * mv setup test data to test stage * model link for C# test * add comment	2022-09-07 08:33:27 +08:00
Baiju Meswani	295bd26980	Remove orttraining-distributed CI pipeline (#12738 )	2022-09-02 14:34:26 -07:00
PeixuanZuo	adbc0757ad	[UPDATE] update ROCm ci pipeline to ROCm5.2.3 (#12799 ) * [Update] update to rocm5.2.3 * [Fix] cmake version * [Fix] disbale ortmodule tests * [revert] revert performance number	2022-09-01 10:32:24 +08:00
Baiju Meswani	a52543ecd8	Generate windows training package (#12789 )	2022-08-30 16:35:50 -07:00
Yulong Wang	82a28cc2c3	upgrade emsdk to 3.1.19 (#12690 ) * upgrade emsdk to 3.1.19 * fix build break * ignore '-Wunused-but-set-variable' in eigen * add malloc and free in exported functions * EXPORTED_FUNCTIONS	2022-08-30 13:42:45 -07:00
Yi Zhang	b4f6dad7c9	increase timeout limit of mac silicon package workflow (#12784 ) increase timeout	2022-08-30 13:57:01 +08:00
PeixuanZuo	19ca2a0089	[ADD] python package pipeline for ROCm5.2.3 (#12770 ) * [TEST] test rocm5.2.3 [TEST] rm torchversion [Update]sort Co-authored-by: Ubuntu <peixuanzuo@peixuanzuomi200vm.zvflicr54joexhdgnhvmxrxygg.phxx.internal.cloudapp.net>	2022-08-30 11:05:59 +08:00
Edward Chen	1ce14e752b	Increase timeout for clean-build-docker-image-cache-pipeline. (#12776 )	2022-08-29 15:30:35 -07:00
Baiju Meswani	80c8d934b8	Add debug option to packaging pipeline (#12685 )	2022-08-26 20:25:52 -07:00
Adam Louly	ee543a47f6	upgrade cuda version on ci pipelines (training CI pipelines) (#12708 ) * upgrade cuda version on ci pipelines * keeping folder name same * keeping folder name same * setting manual seed for primitive test case * resolving comments * changing atol and rtrol only for test case Co-authored-by: Adam Louly <adamlouly@microsoft.com@orttrainingdev7.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>	2022-08-26 16:51:19 -07:00
Baiju Meswani	34d90dd5bd	mac-objc-static-analysis-ci-pipeline increase timeout (#12737 )	2022-08-26 12:49:49 -07:00
Adam Louly	3bb5fb0f90	moving training pipelines from cuda 11.5 to 11.6 and deprecating 11.3 (packaging pipeline) (#12688 ) * moving training pipelines from cuda 11.5 to 11.6 and deprecating cuda 11.3 * change to cuda 11.6.2 * change pytorch's & torchvision's cuda version to 11.6 * specify deps version to 11.6.2 * update pytorch and torch text version * torch 1.12.1 * change torchvision and torchtext version to be compatible with torch 1.12.1 * change cuda to 11.6 for cuda_home comaptibility Co-authored-by: Adam Louly <adamlouly@microsoft.com@orttrainingdev7.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>	2022-08-25 22:12:01 -07:00
Scott McKay	8483b9c6e3	MacOS pipeline and MAUI CoreML fixes (#12724 ) * Add asm statement to model.mm to force linker to link against CoreML.Framework. Update targets.xml as per Rolf's suggestions * Remove explicit numpy version from macos build. We don't specify it for other CIs and the version specified doesn't have a pre-built 3.10 wheel. This leads to the CI attempting to build numpy which fails.	2022-08-26 08:51:37 +10:00
Cassie Breviu	e85dce8cea	Add csharp docfx (#12596 ) * add docfx and gh action to build docs * kick off build from feature branch * Fix LGTM linting * update az pipeline to win22 & remove nuget install * remove azure ci changes * fix implicit using to support 5.0 * fix more js issues * remove resource designer changes * remove space * fix linting misspellings in autogenerated js temp * fix misspellings in generated code * delete log file	2022-08-25 09:51:32 -05:00
Yi Zhang	dee2fdffb0	Remove debug build/test in Mac CPU training (#12698 ) * run mac training parallely * update jobname * remove debug build/test	2022-08-25 13:38:53 +08:00
Yi Zhang	d91f017da1	remove redundant publish unit test results (#12697 ) rm redundant publish unit test results	2022-08-25 11:18:07 +08:00
Cheng	eba4f77d00	enable xnnpack in default_full_aar_build_settings (#12682 )	2022-08-25 10:41:06 +08:00
Changming Sun	7927d525a7	Remove CUDNN path from CI build scripts (#12671 )	2022-08-24 18:21:50 -07:00
Adam Louly	94f76b944e	nightly pipeline build using PTCA image. (#12605 ) * nightly pipeline yaml and requirements files * changed names, removed torchvision installing * delete old file Co-authored-by: Adam Louly <adamlouly@microsoft.com@orttrainingdev7.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>	2022-08-24 10:40:55 -07:00
Changming Sun	cb2601c5ea	Update mac-ci.yml to increase macOS build jobs' timeout value to 3 hours (#12675 )	2022-08-22 21:31:30 -07:00
Wei-Sheng Chin	dc486d146b	Make ORT callable from various Pytorch compilers (LazyTensor, TorchDynamo, etc) (#10460 ) * Make ORT as Pytorch JIT backend LORT likely doesn't work with aten fallback so we only test LORT in its own CI. * Revert changes to enable external CUDA allocator. Will add it later. Revert "Revert changes to enable external CUDA allocator. Will add it later." This reverts commit d5487f2e193014c805505afae8fb577c53667658. Fix external allocator * Relax tolerance and remove commented code * Print more information in CI * Fix pointer * Address comments. 1. Reuse ORT-eager mode's environment. 2. Remove unused ctor. * Use Pytorch master branch as all PRs are merged Fix * Refine based on cpplint feedbacks * Revert changes to allow custom CUDA allocator in public APIs * Use torch.testing.assert_close * Use unittest framework * Switch docker repo * Rename .cpp to .cc * Address comments * Add comment * Use same pipeline file for eager and lort pipelines * Address comments * Add yaml comment * Fix cmake files * Address comments * Rename flags, remove printing code, remove dead comment	2022-08-22 09:40:40 -07:00
Changming Sun	b270334e1e	Update numpy version from 1.21.0 to 1.21.6 to avoid building it from source (#12644 )	2022-08-18 22:11:48 -07:00
Changming Sun	ac7538b909	Remove CUDA 10.2 support (#12541 )	2022-08-10 22:46:41 -07:00
Baiju Meswani	3e78f3cf1f	Add win-ci pipeline for on-device training (#12513 )	2022-08-10 14:45:39 -07:00
Changming Sun	c0d396d176	Restrict "Component Detection" task to Lotus project only (#12536 ) It is related to PR #12426	2022-08-10 03:25:29 -07:00
Changming Sun	e810480403	Replace the occurrences of "master" to "main" in yaml files (#12534 )	2022-08-09 22:03:21 -07:00
Vincent Wang	e85e31ee80	Update ORTModule Default Opset Version to 15 (#12419 ) * update ortmodule opset to 15 * update torch version * fix ut * fix ut * rollback * rollback for orttrainer	2022-08-05 16:55:04 +08:00
PeixuanZuo	3e1b0ac4b3	[DELETE] delete python package rocm4.3.1 (#12480 ) [delete] delete rocm4.3.1	2022-08-05 13:27:42 +08:00
Changming Sun	5d610bc8eb	Disable CG task in PR pipelines (#12426 )	2022-08-02 19:01:41 -07:00
Yulong Wang	feed5da435	[js] loosen test timeout (#12427 ) Losen the following test timeout: 1. "Test Web Multi-Browsers" stage in "ONNX Runtime Web CI Pipeline": 30min -> 60min 2. Node.js binding default per-case timeout: 30 sec -> 90 sec	2022-08-02 19:01:19 -07:00
Changming Sun	1a64b94f60	Fix a small issue in nuget packaging pipeline (#12405 ) In #12358 I typed a wrong path in the yaml file.	2022-08-02 15:44:43 -07:00
Yi Zhang	5d1173fe68	Run IOS pipeline concurrently (#12400 ) split ios pipelines	2022-08-02 11:07:17 +08:00
Yi Zhang	63d64636f6	Add the comment linking to wiki (#12398 ) add the comment	2022-08-02 10:09:16 +08:00
Yi Zhang	8b4ad77ea2	pipeline can use last run's artifacts (#12379 ) * first step * depends on stage * temp change * specific * runId * parameters * fix typo * fix typo * add nnapi * add nnapi * fix typo * minor fix * condition on stage * format * format	2022-07-30 21:34:57 +08:00
Changming Sun	7b4ce0c1e1	Delete the build scripts that were copied from manylinux project (#12358 ) 1. Delete the build scripts that were copied from manylinux project. Use "git checkout" instead. 2. Update manylinux version to get python 3.11. Related issue: Python 3.11 support #12343 3. Change the cuda version of linux gpu build job of nuget packaging pipeline from cuda 11.4 to cuda 11.6 to match the TRT job within the same pipeline.. (A lot other places need be updated as well, but I'd prefer to put them in another PR) 4. Make dockerfile names static. For example, replace tools/ci_build/github/linux/docker/$(DockerFile) to tools/ci_build/github/linux/docker/Dockerfile.manylinux2014_cpu . The former one relies on a runtime variable $(DockerFile), Template Parameters are expanded early in processing a pipeline run when most variables are not available. It like C++ macros vs variables.	2022-07-29 18:24:19 -07:00
Jian Chen	7a7e372b9f	Remove training cuda 10.2 pipeline (#12347 ) * update to 2022 * Update the VS version * Rolling back to gcc 10 * Rolling back * Update cuda home * remove "CMAKE_CUDA_ARCHITECTURES=52" * update cuda Architure to 70 * Delete cuda 10.2 training pipeline * rolling back a mistake * Update win-gpu-reduce-op-ci-pipeline.yml * Update win-gpu-reduce-op-ci-pipeline.yml * Update win-gpu-reduce-op-ci-pipeline.yml * Delete tools/ci_build/github/linux/docker/scripts/training/ortmodule/stage1/requirements_torch1.10.0_cu10.2 directory * Delete tools/ci_build/github/linux/docker/scripts/training/ortmodule/stage1/requirements_torch1.11.0_cu10.2 directory	2022-07-28 14:58:17 -04:00
Edward Chen	6e892a95b4	Use specific Android NDK version in CI builds. (#12350 ) Current builds use a NDK version that happens to be on the build machine. The build machine environment may change in ways that are outside of our control. This change installs a specific version of NDK (the current LTS version 25.0.8775105) and uses it.	2022-07-28 11:01:04 -07:00
Changming Sun	e6bb447101	Change native folder name for java macos arm64 (#12335 )	2022-07-27 15:13:07 -07:00
msftlincoln	9cf6912bba	Fix ORT Eager Mode to work with Pytorch 1.12 (#12323 )	2022-07-27 16:24:46 -04:00
Yi Zhang	4df4471d5e	add missing build_java in Android testing stage. (#12187 ) add missing build_java in testing	2022-07-27 14:13:08 +08:00
pengwa	2b2367efbf	Fix orttraining-linux-gpu-ci-pipeline (fairscale dependency) (#12320 ) authored by: @pengwa	2022-07-26 15:11:04 -07:00
Baiju Meswani	ddb45e9126	On device training CI pipeline (#11987 )	2022-07-25 10:07:17 -07:00
Rachel Guo	496618594f	Update supported ops md for NNAPI/CoreML EP (#12245 ) * update supported ops md * address pr comments * address pr comments * wording	2022-07-21 10:23:08 -07:00

1 2 3 4 5 ...

1270 commits