onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-06-07 00:13:17 +00:00

Author	SHA1	Message	Date
PeixuanZuo	c26bb1bb19	Allow fastgelu/skiplayernorm profile by pass args from commandline (#13025 ) Description: Describe your changes. This allow us quickly launch a microbench session by, for example: `python skip_layer_norm_test.py 8 128 128 float32 `	2022-09-28 15:48:59 -07:00
cloudhan	32c2c4b480	Change ROCm to use tunable GEMM (#12853 ) Change ROCm to use tunable GEMM. It is not enabled in this PR. This will drastically improve GEMM performance in some shapes and dtypes configuration. This will benefit the overall performance for BERT inference and hopefully, training, when enabled.	2022-09-28 16:21:54 +08:00
PeixuanZuo	5e4ebbd9d9	[ROCm] add MIGraphX ci pipeline (#11569 ) Description: Describe your changes. Add migraphx ci pipeline, test build and unit tests. This PR is based on #11492 Pipeline : https://dev.azure.com/onnxruntime/onnxruntime/_build/results?buildId=765711&view=results	2022-09-28 10:59:30 +08:00
Yi Zhang	19774f9230	print test case's skip reason (#13118 ) ### Description as title ### Motivation and Context easy to debug	2022-09-28 09:33:31 +08:00
Baiju Meswani	f99d00fa38	Add rel* branches to upload training packages to final storage (#13124 )	2022-09-27 17:20:17 -07:00
Rachel Guo	9a44a69653	Refactor NNAPI EP OpBuilder/OpSupportChecker structure (#13065 ) ### Description <!-- Describe your changes. --> As title -Split long OpBuilder and OpSupportChecker files into individual operator files. -Add OpBuilder/SupportChecker registry factories. -Combine the functionality of op_builder and op_support_checker into one op_builder. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> The NNAPI OPBuilder was splitted into OPBuilder (For EP::Compile) and OPSupportChecker (for EP::GetCapability) At the time it was reasonable choice, but OPBuilder/OPSupportChecker share some logic and has to use addition helper. Clean up now to make NNAPI OPBuilder/OPSupportChecker into single OPBuilder (similar to what CoreML EP has)	2022-09-27 17:12:09 -07:00
Edward Chen	457a53c92f	Fix static analysis warning by making derived classes final. (#13123 ) Follow up to #13059, which only updated the base classes. This change ensures that the derived classes will not be base classes.	2022-09-27 15:45:45 -07:00
Scott McKay	e19163167e	Update React Native documentation to reflect change to use full ORT (#13091 ) ### Description <!-- Describe your changes. --> Update React Native documentation to reflect change to use full ORT. Fix broken links. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? ORT v1.13 uses the full ORT package. Instructions for performing a custom build did not cover this. Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>	2022-09-28 08:11:58 +10:00
Edward Chen	5c89c37f7f	Consolidate enabled/default kernel def type constraints (#13034 ) Consolidate enabled/default kernel def type constraint types into enabled.	2022-09-27 14:04:15 -07:00
Faith Xu	440f31668f	Labeler: Test /i regex for case sensitivity (#13115 ) ### Description Test if regex change will make auto labeling case insensitive	2022-09-27 13:58:09 -07:00
PeixuanZuo	13d1a3c007	[ROCm] add SkipLayerNorm vectorize Regular case (#12821 ) Description: Describe your changes. add SkipLayerNorm vectorize regular case 1. when hidden size <= 1024, SkipLayerNormTunable op can use both small case and regular case 2. when hidden size > 1024, SkipLayerNormTunable op can only use regular case. Motivation and Context - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here.	2022-09-27 12:52:10 -07:00
leqiao-1	43766ee36d	Fix OLive build pipeline (#13114 )	2022-09-27 10:19:58 -07:00
Vincent Wang	94e34ace15	Bugfix for SimplifiedLayerNormalization (#12975 ) This PR is to fix https://github.com/microsoft/onnxruntime/issues/12930 and https://github.com/microsoft/onnxruntime/issues/12579. In detail: - For CPU EP, since current impl of SimplifiedLayerNormalization doesn't support input and scale having different data types, so if the sub-graph contains Cast Op, the sub-graph will not fused, this guarantee that both inputs and output data type will be same - For CUDA EP, add (fp16, float) support to (T,V) type constraints all combinations of fp16 and float can be supported in the impl With the fix, the original model can be run with SimplifiedLayerNormalization, which also helps to improve the perf.	2022-09-27 14:24:16 +08:00
RandySheriffH	237ccc01c7	Remove one last nuphar reference (#13111 ) Remove one last nuphar reference.	2022-09-26 23:02:36 -07:00
Changming Sun	b25437ec41	Upgrade protobuf version (#13100 ) Upgrade protobuf version from 3.18.1 to 3.18.3 to address CVE-2022-1941	2022-09-26 21:30:28 -07:00
Hector Li	073dbba784	skip the placeholder inputs while adding node inputs as sub-graph inputs (#13106 ) Fix issue that all nodes inputs are added as sub-graph inputs event the input does not exist. Solution: Skip the placeholder inputs while adding node inputs as sub-graph inputs. E.g Onnx node test test_resize_upsample_scales_linear, 2nd input roi is empty.	2022-09-26 21:06:29 -07:00
Yufeng Li	c746083344	use parameter names to specify argument mapping (#13108 ) use parameter names to specify argument mapping to avoid mismatches.	2022-09-26 20:56:59 -07:00
RandySheriffH	e3bdba37a8	Mitigate prefast static analysis warnings (#13032 ) Address static analysis warnings: https://msdata.visualstudio.com/DefaultCollection/Vienna/_workitems/edit/1944984/ https://msdata.visualstudio.com/DefaultCollection/Vienna/_workitems/edit/1943846/ Co-authored-by: Randy Shuai <rashuai@microsoft.com>	2022-09-26 17:06:33 -07:00
RandySheriffH	77a066c700	Drop nuphar from java API (#13107 ) Drop nuphar from: - java API - tvm.cmake - run_build.sh	2022-09-26 17:06:08 -07:00
Vincent Wang	0e98fb4e9b	Fix Build Error for CUDA113 Introduced by `6efa9d9` (#13089 ) Fix build error for CUDA version < 11.4. The error was introduce by commit `6efa9d9e10`.	2022-09-27 07:57:14 +08:00
Edward Chen	b62ba0b5a7	Remove old enable_linux_gpu_tests parameter from template invocation. (#13102 ) Remove old enable_linux_gpu_tests parameter from template invocation in build-perf-test-binaries-pipeline.yml.	2022-09-26 16:27:40 -07:00
Chen Fu	e9b1bbc6a5	fix Numpy array None judgement bug (#13103 ) fix https://github.com/microsoft/onnxruntime/issues/13054	2022-09-26 15:15:32 -07:00
RandySheriffH	a83a9ed6b0	Remove miscellaneous nuphar configs (#13070 ) Remove a handful of nuphar related configurations after deprecation. Co-authored-by: Randy Shuai <rashuai@microsoft.com>	2022-09-26 13:41:28 -07:00
Jian Chen	44c14e8cbb	Adding test case for conv per channel with QDQ format (#13041 ) Description: Adding test case for conv per channel with QDQ format	2022-09-26 16:25:28 -04:00
Dale Phurrough	2ae33b3613	fix CuDNN lib path for Windows (#12974 ) Fixes microsoft/onnxruntime#12969 ### Motivation and Context Build is broken, can't find cudnn.lib with nvidia official install of cuDNN Alternative method is to use `IF(EXISTS ${onnxruntime_CUDNN_HOME}/lib/x64/cudnn.lib)` to test for legacy location and only add the legacy dir to the path, else add the current official `lib/` dir.	2022-09-26 13:23:38 -07:00
Nat Kershaw (MSFT)	ce2ea44a56	Try to fix GitHub labeling action (#12999 )	2022-09-26 11:46:28 -07:00
Changming Sun	7116825aef	Add CMAKE_CUDA_ARCHITECTURES list to python packaging pipeline (#13081 )	2022-09-26 10:22:43 -07:00
mayavijx	ade0d29174	Updated Dockerfile.ubuntu_openvino with OV 2022.2 official release (#13069 ) Updated Dockerfile.ubuntu_openvino to use OV 2022.2 official release which was using pre release only.	2022-09-26 00:15:52 -07:00
dependabot[bot]	365a01397d	Bump protobuf from 3.17.0 to 3.18.3 in /tools/ci_build Bumps [protobuf](https://github.com/protocolbuffers/protobuf) from 3.17.0 to 3.18.3. - [Release notes](https://github.com/protocolbuffers/protobuf/releases) - [Changelog](https://github.com/protocolbuffers/protobuf/blob/main/generate_changelog.py) - [Commits](https://github.com/protocolbuffers/protobuf/compare/v3.17.0...v3.18.3) --- updated-dependencies: - dependency-name: protobuf dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com>	2022-09-25 20:00:36 -07:00
Scott McKay	b820256f34	Add check that bias and scale sizes match norm_size in LayerNormalization (#13060 ) ### Description Add check that bias and scale sizes match norm_size in LayerNormalization. ### Motivation and Context #12917	2022-09-26 08:22:49 +10:00
Hariharan Seshadri	19c51376c4	Introduce QDQ transformer fusion tools for ordered quantized ops (#12661 )	2022-09-24 23:22:44 -07:00
dependabot[bot]	6587a85f8f	Bump protobuf from 3.18.1 to 3.18.3 in /tools/ci_build/github/linux/tvm Bumps [protobuf](https://github.com/protocolbuffers/protobuf) from 3.18.1 to 3.18.3. - [Release notes](https://github.com/protocolbuffers/protobuf/releases) - [Changelog](https://github.com/protocolbuffers/protobuf/blob/main/generate_changelog.py) - [Commits](https://github.com/protocolbuffers/protobuf/compare/v3.18.1...v3.18.3) --- updated-dependencies: - dependency-name: protobuf dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com>	2022-09-24 21:12:16 -07:00
dependabot[bot]	c1ff4b468d	Bump protobuf in /tools/ci_build/github/linux/docker/scripts/manylinux Bumps [protobuf](https://github.com/protocolbuffers/protobuf) from 3.18.1 to 3.18.3. - [Release notes](https://github.com/protocolbuffers/protobuf/releases) - [Changelog](https://github.com/protocolbuffers/protobuf/blob/main/generate_changelog.py) - [Commits](https://github.com/protocolbuffers/protobuf/compare/v3.18.1...v3.18.3) --- updated-dependencies: - dependency-name: protobuf dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com>	2022-09-24 15:21:50 -07:00
Chih-Hsuan Yen	9abd6e3a30	setup.py: use packaging instead of wheel.vendored.packaging (#13083 )	2022-09-24 08:32:44 -07:00
ytaous	2cc4e7e5c2	[Build] Fix broken AMD CI (#13082 ) Introduced by https://github.com/microsoft/onnxruntime/pull/12949 - add missing lines in excluded list Co-authored-by: Ethan Tao <ettao@microsoft.com>	2022-09-24 00:21:25 -07:00
dependabot[bot]	63c3b21902	Bump protobuf from 3.18.1 to 3.18.3 in /tools/ci_build/github/linux/docker/inference/x64/python/cpu/scripts (#13080 )	2022-09-23 22:15:36 -07:00
Scott McKay	8e2528bad2	More LayoutNormalization opset 17 changes (#13066 ) ### Description Add CUDA kernel. Support double in CPU kernel and only write Mean and InvStdDev values if the optional outputs exist. ### Motivation and Context Complete opset 17 support for LayoutNormalization	2022-09-24 13:22:44 +10:00
Changming Sun	9e21ffb649	Add license header to some files. (#13074 )	2022-09-23 18:46:02 -07:00
Baiju Meswani	bcc93ab17c	Deprecate ORTTrainer (#13022 )	2022-09-23 18:10:09 -07:00
Tianlei Wu	6f27659ceb	Fix prefast warnings (#13017 ) Fix prefast warnings: [C26451](https://learn.microsoft.com/en-us/cpp/code-quality/C26451?view=msvc-170) [C26436](https://learn.microsoft.com/en-us/cpp/code-quality/c26436?view=msvc-170) [C26814](https://learn.microsoft.com/en-us/cpp/code-quality/C26814?view=msvc-170)	2022-09-23 12:50:23 -07:00
Baiju Meswani	8bb16ab900	Propagate environment variable to docker image (#13031 )	2022-09-23 11:23:49 -07:00
Zhang Lei	6efa9d9e10	Add more qordered int8 operators for CUDA provider (#12949 ) Attention, Quantize/Dequantize etc. Update QOrderedMatmul's schema, updated unittest. Verified test data for QOrdered Attention. Co-authored-by: Zhang Lei <phill.zhang@gmail.com> Co-authored-by: Lei Zhang <zhalei@microsoft.com>	2022-09-23 10:49:33 -07:00
Edward Chen	5f611b63a1	Make classes IKernelTypeStrResolver and IKernelLookup have protected destructors. (#13059 )	2022-09-23 09:16:45 -07:00
PeixuanZuo	2ef1f8b93e	[ROCm] add tunable SkipLayerNorm for ROCm EP (#12817 ) Description: Describe your changes. Related PR: https://github.com/microsoft/onnxruntime/pull/12803 https://github.com/microsoft/onnxruntime/pull/12816 https://github.com/microsoft/onnxruntime/pull/12821 1.add tunable skip layernorm for rocm ep 2. keep origin implementation when disable tuning. Motivation and Context - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here.	2022-09-23 16:39:44 +08:00
Changming Sun	eafd67b8fd	Update CUDA version to 11.6 and refactor python packaging pipeline (#13002 ) 1. Update CUDA version from 11.4 to 11.6. 2. Update Manylinux version 3. Upgrade GCC version from 10 to 11 for most x86_64 pipelines. CentOS 7 ARM64 doesn't have GCC 11 yet. 4. Refactor python packaging pipeline: a. Split Linux GPU build job to two parts, build and test, so that the build part doesn't need to use a GPU machine b. Make the Linux GPU build job and Linux CPU build job more similar: share the same bash script and yaml file. 5. Temporarily disable Attention_Mask1D_Fp16_B2_FusedNoPadding because it is causing one of our packaging pipeline to fail. I have created an ADO task for this.	2022-09-23 00:29:27 -07:00
Yi Zhang	92237567d3	add opset17 node test data (#13062 ) ### Description ### Add opset17 node test data ### Motivation and Context ### <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->	2022-09-23 14:33:37 +08:00
cloudhan	a24b41d92e	Move all TunableOp related falicilities to EP level directory (#12857 ) Some Ops in EP directory instead of contrib_ops directory will require TunableOp. We will also need to add EP level session tuning options for it. So move those code all at once. Also remove duplicated utility functions.	2022-09-23 11:10:19 +08:00
Faith Xu	8fb3f05cd6	Add cgmanifest file in codeowner list (#13042 ) Marks @onnxruntime-admin as owner for cgmanifest file to help review changes in dependencies and version updates.	2022-09-22 18:58:01 -07:00
Scott McKay	394c249c7c	Add ONNX LayerNormalization(17) (#12978 ) Description: LayerNormalization is now part of the ONNX spec as of opset 17. We had a LayerNormalization contrib op, which (incorrectly) was registered in the ONNX domain. Use that implementation for the ONNX operator. Update skip_layer_norm_fusion.cc. There are other optimizers that use LayerNormalization that need updates as well. Motivation and Context #12916	2022-09-23 09:49:27 +10:00
wangxiyuan	952c99304a	Add CANN EP (#12416 ) Description: This PR adds Ascend CANN execution provider support. Motivation and Context - Why is this change required? What problem does it solve? As the info shown in the issue. CANN is the API layer for Ascend processor. Add CANN EP can allow user run onnx model on Ascend hardware via onnxruntime The detail change: 1. Added CANN EP framework. 2. Added the basic operators to support ResNet and VGG model. 3. Added C/C++、Python API support - If it fixes an open issue, please link to the issue here. https://github.com/microsoft/onnxruntime/issues/11477 Author: lijiawei <lijiawei19@huawei.com> wangxiyuan <wangxiyuan1007@gmail.com> Co-authored-by: FFrog <ljw1101.vip@gmail.com>	2022-09-22 14:53:40 -07:00

1 2 3 4 5 ...

7466 commits