onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-13 18:08:13 +00:00

Author	SHA1	Message	Date
Changming Sun	cc6bc34c8c	Update protobuf submodule (#10801 )	2022-03-09 09:37:58 -08:00
Dmitri Smirnov	58521fb822	Make training CUDA kernels adhere to established code structure patterns (#10735 ) Current training optimizer kernels include CPU headers, which affects the changes that we can make in the CPU code with a C++14 compiler and in other refactoring efforts. Rearrange the kernels according to the established patterns and do not include headers that are not needed.	2022-03-09 09:06:45 -08:00
Adam Pocock	4ef81b142d	Making the Java tests faster by optionally disabling ones which require running multiple JVMs. (#10811 )	2022-03-08 22:19:37 -08:00
Hariharan Seshadri	ae97ecf05b	Fix CPU, CUDA Selu activation logic (#10771 )	2022-03-08 19:53:27 -08:00
Edward Chen	c147c9dda6	Remove ORT_ENABLE_RUNTIME_OPTIMIZATION_IN_MINIMAL_BUILD. (#10778 ) Remove ORT_ENABLE_RUNTIME_OPTIMIZATION_IN_MINIMAL_BUILD as it is now implied by ORT_EXTENDED_MINIMAL_BUILD. Remove related CMake option.	2022-03-08 16:18:49 -08:00
George Wu	769aa8363d	update onnx-tensorrt to bring in https://github.com/onnx/onnx-tensorrt/pull/812 (#10810 )	2022-03-08 14:51:07 -08:00
Jingqiao Fu	f4fd67cc2c	Revert "add load from buffer (#10162 )" (#10590 ) This reverts commit `5cd57bb726`.	2022-03-08 13:35:23 -08:00
dependabot[bot]	7e04dccca7	Bump numpy in /tools/ci_build/github/linux/docker/scripts (#10385 ) Bumps [numpy](https://github.com/numpy/numpy) from 1.16.6 to 1.21.0. - [Release notes](https://github.com/numpy/numpy/releases) - [Changelog](https://github.com/numpy/numpy/blob/main/doc/HOWTO_RELEASE.rst.txt) - [Commits](https://github.com/numpy/numpy/compare/v1.16.6...v1.21.0) --- updated-dependencies: - dependency-name: numpy dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2022-03-08 11:02:36 -08:00
Sunghoon	68c8f5a1ef	Change a pipeline vmImage from windows-latest to windows-2019 (#10804 )	2022-03-08 10:49:59 -08:00
Yufeng Li	33c6819196	add qdq support of Sigmoid (#10800 )	2022-03-08 10:29:15 -08:00
Changming Sun	6260733533	Fix eager mode pipeline (#10802 ) It was still using python 3.6	2022-03-08 09:26:20 -08:00
Hariharan Seshadri	a9d9c6b486	Register CPU, CUDA and ROCM opset-16 kernels for some operators (#10643 )	2022-03-08 09:18:39 -08:00
Changming Sun	ce07dc30fd	Change how we apply patches to absl (#10799 )	2022-03-08 02:03:06 -08:00
George Wu	1e4a4bfe58	update onnx-tensorrt reference. (#10795 )	2022-03-07 21:45:46 -08:00
liqun Fu	da885a72e8	update with onnx 1.11 release (#10441 )	2022-03-07 21:10:55 -08:00
Yulong Wang	80917342b7	[js] upgrade mocha@8.2.1 to 9.2.1 (#10793 )	2022-03-07 20:40:24 -08:00
dependabot[bot]	4d943c9bd3	Bump numpy from 1.16.6 to 1.21.0 in /tools/ci_build/github/linux/docker/scripts/manylinux (#10387 ) * Bump numpy in /tools/ci_build/github/linux/docker/scripts/manylinux	2022-03-07 20:39:49 -08:00
PeixuanZuo	c07a27a008	[FIX] delete python3.6 from AMD python package docker image builder (#10790 ) * [UPDATE] delete python3.6 to cooperate numpy==1.21.0 * [UPDATE] delete python3.6 to cooperate numpy==1.21.0	2022-03-07 18:21:43 -08:00
Vincent Wang	4a38f9e31d	enable strided tensor for training only (#10748 )	2022-03-08 08:31:28 +08:00
zhangyaobit	b7f00b9682	Refactor the common code per operator into an abstract base class. (#10785 )	2022-03-07 13:15:49 -08:00
Daigo HIROOKA	a08036da09	correct symbolic name of GridSample operation (#10782 ) Function name needs to match PyTorch ATen op name, which is `aten::grid_sampler`.	2022-03-07 12:49:12 -08:00
dependabot[bot]	3e54f94bb0	Bump karma from 6.3.14 to 6.3.16 in /js/web Bumps [karma](https://github.com/karma-runner/karma) from 6.3.14 to 6.3.16. - [Release notes](https://github.com/karma-runner/karma/releases) - [Changelog](https://github.com/karma-runner/karma/blob/master/CHANGELOG.md) - [Commits](https://github.com/karma-runner/karma/compare/v6.3.14...v6.3.16) --- updated-dependencies: - dependency-name: karma dependency-type: direct:development ... Signed-off-by: dependabot[bot] <support@github.com>	2022-03-07 11:47:23 -08:00
Yulong Wang	25fdcfbd14	[js/web] allow multiple inference session creating concurrently (#10784 ) * test case * bugfix * fix * support multi session init	2022-03-07 11:35:06 -08:00
RandySheriffH	a4b5fa334a	Add type and shape information to profiled numbers (#10773 ) * add func to collect type shape * reformat * refactor perf view * remove obsolete	2022-03-07 10:17:58 -08:00
Changming Sun	d8bf9a479b	Remove python 3.6 from training pipelines (#10780 ) Because the numpy we use doesn't support python 3.6. And inference pipelines already removed python 3.6.	2022-03-07 09:57:24 -08:00
Hariharan Seshadri	9d30262422	Fix AMD training pipeline (#10788 )	2022-03-07 08:53:08 -08:00
Chen Fu	50a6f095cd	Symmetric QGEMM kernel for ARMv8 A55 chip (#10754 ) The ARM A55 micro-architecture (with dot product instructions), like the A53, is widely used for the little cores in big.LITTLE configurations. The A55 has narrower memory load/store hardware, where a 128b load instruction blocks the pipeline for 2 whole cycles, during which no other instructions can be executed. On the other hand, a 64b load instruction can be dual-issued with many other instructions. This change adds a Symmetric QGEMM kernel for the A55 micro-architecture, where we replace `ldr q4,[x1],#16` with `ldr d4,[x1],#8`, `ldr x11,[x1],#8`, `ins v4.d[1],x11` so that we can try to hide the memory load cycles behind computing cycles in the kernel. Co-authored-by: Chen Fu <fuchen@microsoft.com>	2022-03-07 08:41:13 -08:00
PeixuanZuo	55af7a96a7	update the amd ci pipeline (#10723 ) * [TEST] test to get amd pipeline information * [FIX] lower the threshold * [UPDATE] add retry task * [UPDATE] add retry task * [ERROR] error to occur retry * [FIX] error * [UPDATE] update retryCountOnTaskFailure to 1 time * [UPDATE] add showmeminfo	2022-03-07 18:39:42 +08:00
Fei Hu	60acfd3dd8	Support CUDA Graph in the CUDA EP (#9978 )	2022-03-06 20:47:31 -08:00
Tianlei Wu	0e335aba37	Update BeamSearch operator spec to support t5 (#10777 ) * change BeamSearch op to support encoder decoder model * check model_type and decoder attribute * fix * update comments * warn shape inference issue with onnx v1.11 or T5 * skip parity test when temperature != 1.0 * fix build	2022-03-04 21:52:45 -08:00
George Nash	6be5185088	Update dnnl Add, Mul, Sub, Div ops to handle scalar values (#10756 ) * Update dnnl Add, Mul, Sub, Div ops to handle scalar values Signed-off-by: George Nash <george.nash@intel.com> * Add additional scalar support for dnnl execution provider This will add scalar support for: Eltwise operators: Abs, Elu, Exp, LeakyRelu, Log, Relu, Round, Sigmoid, Softplus, Sqrt, and Tanh Gelu operators: BiasGelu, FastGelu, and Gelu Softmax operator Signed-off-by: George Nash <george.nash@intel.com>	2022-03-04 19:28:25 -08:00
Ye Wang	259ade2557	Add ability to modify num_hidden_layers from benchmark script (#10760 ) * add ability to modify num_hidden_layers from benchmark script * comment * Revert "comment" This reverts commit 28794b0e4f86506dcc937738894fcef97fc84e48. * Revert "add ability to modify num_hidden_layers from benchmark script" This reverts commit 96f36ed7f751721bcf4e3ab8748a715f19a4e044. * review comments Co-authored-by: Ubuntu <wy@linux-v100.aidmrjtolptuzevavgwhrapqcd.jx.internal.cloudapp.net>	2022-03-04 18:28:51 -08:00
Ella Charlaix	fde847473b	Add min max moving average calibration method (#10753 ) * Add min max moving average calibration method * Modify the calibration extra options dictionary creation	2022-03-04 14:55:31 -08:00
Maxiwell	43ff27c7c8	ppc64le: optimizing the MlasQuantizeLinear() with VSX (#10644 ) This code is valid only when -mcpu is set to utilize POWER9 technology or above. A compatible code for POWER8 was created as well, but it was not tuned for performance.	2022-03-04 14:54:56 -08:00
Tianlei Wu	379b3cdef6	T5 to ONNX conversion script (#10766 ) * T5 onnx conversion script	2022-03-04 14:42:04 -08:00
Olivia Jain	12eb660415	Compare TRT vs ORT-TRT Accurately (#10565 ) * get inputs independently for trtexec * track one process only * remove engine and profile files * change time to commit time * add runtime option for io binding * move to commit date * fixes * add option for graph optimization * cleanup docker script * include remaining changes * choose graph optimization option * add space in option	2022-03-04 10:14:18 -08:00
dependabot[bot]	e3c85d4262	Bump numpy Bumps [numpy](https://github.com/numpy/numpy) from 1.19.5 to 1.21.0. - [Release notes](https://github.com/numpy/numpy/releases) - [Changelog](https://github.com/numpy/numpy/blob/main/doc/HOWTO_RELEASE.rst.txt) - [Commits](https://github.com/numpy/numpy/compare/v1.19.5...v1.21.0) --- updated-dependencies: - dependency-name: numpy dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com>	2022-03-04 09:51:32 -08:00
dependabot[bot]	b780a3784e	Bump numpy in /tools/ci_build/github/linux/docker/scripts/training Bumps [numpy](https://github.com/numpy/numpy) from 1.19.5 to 1.21.0. - [Release notes](https://github.com/numpy/numpy/releases) - [Changelog](https://github.com/numpy/numpy/blob/main/doc/HOWTO_RELEASE.rst.txt) - [Commits](https://github.com/numpy/numpy/compare/v1.19.5...v1.21.0) --- updated-dependencies: - dependency-name: numpy dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com>	2022-03-04 09:38:38 -08:00
dependabot[bot]	0b0e8ccf92	Bump numpy Bumps [numpy](https://github.com/numpy/numpy) from 1.19.5 to 1.21.0. - [Release notes](https://github.com/numpy/numpy/releases) - [Changelog](https://github.com/numpy/numpy/blob/main/doc/HOWTO_RELEASE.rst.txt) - [Commits](https://github.com/numpy/numpy/compare/v1.19.5...v1.21.0) --- updated-dependencies: - dependency-name: numpy dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com>	2022-03-04 09:34:58 -08:00
Changming Sun	283d0c47b4	Update our absl cmake files (#10762 )	2022-03-04 09:28:04 -08:00
zhangyaobit	4c88fa5971	Add micro-benchmark for FastGelu (#10744 ) * Add micro-benchmark for FastGelu * Delete the bert-base case, as it is very similar to the bert-large one. * Add argument parsing and more user-friendly provider type assertion.	2022-03-04 08:51:15 -08:00
Valery Chernov	46d0b20ac2	upstream TVM. small code cleaning (#10515 ) Co-authored-by: Valery Chernov <valery.chernov@deelvin.com>	2022-03-04 12:15:29 +01:00
Edward Chen	395a7242d6	[iOS packaging] Minor updates. (#10755 ) * Change storage container, simplify build definition parameters. * Remove explicit version from Objective-C docs. * Increase timeout. * Use real storage account. * Get static website URL with az cli.	2022-03-04 16:02:53 +10:00
Scott McKay	e337f5faf3	Enable QDQ cleanup and NHWC optimizers in an extended minimal build. (#10729 ) * Enable QDQ cleanup and NHWC optimizers in an extended minimal build.	2022-03-04 15:45:42 +10:00
Guoyu Wang	7aa706854f	Pipeline changes to build full ORT package for Android (#10654 ) * Add android package build settings for full build Co-authored-by: gwang0000 <62914304+gwang0000@users.noreply.github.com> Co-authored-by: Scott McKay <skottmckay@gmail.com> Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>	2022-03-04 15:35:54 +10:00
Scott McKay	6072c6b65e	Simplify QLinearConv registration so type reduction works with it. (#10747 ) * Simplify QLinearConv registration so type reduction works with it. * Update QLinearMatMul registration to be a standard typed registration	2022-03-04 14:06:04 +10:00
Abhishek Kulkarni	c2c85dd6b1	Add an option to export ONNX graphs in ORTModule tests (#10579 ) Co-authored-by: Abhishek Kulkarni <abkulkarni@microsoft.com>	2022-03-03 16:56:19 -08:00
Yulong Wang	745fa5885f	optimize web assembly build flags for multi-thread (#10759 )	2022-03-03 16:44:14 -08:00
Edward Chen	c8ec7782bd	Fix unused variable warning, move variable definitions closer to usages. (#10757 )	2022-03-04 09:18:33 +10:00
Olivia Jain	ed87e1b721	Change axis to 0D in cumsum tests. (#10715 ) * changing axis to 0 * if def for openvino * removing extra header * include changes * pass in 0D scalar * Add comment explaining change.	2022-03-03 10:44:46 -08:00


6477 commits