Changming Sun
cc6bc34c8c
Update protobuf submodule ( #10801 )
2022-03-09 09:37:58 -08:00
Dmitri Smirnov
58521fb822
Make training CUDA kernels adhere to established code structure patterns ( #10735 )
...
The current training optimizer kernels include CPU headers, which
limits the changes we can make in the CPU code with a C++14 compiler and
hinders other refactoring efforts. Rearrange the kernels according to the established patterns
and do not include headers that are not needed.
2022-03-09 09:06:45 -08:00
Adam Pocock
4ef81b142d
Making the Java tests faster by optionally disabling ones which require running multiple JVMs. ( #10811 )
2022-03-08 22:19:37 -08:00
Hariharan Seshadri
ae97ecf05b
Fix CPU, CUDA Selu activation logic ( #10771 )
2022-03-08 19:53:27 -08:00
Edward Chen
c147c9dda6
Remove ORT_ENABLE_RUNTIME_OPTIMIZATION_IN_MINIMAL_BUILD. ( #10778 )
...
Remove ORT_ENABLE_RUNTIME_OPTIMIZATION_IN_MINIMAL_BUILD as it is now implied by ORT_EXTENDED_MINIMAL_BUILD.
Remove related CMake option.
2022-03-08 16:18:49 -08:00
George Wu
769aa8363d
update onnx-tensorrt to bring in https://github.com/onnx/onnx-tensorrt/pull/812 ( #10810 )
2022-03-08 14:51:07 -08:00
Jingqiao Fu
f4fd67cc2c
Revert "add load from buffer ( #10162 )" ( #10590 )
...
This reverts commit 5cd57bb726.
2022-03-08 13:35:23 -08:00
dependabot[bot]
7e04dccca7
Bump numpy in /tools/ci_build/github/linux/docker/scripts ( #10385 )
...
Bumps [numpy](https://github.com/numpy/numpy ) from 1.16.6 to 1.21.0.
- [Release notes](https://github.com/numpy/numpy/releases )
- [Changelog](https://github.com/numpy/numpy/blob/main/doc/HOWTO_RELEASE.rst.txt )
- [Commits](https://github.com/numpy/numpy/compare/v1.16.6...v1.21.0 )
---
updated-dependencies:
- dependency-name: numpy
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-03-08 11:02:36 -08:00
Sunghoon
68c8f5a1ef
Change a pipeline vmImage from windows-latest to windows-2019 ( #10804 )
2022-03-08 10:49:59 -08:00
Yufeng Li
33c6819196
add qdq support of Sigmoid ( #10800 )
2022-03-08 10:29:15 -08:00
Changming Sun
6260733533
Fix eager mode pipeline ( #10802 )
...
It was still using Python 3.6.
2022-03-08 09:26:20 -08:00
Hariharan Seshadri
a9d9c6b486
Register CPU, CUDA and ROCM opset-16 kernels for some operators ( #10643 )
2022-03-08 09:18:39 -08:00
Changming Sun
ce07dc30fd
Change how we apply patches to absl ( #10799 )
2022-03-08 02:03:06 -08:00
George Wu
1e4a4bfe58
update onnx-tensorrt reference. ( #10795 )
2022-03-07 21:45:46 -08:00
liqun Fu
da885a72e8
update with onnx 1.11 release ( #10441 )
2022-03-07 21:10:55 -08:00
Yulong Wang
80917342b7
[js] upgrade mocha@8.2.1 to 9.2.1 ( #10793 )
2022-03-07 20:40:24 -08:00
dependabot[bot]
4d943c9bd3
Bump numpy from 1.16.6 to 1.21.0 in /tools/ci_build/github/linux/docker/scripts/manylinux ( #10387 )
...
* Bump numpy in /tools/ci_build/github/linux/docker/scripts/manylinux
2022-03-07 20:39:49 -08:00
PeixuanZuo
c07a27a008
[FIX] delete python3.6 from AMD python package docker image builder ( #10790 )
...
* [UPDATE] delete python3.6 for compatibility with numpy==1.21.0
2022-03-07 18:21:43 -08:00
Vincent Wang
4a38f9e31d
enable strided tensor for training only ( #10748 )
2022-03-08 08:31:28 +08:00
zhangyaobit
b7f00b9682
Refactor the common code per operator into an abstract base class. ( #10785 )
2022-03-07 13:15:49 -08:00
Daigo HIROOKA
a08036da09
correct symbolic name of GridSample operation ( #10782 )
...
Function name needs to match PyTorch ATen op name, which is `aten::grid_sampler`.
2022-03-07 12:49:12 -08:00
dependabot[bot]
3e54f94bb0
Bump karma from 6.3.14 to 6.3.16 in /js/web
...
Bumps [karma](https://github.com/karma-runner/karma ) from 6.3.14 to 6.3.16.
- [Release notes](https://github.com/karma-runner/karma/releases )
- [Changelog](https://github.com/karma-runner/karma/blob/master/CHANGELOG.md )
- [Commits](https://github.com/karma-runner/karma/compare/v6.3.14...v6.3.16 )
---
updated-dependencies:
- dependency-name: karma
  dependency-type: direct:development
...
Signed-off-by: dependabot[bot] <support@github.com>
2022-03-07 11:47:23 -08:00
Yulong Wang
25fdcfbd14
[js/web] allow creating multiple inference sessions concurrently ( #10784 )
...
* test case
* bugfix
* fix
* support multi session init
2022-03-07 11:35:06 -08:00
RandySheriffH
a4b5fa334a
Add type and shape information to profiled numbers ( #10773 )
...
* add func to collect type shape
* reformat
* refactor perf view
* remove obsolete
2022-03-07 10:17:58 -08:00
Changming Sun
d8bf9a479b
Remove python 3.6 from training pipelines ( #10780 )
...
The numpy version we use doesn't support Python 3.6, and the inference pipelines have already removed Python 3.6.
2022-03-07 09:57:24 -08:00
Hariharan Seshadri
9d30262422
Fix AMD training pipeline ( #10788 )
2022-03-07 08:53:08 -08:00
Chen Fu
50a6f095cd
Symmetric QGEMM kernel for ARMv8 A55 chip ( #10754 )
...
The ARM A55 micro-architecture (with dot product instructions), similar to the A53, is widely used for the little cores in big.LITTLE configurations. The A55 has narrower memory load/store hardware: a 128-bit load instruction blocks the pipeline for 2 whole cycles, during which no other instructions can be executed. A 64-bit load instruction, on the other hand, can be dual-issued with many other instructions.
This change adds a Symmetric QGEMM kernel for the A55 micro-architecture, where we replace
    ldr q4,[x1],#16
with
    ldr d4,[x1],#8
    ldr x11,[x1],#8
    ins v4.d[1],x11
so that we can try to hide the memory load cycles behind computing cycles in the kernel.
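The equivalence of the two load sequences can be checked with a small simulation. This is only a sketch in Python, not the kernel itself: the helper names `load_q` and `load_split` are hypothetical, and little-endian byte order is assumed. It shows that two post-incremented 64-bit loads followed by an insert into the upper lane yield the same 128-bit value and the same final pointer as a single 128-bit load.

```python
# Hypothetical model of the two load sequences; not the actual MLAS kernel.
# Memory is a bytes object, the pointer x1 is an integer offset, and
# little-endian byte order is assumed.

def load_q(mem: bytes, x1: int):
    """Model `ldr q4,[x1],#16`: one 128-bit load, post-increment by 16."""
    v4 = int.from_bytes(mem[x1:x1 + 16], "little")
    return v4, x1 + 16


def load_split(mem: bytes, x1: int):
    """Model the three-instruction replacement sequence."""
    # ldr d4,[x1],#8  -> low 64 bits of v4 (the upper half is cleared)
    v4 = int.from_bytes(mem[x1:x1 + 8], "little")
    x1 += 8
    # ldr x11,[x1],#8 -> next 64 bits into a general-purpose register
    x11 = int.from_bytes(mem[x1:x1 + 8], "little")
    x1 += 8
    # ins v4.d[1],x11 -> insert x11 into the upper 64-bit lane of v4
    v4 |= x11 << 64
    return v4, x1


memory = bytes(range(32))
assert load_q(memory, 0) == load_split(memory, 0)
assert load_q(memory, 16) == load_split(memory, 16)
```

The simulation only confirms that the register contents match; the performance benefit described above comes from how the A55 issues the narrower loads and is not modeled here.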
Co-authored-by: Chen Fu <fuchen@microsoft.com>
2022-03-07 08:41:13 -08:00
PeixuanZuo
55af7a96a7
update the amd ci pipeline ( #10723 )
...
* [TEST] test to get amd pipeline information
* [FIX] lower the threshold
* [UPDATE] add retry task
* [ERROR] error to occur retry
* [FIX] error
* [UPDATE] update retryCountOnTaskFailure to 1 time
* [UPDATE] add showmeminfo
2022-03-07 18:39:42 +08:00
Fei Hu
60acfd3dd8
Support CUDA Graph in the CUDA EP ( #9978 )
2022-03-06 20:47:31 -08:00
Tianlei Wu
0e335aba37
Update BeamSearch operator spec to support t5 ( #10777 )
...
* change BeamSearch op to support encoder decoder model
* check model_type and decoder attribute
* fix
* update comments
* warn shape inference issue with onnx v1.11 or T5
* skip parity test when temperature != 1.0
* fix build
2022-03-04 21:52:45 -08:00
George Nash
6be5185088
Update dnnl Add, Mul, Sub, Div ops to handle scalar values ( #10756 )
...
* Update dnnl Add, Mul, Sub, Div ops to handle scalar values
Signed-off-by: George Nash <george.nash@intel.com>
* Add additional scalar support for dnnl execution provider
This will add scalar support for:
Eltwise operators: Abs, Elu, Exp, LeakyRelu, Log, Relu, Round,
Sigmoid, Softplus, Sqrt, and Tanh
Gelu operators: BiasGelu, FastGelu, and Gelu
Softmax operator
Signed-off-by: George Nash <george.nash@intel.com>
2022-03-04 19:28:25 -08:00
Ye Wang
259ade2557
Add ability to modify num_hidden_layers from benchmark script ( #10760 )
...
* add ability to modify num_hidden_layers from benchmark script
* comment
* Revert "comment"
This reverts commit 28794b0e4f86506dcc937738894fcef97fc84e48.
* Revert "add ability to modify num_hidden_layers from benchmark script"
This reverts commit 96f36ed7f751721bcf4e3ab8748a715f19a4e044.
* review comments
Co-authored-by: Ubuntu <wy@linux-v100.aidmrjtolptuzevavgwhrapqcd.jx.internal.cloudapp.net>
2022-03-04 18:28:51 -08:00
Ella Charlaix
fde847473b
Add min max moving average calibration method ( #10753 )
...
* Add min max moving average calibration method
* Modify the calibration extra options dictionary creation
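The commit message does not spell out the update rule, so the following is only a sketch of one common way to smooth min/max calibration ranges across batches. It assumes an exponential moving average; the class name `MovingAverageMinMax` and the `averaging_constant` parameter are hypothetical and may not match the actual implementation.

```python
# Illustrative only: the update rule and `averaging_constant` are assumptions,
# not taken from the commit.

class MovingAverageMinMax:
    """Track a smoothed (min, max) range over successive calibration batches."""

    def __init__(self, averaging_constant: float = 0.01):
        self.averaging_constant = averaging_constant
        self.min = None
        self.max = None

    def update(self, batch):
        lo, hi = min(batch), max(batch)
        if self.min is None:
            # The first batch initializes the range directly.
            self.min, self.max = lo, hi
        else:
            # Move each bound a fraction of the way toward the new batch's bound.
            self.min += self.averaging_constant * (lo - self.min)
            self.max += self.averaging_constant * (hi - self.max)
        return self.min, self.max


tracker = MovingAverageMinMax(averaging_constant=0.5)
tracker.update([0.0, 1.0])
print(tracker.update([-2.0, 3.0]))  # (-1.0, 2.0)
```

Compared with a plain running min/max, this kind of smoothing makes the range less sensitive to a single outlier batch.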
2022-03-04 14:55:31 -08:00
Maxiwell
43ff27c7c8
ppc64le: optimizing the MlasQuantizeLinear() with VSX ( #10644 )
...
This code is valid only when -mcpu targets POWER9 or later.
A POWER8-compatible version was created as well, but it
was not tuned for performance.
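For reference, the scalar operation that a vectorized quantize-linear routine computes can be sketched as follows. This is a plain-Python illustration assuming uint8 output and ONNX QuantizeLinear semantics (divide by scale, round to nearest even, add the zero point, saturate); the function name `quantize_linear` is hypothetical and this is not the VSX code from the commit.

```python
# Scalar reference sketch; assumes uint8 output range by default.

def quantize_linear(values, scale, zero_point, qmin=0, qmax=255):
    """Quantize floats to integers in [qmin, qmax]."""
    out = []
    for x in values:
        # Python's round() rounds halves to the nearest even integer.
        q = round(x / scale) + zero_point
        # Saturate to the representable range.
        out.append(max(qmin, min(qmax, q)))
    return out


print(quantize_linear([0.0, 0.5, 1.5, 1000.0, -1000.0], scale=1.0, zero_point=128))
# [128, 128, 130, 255, 0]
```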
2022-03-04 14:54:56 -08:00
Tianlei Wu
379b3cdef6
T5 to ONNX conversion script ( #10766 )
...
* T5 onnx conversion script
2022-03-04 14:42:04 -08:00
Olivia Jain
12eb660415
Compare TRT vs ORT-TRT Accurately ( #10565 )
...
* get inputs independently for trtexec
* track one process only
* remove engine and profile files
* change time to commit time
* add runtime option for io binding
* move to commit date
* fixes
* add option for graph optimization
* cleanup docker script
* include remaining changes
* choose graph optimization option
* add space in option
2022-03-04 10:14:18 -08:00
dependabot[bot]
e3c85d4262
Bump numpy
...
Bumps [numpy](https://github.com/numpy/numpy ) from 1.19.5 to 1.21.0.
- [Release notes](https://github.com/numpy/numpy/releases )
- [Changelog](https://github.com/numpy/numpy/blob/main/doc/HOWTO_RELEASE.rst.txt )
- [Commits](https://github.com/numpy/numpy/compare/v1.19.5...v1.21.0 )
---
updated-dependencies:
- dependency-name: numpy
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
2022-03-04 09:51:32 -08:00
dependabot[bot]
b780a3784e
Bump numpy in /tools/ci_build/github/linux/docker/scripts/training
...
Bumps [numpy](https://github.com/numpy/numpy ) from 1.19.5 to 1.21.0.
- [Release notes](https://github.com/numpy/numpy/releases )
- [Changelog](https://github.com/numpy/numpy/blob/main/doc/HOWTO_RELEASE.rst.txt )
- [Commits](https://github.com/numpy/numpy/compare/v1.19.5...v1.21.0 )
---
updated-dependencies:
- dependency-name: numpy
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
2022-03-04 09:38:38 -08:00
dependabot[bot]
0b0e8ccf92
Bump numpy
...
Bumps [numpy](https://github.com/numpy/numpy ) from 1.19.5 to 1.21.0.
- [Release notes](https://github.com/numpy/numpy/releases )
- [Changelog](https://github.com/numpy/numpy/blob/main/doc/HOWTO_RELEASE.rst.txt )
- [Commits](https://github.com/numpy/numpy/compare/v1.19.5...v1.21.0 )
---
updated-dependencies:
- dependency-name: numpy
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
2022-03-04 09:34:58 -08:00
Changming Sun
283d0c47b4
Update our absl cmake files ( #10762 )
2022-03-04 09:28:04 -08:00
zhangyaobit
4c88fa5971
Add micro-benchmark for FastGelu ( #10744 )
...
* Add micro-benchmark for FastGelu
* Delete the bert-base case, as it is very similar to the bert-large one.
* Add argument parsing and more user-friendly provider type assertion.
2022-03-04 08:51:15 -08:00
Valery Chernov
46d0b20ac2
upstream TVM. small code cleaning ( #10515 )
...
Co-authored-by: Valery Chernov <valery.chernov@deelvin.com>
2022-03-04 12:15:29 +01:00
Edward Chen
395a7242d6
[iOS packaging] Minor updates. ( #10755 )
...
* Change storage container, simplify build definition parameters.
* Remove explicit version from Objective-C docs.
* Increase timeout.
* Use real storage account.
* Get static website URL with az cli.
2022-03-04 16:02:53 +10:00
Scott McKay
e337f5faf3
Enable QDQ cleanup and NHWC optimizers in an extended minimal build. ( #10729 )
...
* Enable QDQ cleanup and NHWC optimizers in an extended minimal build.
2022-03-04 15:45:42 +10:00
Guoyu Wang
7aa706854f
Pipeline changes to build full ORT package for Android ( #10654 )
...
* Add android package build settings for full build
Co-authored-by: gwang0000 <62914304+gwang0000@users.noreply.github.com>
Co-authored-by: Scott McKay <skottmckay@gmail.com>
Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>
2022-03-04 15:35:54 +10:00
Scott McKay
6072c6b65e
Simplify QLinearConv registration so type reduction works with it. ( #10747 )
...
* Simplify QLinearConv registration so type reduction works with it.
* Update QLinearMatMul registration to be a standard typed registration
2022-03-04 14:06:04 +10:00
Abhishek Kulkarni
c2c85dd6b1
Add an option to export ONNX graphs in ORTModule tests ( #10579 )
...
Co-authored-by: Abhishek Kulkarni <abkulkarni@microsoft.com>
2022-03-03 16:56:19 -08:00
Yulong Wang
745fa5885f
optimize web assembly build flags for multi-thread ( #10759 )
2022-03-03 16:44:14 -08:00
Edward Chen
c8ec7782bd
Fix unused variable warning, move variable definitions closer to usages. ( #10757 )
2022-03-04 09:18:33 +10:00
Olivia Jain
ed87e1b721
Change axis to 0D in cumsum tests. ( #10715 )
...
* changing axis to 0
* if def for openvino
* removing extra header
* include changes
* pass in 0D scalar
* Add comment explaining change.
2022-03-03 10:44:46 -08:00