Jingqiao Fu
f4fd67cc2c
Revert "add load from buffer ( #10162 )" ( #10590 )
...
This reverts commit 5cd57bb726 .
2022-03-08 13:35:23 -08:00
dependabot[bot]
7e04dccca7
Bump numpy in /tools/ci_build/github/linux/docker/scripts ( #10385 )
...
Bumps [numpy](https://github.com/numpy/numpy ) from 1.16.6 to 1.21.0.
- [Release notes](https://github.com/numpy/numpy/releases )
- [Changelog](https://github.com/numpy/numpy/blob/main/doc/HOWTO_RELEASE.rst.txt )
- [Commits](https://github.com/numpy/numpy/compare/v1.16.6...v1.21.0 )
---
updated-dependencies:
- dependency-name: numpy
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-03-08 11:02:36 -08:00
Sunghoon
68c8f5a1ef
Change a pipeline vmImage from windows-latest to windows-2019 ( #10804 )
2022-03-08 10:49:59 -08:00
Yufeng Li
33c6819196
add qdq support of Sigmoid ( #10800 )
2022-03-08 10:29:15 -08:00
Changming Sun
6260733533
Fix eager mode pipeline ( #10802 )
...
It was still using python 3.6
2022-03-08 09:26:20 -08:00
Hariharan Seshadri
a9d9c6b486
Register CPU, CUDA and ROCM opset-16 kernels for some operators ( #10643 )
2022-03-08 09:18:39 -08:00
Changming Sun
ce07dc30fd
Change how we apply patches to absl ( #10799 )
2022-03-08 02:03:06 -08:00
George Wu
1e4a4bfe58
update onnx-tensorrt reference. ( #10795 )
2022-03-07 21:45:46 -08:00
liqun Fu
da885a72e8
update with onnx 1.11 release ( #10441 )
2022-03-07 21:10:55 -08:00
Yulong Wang
80917342b7
[js] upgrade mocha@8.2.1 to 9.2.1 ( #10793 )
2022-03-07 20:40:24 -08:00
dependabot[bot]
4d943c9bd3
Bump numpy from 1.16.6 to 1.21.0 in /tools/ci_build/github/linux/docker/scripts/manylinux ( #10387 )
...
* Bump numpy in /tools/ci_build/github/linux/docker/scripts/manylinux
2022-03-07 20:39:49 -08:00
PeixuanZuo
c07a27a008
[FIX] delete python3.6 from AMD python package docker image builder ( #10790 )
...
* [UPDATE] delete python3.6 to cooperate numpy==1.21.0
* [UPDATE] delete python3.6 to cooperate numpy==1.21.0
2022-03-07 18:21:43 -08:00
Vincent Wang
4a38f9e31d
enable strided tensor for training only ( #10748 )
2022-03-08 08:31:28 +08:00
zhangyaobit
b7f00b9682
Refactor the common code per operator into an abstract base class. ( #10785 )
2022-03-07 13:15:49 -08:00
Daigo HIROOKA
a08036da09
correct symbolic name of GridSample operation ( #10782 )
...
Function name needs to match PyTorch ATen op name, which is `aten::grid_sampler`.
2022-03-07 12:49:12 -08:00
dependabot[bot]
3e54f94bb0
Bump karma from 6.3.14 to 6.3.16 in /js/web
...
Bumps [karma](https://github.com/karma-runner/karma ) from 6.3.14 to 6.3.16.
- [Release notes](https://github.com/karma-runner/karma/releases )
- [Changelog](https://github.com/karma-runner/karma/blob/master/CHANGELOG.md )
- [Commits](https://github.com/karma-runner/karma/compare/v6.3.14...v6.3.16 )
---
updated-dependencies:
- dependency-name: karma
dependency-type: direct:development
...
Signed-off-by: dependabot[bot] <support@github.com>
2022-03-07 11:47:23 -08:00
Yulong Wang
25fdcfbd14
[js/web] allow multiple inference session creating concurrently ( #10784 )
...
* test case
* bugfix
* fix
* support multi session init
2022-03-07 11:35:06 -08:00
RandySheriffH
a4b5fa334a
Add type and shape information to profiled numbers ( #10773 )
...
* add func to collect type shape
* reformat
* refactor perf view
* remove obsolete
2022-03-07 10:17:58 -08:00
Changming Sun
d8bf9a479b
Remove python 3.6 from training pipelines ( #10780 )
...
Because the numpy we use doesn't support python 3.6. And inference pipelines already removed python 3.6.
2022-03-07 09:57:24 -08:00
Hariharan Seshadri
9d30262422
Fix AMD training pipeline ( #10788 )
2022-03-07 08:53:08 -08:00
Chen Fu
50a6f095cd
Symmetric QGEMM kernel for ARMv8 A55 chip ( #10754 )
...
ARM a55 micro-architecture (with dot product instructions), similar to a53, is widely used as little cores in big.Little configurations. A55 has a narrower memory load/store hardware, where a 128b load instruction would block the pipeline for 2 whole cycles, during which no other instructions can be executed. On the other hand, a 64b load instruction can be duo issued with many other instructions.
This change adds a Symmetric QGEMM kernel for a55 micro-architecture, where we replace
ldr q4,[x1],#16
with
ldr d4,[x1],#8
ldr x11,[x1],#8
ins v4.d[1],x11
so that we can try to hide the memory load cycles behind computing cycles in the kernel.
Co-authored-by: Chen Fu <fuchen@microsoft.com>
2022-03-07 08:41:13 -08:00
PeixuanZuo
55af7a96a7
update the amd ci pipeline ( #10723 )
...
* [TEST] test to get amd pipeline information
* [FIX] lower the threshold
* [UPDATE] add retry task
* [UPDATE] add retry task
* [ERROR] error to occur retry
* [FIX] error
* [UPDATE] update retryCountOnTaskFailure to 1 time
* [UPDATE] add showmeminfo
2022-03-07 18:39:42 +08:00
Fei Hu
60acfd3dd8
Support CUDA Graph in the CUDA EP ( #9978 )
2022-03-06 20:47:31 -08:00
Tianlei Wu
0e335aba37
Update BeamSearch operator spec to support t5 ( #10777 )
...
* change BeamSearch op to support encoder decoder model
* check model_type and decoder attribute
* fix
* update comments
* warn shape inference issue with onnx v1.11 or T5
* skip parity test when tempature != 1.0
* fix build
2022-03-04 21:52:45 -08:00
George Nash
6be5185088
Update dnnl Add, Mul, Sub, Div ops to handle scalar values ( #10756 )
...
* Update dnnl Add, Mul, Sub, Div ops to handle scalar values
Signed-off-by: George Nash <george.nash@intel.com>
* Add additional scalar support for dnnl execution provider
This will add scalar support for:
Eltwise operators: Abs, Elu, Exp, LeakyRelu, Log, Relu, Round,
Sigmoid, Softplus, Sqrt, and Tanh
Gelu operators: BiasGelu, FastGelu, and Gelu
Softmax operator
Signed-off-by: George Nash <george.nash@intel.com>
2022-03-04 19:28:25 -08:00
Ye Wang
259ade2557
Add ability to modify num_hidden_layers from benchmark script ( #10760 )
...
* add ability to modify num_hidden_layers from benchmark script
* comment
* Revert "comment"
This reverts commit 28794b0e4f86506dcc937738894fcef97fc84e48.
* Revert "add ability to modify num_hidden_layers from benchmark script"
This reverts commit 96f36ed7f751721bcf4e3ab8748a715f19a4e044.
* review coments
Co-authored-by: Ubuntu <wy@linux-v100.aidmrjtolptuzevavgwhrapqcd.jx.internal.cloudapp.net>
2022-03-04 18:28:51 -08:00
Ella Charlaix
fde847473b
Add min max moving average calibration method ( #10753 )
...
* Add min max moving average calibration method
* Modify the calibration extra options dictionnary creation
2022-03-04 14:55:31 -08:00
Maxiwell
43ff27c7c8
ppc64le: optimizing the MlasQuantizeLinear() with VSX ( #10644 )
...
This code is valid only when -mcpu is set to utilize POWER9 technology
or above. A compatible code for POWER8 was created as well, but it
was not tuned for performance.
2022-03-04 14:54:56 -08:00
Tianlei Wu
379b3cdef6
T5 to ONNX conversion script ( #10766 )
...
* T5 onnx conversion script
2022-03-04 14:42:04 -08:00
Olivia Jain
12eb660415
Compare TRT vs ORT-TRT Accurately ( #10565 )
...
* get inputs independently for trtexec
* track one process only
* remove engine and profile files
* change time to commit time
* add runtime option for io binding
* move to commit date
* fixes
* add option for graph optimization
* cleanup docker script
* include remaining changes
* choose graph optimization option
* add space in option
2022-03-04 10:14:18 -08:00
dependabot[bot]
e3c85d4262
Bump numpy
...
Bumps [numpy](https://github.com/numpy/numpy ) from 1.19.5 to 1.21.0.
- [Release notes](https://github.com/numpy/numpy/releases )
- [Changelog](https://github.com/numpy/numpy/blob/main/doc/HOWTO_RELEASE.rst.txt )
- [Commits](https://github.com/numpy/numpy/compare/v1.19.5...v1.21.0 )
---
updated-dependencies:
- dependency-name: numpy
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
2022-03-04 09:51:32 -08:00
dependabot[bot]
b780a3784e
Bump numpy in /tools/ci_build/github/linux/docker/scripts/training
...
Bumps [numpy](https://github.com/numpy/numpy ) from 1.19.5 to 1.21.0.
- [Release notes](https://github.com/numpy/numpy/releases )
- [Changelog](https://github.com/numpy/numpy/blob/main/doc/HOWTO_RELEASE.rst.txt )
- [Commits](https://github.com/numpy/numpy/compare/v1.19.5...v1.21.0 )
---
updated-dependencies:
- dependency-name: numpy
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
2022-03-04 09:38:38 -08:00
dependabot[bot]
0b0e8ccf92
Bump numpy
...
Bumps [numpy](https://github.com/numpy/numpy ) from 1.19.5 to 1.21.0.
- [Release notes](https://github.com/numpy/numpy/releases )
- [Changelog](https://github.com/numpy/numpy/blob/main/doc/HOWTO_RELEASE.rst.txt )
- [Commits](https://github.com/numpy/numpy/compare/v1.19.5...v1.21.0 )
---
updated-dependencies:
- dependency-name: numpy
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
2022-03-04 09:34:58 -08:00
Changming Sun
283d0c47b4
Update our absl cmake files ( #10762 )
2022-03-04 09:28:04 -08:00
zhangyaobit
4c88fa5971
Add micro-benchmark for FastGelu ( #10744 )
...
* Add micro-benchmark for FastGelu
* Delete the bert-base case, as it is very similar to the bert-large one.
* Add argument parsing and more user-friendly provider type assertion.
2022-03-04 08:51:15 -08:00
Valery Chernov
46d0b20ac2
upstream TVM. small code cleaning ( #10515 )
...
Co-authored-by: Valery Chernov <valery.chernov@deelvin.com>
2022-03-04 12:15:29 +01:00
Edward Chen
395a7242d6
[iOS packaging] Minor updates. ( #10755 )
...
* Change storage container, simplify build definition parameters.
* Remove explicit version from Objective-C docs.
* Increase timeout.
* Use real storage account.
* Get static website URL with az cli.
2022-03-04 16:02:53 +10:00
Scott McKay
e337f5faf3
Enable QDQ cleanup and NHWC optimizers in an extended minimal build. ( #10729 )
...
* Enable QDQ cleanup and NHWC optimizers in an extended minimal build.
2022-03-04 15:45:42 +10:00
Guoyu Wang
7aa706854f
Pipeline changes to build full ORT package for Android ( #10654 )
...
* Add android package build settings for full build
Co-authored-by: gwang0000 <62914304+gwang0000@users.noreply.github.com>
Co-authored-by: Scott McKay <skottmckay@gmail.com>
Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>
2022-03-04 15:35:54 +10:00
Scott McKay
6072c6b65e
Simplify QLinearConv registration so type reduction works with it. ( #10747 )
...
* Simplify QLinearConv registration so type reduction works with it.
* Update QLinearMatMul registration to be a standard typed registration
2022-03-04 14:06:04 +10:00
Abhishek Kulkarni
c2c85dd6b1
Add an option to export ONNX graphs in ORTModule tests ( #10579 )
...
Co-authored-by: Abhishek Kulkarni <abkulkarni@microsoft.com>
2022-03-03 16:56:19 -08:00
Yulong Wang
745fa5885f
optimize web assembly build flags for multi-thread ( #10759 )
2022-03-03 16:44:14 -08:00
Edward Chen
c8ec7782bd
Fix unused variable warning, move variable definitions closer to usages. ( #10757 )
2022-03-04 09:18:33 +10:00
Olivia Jain
ed87e1b721
Change axis to 0D in cumsum tests. ( #10715 )
...
* changing axis to 0
* if def for openvino
* removing extra header
* include changes
* pass in 0D scalar
* Add comment explaining change.
2022-03-03 10:44:46 -08:00
Changming Sun
b3e96d6195
A new pipeline to replace the existing WindowsAI packaging pipeline ( #10646 )
2022-03-03 08:56:49 -08:00
Hubert Lu
fe8d867efa
Optimize BinaryElementWise and BiasGeluGrad kernels for AMD ( #10594 )
...
* Optimize elementwise and biasgelugrad kernels for AMD
* Clean up for BiasGeluGradDxKernel
2022-03-03 08:07:15 -08:00
cloudhan
4c20f6863d
Fix build with gcc 7.5 ( #10567 )
2022-03-03 18:29:02 +08:00
Fei Hu
75160d6779
Add the missing status return in beam search ( #10738 )
2022-03-03 01:24:44 -08:00
Rachel Guo
a9dc50ba8b
Add option to force QDQIsInt8Allowed to return true when exporting to ORT format ( #10719 )
...
* wip
* save
* minor update
* fix
* fix
* Revert "fix"
This reverts commit a76f364b2d .
* revert
* revert
* revert submodule removal
* address pr comments
* minor fix
* address cr comments
* fix format
Co-authored-by: rachguo <rachguo@rachguos-Mini.attlocal.net>
2022-03-02 23:26:14 -08:00
Ye Wang
44d08d80a0
Add restriction to first usage in allocation planner ( #10724 )
...
* Add restriction to first usage in allocation planner
* change phrases
* add UT
Co-authored-by: Ubuntu <wy@linux-v100.aidmrjtolptuzevavgwhrapqcd.jx.internal.cloudapp.net>
2022-03-02 22:03:50 -08:00