6769 commits

Author SHA1 Message Date
Yulong Wang
69aaf03345
allow catch all exceptions (#11498) 2022-05-20 03:35:47 -07:00
PeixuanZuo
a67994316a
Update rocm ci to ROCm5.1.1 + torch1.10.0
* [UPDATE] update amd ci pipeline 2 rocm5.1.1

* [FIX] json format error

* [ERROR] disable unit tests

* [FIX] ucx error

* [FIX] cmake version

* [FIX] units test
2022-05-20 11:07:21 +08:00
Tang, Cheng
abecb56832
fix build break (#11492)
Co-authored-by: Cheng Tang <chenta@microsoft.com@orttrainingdev9.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>
2022-05-19 16:10:45 -07:00
Vincent Wang
436c4f9b79
Add BFloat16 (bf16) support for ATen (#11546)
Co-authored-by: Vincent Wang <weicwang@microsoft.com>
2022-05-19 10:04:08 -04:00
Adrian Lizarraga
e45197fa8c
[trt-ep-perf] Fix upload time of EP perf data (#11531)
Fix the post.py script to use the actual "upload time" in ISO format instead of the day/month/year of the commit date.
2022-05-18 15:36:21 -07:00
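A minimal sketch of the distinction the #11531 message draws between a coarse commit date and an ISO-format upload timestamp. The function names `upload_time_iso` and `commit_day_only`, and the exact day/month/year layout, are illustrative assumptions; they are not taken from the actual post.py script.

```python
from datetime import datetime, timezone


def upload_time_iso(now=None):
    """Return the upload timestamp as an ISO 8601 string.

    `now` is injectable so the function can be tested; by default the
    current UTC time is used. (Hypothetical helper, not from post.py.)
    """
    if now is None:
        now = datetime.now(timezone.utc)
    return now.isoformat()


def commit_day_only(commit_time):
    """Format only the calendar day of a commit.

    This mimics the lossy day/month/year value the commit message says
    was replaced; the exact layout here is an assumption.
    """
    return commit_time.strftime("%d/%m/%Y")
```

The ISO string keeps the time of day and the UTC offset, so two uploads on the same day remain distinguishable, which a date-only value cannot offer.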
Valery Chernov
8092d9f9a2
[TVM EP] Support inference by shared library created by TVM (#11389)
* add so_folder option to TVM EP options. add TvmSoEP class and update TVM EP factory

* compilation from so_folder was implemented

* update TVMCompiler for default pipeline and compilation from shared lib

* filter excess so-file in so_folder

* clean Compile method and vm conditions

* implementation of TVMSoCompile on native side instead of python API

* cpplint fixes

* some fixes after review

* more cpplint fixes

* more fixes after review

* align TVMso EP with new API for compilation from #10632

* small fixes for cpplint

Co-authored-by: Valery Chernov <valery.chernov@deelvin.com>
2022-05-18 14:50:54 +02:00
Adrian Lizarraga
48efeca66c
[trt-ep-perf] Fix bug that suppresses latency gain reporting (#11321)
Fix bug that prevents EP perf script from reporting latency gain for TensorRT/CUDA
2022-05-17 14:00:52 -07:00
Edward Chen
782f9e394d
[CoreML EP] Fix condition in PRelu op supported check. (#11543) 2022-05-17 09:03:24 -07:00
Ryan Hill
deef214772
Update gather to use multiple threads (#11524) 2022-05-16 19:31:14 -07:00
Edward Chen
5eaa893936
[CoreML EP] Add support for PRelu (#11474) 2022-05-16 16:30:09 -07:00
Justin Chu
d9c9adb78b
Add python static type checking in CI checks (#11518)
- Enable pyright and pylint (https://github.com/microsoft/pyright) in CI
- Enable pyright, pylint and bandit by default in VS code

Pylint has some good style checks. pyright is Microsoft's static type checker.
2022-05-16 13:26:56 -07:00
PeixuanZuo
c556f5f22f
Add AMD python package ROCm5.1.1+torch1.11 (#11516)
* [FIX] fix name error

* [ADD] add rocm5.1.1 python package

* [ADD] torch1.10.0 rocm requirements

* [UPDATE] update docker Repository name
2022-05-16 08:14:11 +08:00
Sheil Kumar
6255194659
All LearningModelSessions created from a common LearningModelDevice should share the same thread pool (#11457)
* Share thread pools between devices

* make tests reuse device

* Change cpu thread pool options for dml sessions to use 1 thread with no spinning

* fix test failure

* Update missing type constraints for dft

* Add comment and rename inference session parameter

* default missing causing inconsistent test behavior

Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2022-05-13 11:12:43 -07:00
Yi Zhang
5709ed2e16
Fix shellcheck warning (#11489)
* fix shellcheck warning

* Update java_linux_final_test.sh
2022-05-13 15:36:59 +08:00
RajalakshmiSR
b14c1fd479
POWER: Optimize MlasQLinearAddKernelHelper() (#11454)
This patch uses vector intrinsics to optimize the MlasQLinearAddKernelHelper
function for POWER processors.

Co-authored-by: Rajalakshmi Srinivasaraghavan <rajis@linux.ibm.com>
2022-05-12 18:38:45 -07:00
George Wu
09590f013a
fix windows ci debug build break (#11495)
* update msc version check

* update comment

* typo

* whitespace
2022-05-12 16:54:00 -07:00
Rachel Guo
4aef7e3aab
[CoreML EP] Add DepthToSpace op support (#11468)
* initial impl of depthtospace coreml support

* fix build

* address pr comments

* minor update

* minor pr comments

Co-authored-by: rachguo <rachguo@rachguos-Mini.attlocal.net>
Co-authored-by: rachguo <rachguo@rachguos-Mac-mini.local>
2022-05-12 13:48:51 -07:00
Yi Zhang
a3f05da338
Revert "[TVM EP] update set input to remove excess copying inside TVM (#11247)" (#11504)
This reverts commit 5ae461ec0a.
2022-05-13 02:27:36 +08:00
Tianlei Wu
ece1274ffa
revert safeint version (#11500) 2022-05-12 11:24:43 -07:00
Justin Chu
f94b25933a
ci(cpplint): Ignore runtime/references warnings (#11499)
Allow non-const references 6f85d3e5c8/docs/Coding_Conventions_and_Standards.md (L11-L12)
2022-05-12 07:51:45 -07:00
Justin Chu
6f85d3e5c8
fix(onnx_export): Extract arg value from torch Value (#11471)
**Description**: Extract arg value from torch Value

**Motivation and Context**

Input to gelu is `torch._C.Value` type values. This caused the `if approximate == "none"` check to always fail, preventing the optimized `com.microsoft::Gelu` op from being used.
2022-05-11 11:36:43 -07:00
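The failure mode described in #11471 can be sketched without PyTorch. `FakeValue` below is a hypothetical stand-in for a wrapped graph value such as `torch._C.Value`, and `extract_arg` is an illustrative helper; neither is the exporter's real API. The point is only that a wrapper object never compares equal to a plain string until the underlying constant is extracted.

```python
class FakeValue:
    """Hypothetical stand-in for a graph value that wraps a constant."""

    def __init__(self, constant):
        self._constant = constant

    def constant(self):
        # Return the wrapped Python constant.
        return self._constant


def extract_arg(arg):
    """Unwrap a FakeValue to its constant; pass plain values through."""
    if isinstance(arg, FakeValue):
        return arg.constant()
    return arg


def uses_optimized_gelu(approximate):
    """Decide whether the optimized op applies, after unwrapping."""
    return extract_arg(approximate) == "none"
```

Comparing `FakeValue("none") == "none"` directly is always false, which mirrors the check that the commit message says always failed; unwrapping first makes the comparison meaningful.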
Tianlei Wu
f5473596fa
Change longformer default kernel (#11470)
* change default to compact memory kernel
* Remove a cuda stream synchronize that is not needed
* Update longformer benchmark tool
2022-05-11 10:54:59 -07:00
Changming Sun
48ae27d578
Update protobuf-java to 3.20.1 (#10420) 2022-05-11 07:52:12 -07:00
Changming Sun
207ad7eef9
Remove spdlog from cgmanifest.json 2022-05-10 22:02:21 -07:00
Changming Sun
027fc1d391
Completely delete ORT server 2022-05-10 22:02:21 -07:00
Changming Sun
903743e823
Delete unused TRT docker files (#11486)
* Delete unused TRT docker files

* revert tools/ci_build/github/linux/docker/Dockerfile.manylinux2014_cuda11_4_tensorrt8_0
2022-05-10 22:00:53 -07:00
Changming Sun
0ac2e6e546
Update install-entrypoint.sh: add version lock for NCCL (#11475) 2022-05-10 15:37:55 -07:00
Yulong Wang
3437967e63
[js/rn] fix CI packaging for react native E2E test (#11463)
* [js/rn] fix ORTRN packaging in CI

* fix env var setting
2022-05-09 18:09:52 -07:00
Edward Chen
738d9b153c
Consolidate several types into onnxruntime::ArgType. (#11430) 2022-05-09 14:44:28 -07:00
Rachel Guo
288892335e
[NNAPI EP] Add support for DepthToSpace Op (#11354)
* initial implementation for support nnapi depthtospace

* modify depthtospace output tensor shape and enable test pass

* minor update

* minor update

* modify input output layout order and hack nnapi instance to use nchw flag for optest

* address pr comments

* add depthtospace to layout logic

* format length and revert UT log level

* add nchw and android feature level check in opsupportchecker

* minor fix

* update

* update

* fix

* minor update
2022-05-09 11:38:12 -07:00
Changming Sun
3b16fb2000
Delete java-test-final-jar-step.yml (#8894) 2022-05-09 11:25:03 -07:00
Justin Chu
c541063245
Format coding conventions documentation (#11405)
Add proper formatting to code blocks to make the doc more readable.

- Wrap code blocks with `
- Fix typos
2022-05-09 10:19:15 -07:00
symphonylyh
c2de603c10
Contrib ops for TRT plugin: Disentangled Attention Plugin (#11287)
* Add disentangled attention TRT plugin as contrib op

* update plugin name & remove null character

* update onnx-tensorrt submodule with my beta version

* use suggested plugin name & simpler shape propagation

* update onnx-tensorrt gitsubmodule to temporary fork

* update onnx-tensorrt to temporary commit

* redirect submodule back to latest 8.2-GA release of onnx-tensorrt repo

Co-authored-by: HHH-ComputeLab <haohangh@nvidia.com>
2022-05-08 15:25:25 -07:00
George Wu
70e501866b
Revert "[TensorRT EP] reduce CI pipelines test execution time (#11440)" (#11460)
This reverts commit 8d6ade9e08.
2022-05-07 11:41:11 -07:00
Dwayne Robinson
69b2fab810
Update DirectML from 1.8.0 to 1.8.2 (#11459) 2022-05-06 17:52:52 -07:00
RandySheriffH
8467af832f
Fix reduced pipeline by excluding test case standalone op (#11458)
* exclude reduce build from standalone op test

* exclude test from reduced op build
2022-05-06 16:19:49 -07:00
Brian Popow
3624f7c5a5
Update samples (#11420) 2022-05-06 13:32:16 -07:00
Hubert Lu
2a90922f01
Using vectorized loads (float2) for fp16 to improve performance (#11390) 2022-05-05 14:19:21 -07:00
Changming Sun
d2ae0f49b2
Make Graph::InlineFunction be able to process initializers (#11443) 2022-05-05 12:30:29 -07:00
George Wu
8d6ade9e08
[TensorRT EP] reduce CI pipelines test execution time (#11440)
* add global builder placeholder to improve CI test time for TRT EP

* fix build error

* rename var, put in unnamed namespace

* fix build error

* fix
2022-05-05 09:25:54 -07:00
Tang, Cheng
3f3c5fcd68
Unify the Compile API for mobile build and normal build (#10632)
* use the lightweight compile api as default; use dnnl ep for testing

* apply to tensorrt ep

* fix the missing files

* fix build

* fix the copy issue on linux

* migrate migraphx and openvino ep

* fix openvino build break

* fix linux build

* fix unused parameter

* fix coreml build

* use graph view's filtered initializers

* fix openvino break

* fix tvm compile api

* fix tvm / rknpu / vitisai ep build

* add IsInitializedTensor in graph_viewer; fix nuphar build

* use serializer directly as tvm ep is still static lib

* fix the type mismatch

* fix the type mismatch

* fix merge conflict

* add a comment

* fix minimal build

* fix the DML EP's legacy approach

* save type/shape in dnnl IR

* fix linux break

* fix tvm failure

* dnnl ep: move initializer referenced out of dnnl subgraph

* Revert "add IsInitializedTensor in graph_viewer; fix nuphar build"

This reverts commit 1cc3c7f08c16fee4fe3309a67209eb769d479587.

* add IsInitializedTensor to graph viewer

* add the legacy code for nuphar build to temporarily make nuphar build work

* ignore internal test for nuphar

* remove the out of date tests

* keep the legacy API in EP for a while

* turn serializer into a static function

* update comments

* fix tvm build

* Update include/onnxruntime/core/framework/execution_provider.h

Co-authored-by: Pranav Sharma <prs@microsoft.com>

* Update include/onnxruntime/core/framework/execution_provider.h

Co-authored-by: Pranav Sharma <prs@microsoft.com>

* Update onnxruntime/core/framework/execution_provider.cc

Co-authored-by: Pranav Sharma <prs@microsoft.com>

* update comments; add warning message for legacy compile call

* add a flag to control out of scope arg in serialization

* fix trt build; improve the test

* resolve merge errors

* fix a typo

Co-authored-by: Cheng Tang <chenta@microsoft.com>
Co-authored-by: Cheng Tang <chenta@microsoft.com@orttrainingdev9.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>
Co-authored-by: Pranav Sharma <prs@microsoft.com>
2022-05-05 08:30:07 -07:00
cloudhan
eca4cbc419
Avoid using word 'crazy' (#11396)
Avoid using word 'crazy' and simplify the comment of else branch
2022-05-05 23:07:50 +08:00
Valery Chernov
5ae461ec0a
[TVM EP] update set input to remove excess copying inside TVM (#11247)
* update TVM

* small fixes

* update TVM with new set_input and NDArray API

* use set_input instead of set_one_input

Co-authored-by: Valery Chernov <valery.chernov@deelvin.com>
2022-05-05 14:25:02 +02:00
Vincent Wang
084165c748
Change MinGrad/MaxGrad to Use Distributed Logic (#11388)
* change min max grad

* resolve comments
2022-05-05 11:49:40 +08:00
Yulong Wang
860ba8820b
[js/rn] fix ORTRN for iOS (#11425)
* align ios version with onnxruntime-mobile-c

* support 'file://' in iOS

* fix lint error
2022-05-04 13:58:55 -07:00
Changming Sun
963e1ace4e
Fix SAL annotations for custom op (#11432)
Fix SAL annotations for custom op. For example, "_In_" only applies to pointers, not integers.
2022-05-04 10:47:28 -07:00
Justin Chu
a1f9847b23
[Fix] Add the extra param to match gelu in PyTorch in the contrib symbolic function (#11318)
Description:

Add the extra param to match gelu in PyTorch in the contrib symbolic function

Motivation and Context

Why is this change required? What problem does it solve?
The symbolic function in /onnxruntime/python/tools/pytorch_export_contrib_ops.py is missing the recently added `approximate` parameter. We add this parameter and use the exporter-defined gelu if `approximate` is "tanh".
2022-05-04 10:36:38 -07:00
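For context on the `approximate` parameter mentioned in #11318, here is a small numerical sketch of the two GELU variants it selects between. This is plain Python using the standard formulas, not the exporter's symbolic function; the function names are illustrative assumptions.

```python
import math


def gelu_exact(x):
    """Exact GELU: x times the standard normal CDF, written with erf."""
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


def gelu_tanh(x):
    """Tanh-based approximation of GELU."""
    inner = math.sqrt(2.0 / math.pi) * (x + 0.044715 * x ** 3)
    return 0.5 * x * (1.0 + math.tanh(inner))


def gelu(x, approximate="none"):
    """Dispatch on `approximate`, accepting "none" or "tanh"."""
    if approximate == "none":
        return gelu_exact(x)
    if approximate == "tanh":
        return gelu_tanh(x)
    raise ValueError(f"unsupported approximate value: {approximate!r}")
```

The two variants agree closely but not exactly, which is why an export path needs to know which one the model asked for before picking an operator.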
Hariharan Seshadri
1aad59fa49
Increase timeout for IOS packaging pipeline (#11431) 2022-05-04 10:00:41 -07:00
Changming Sun
57b51e72d7
Linux CI: uninstall onnx before installing it (#11428) 2022-05-04 08:49:37 -07:00
Yulong Wang
af21a04977
[js] upgrade async@3.2.3 /js/ (#11421)
* [js] upgrade async@3.2.3 /js/

* format code
2022-05-03 23:41:36 -07:00