Commit graph

1352 commits

Author SHA1 Message Date
Tianlei Wu
b2c1937523
Add EmbedLayerNormalization and SkipLayerNormalization ops for bert optimization (#2012)
* Add Embed Layer Normalization and Skip Layer Normalization ops for bert optimization.

* add float16 test for skiplayernorm

* Add test for EmbedLayerNormalization op

* fix cpu build error

* fix build warning

* update HasCudaEnvironment function

* handle cuda error
2019-10-07 17:29:43 -07:00
Changming Sun
8f7657fa32
Ignore some gcc warnings (#1996) 2019-10-07 16:32:34 -07:00
Pranav Sharma
ea60469af5
Support seq(tensor), implement 2 sequence ops that use the new type. (#1983)
* Mention OrtCreateSessionFromArray in C API doc

* fix seq of tensors

* changes on 9/30

* All tests passing

* Add SequenceAt op

* Fix shared_lib non_tensor_types test

* Address some PR comments

* Address PR comments

* Add support in python bindings to accept seq(tensor)

* Change data type from vector<Tensor> to TensorSeq

* Change data type from vector<Tensor> to TensorSeq

* Added some documentation

* Added missing test model

* Fix Linux build

* Fix Mac build

* Fix Mac build
2019-10-07 15:35:09 -07:00
Hector Li
00e24ae4fe
refactor Cuda Ops Sum, Max, Min, remove dup code (#1946)
refactor Cuda Ops Sum, Max, Min, remove dup code
2019-10-07 13:17:49 -07:00
Tianlei Wu
7b39f5090c
Add Attention op for multi-head self attention in BERT (#1984)
* Add Attention op for multi head self attention in BERT

* Add test cases

* Move op from kOnnxDomain to kMSDomain.
Limit test to run by CUDA provider only.

* fix test

* Add float16 test

* fix cpu build error

* handle cuda error

* get last cuda error when failed
2019-10-07 12:22:54 -07:00
Yang Chen
7d2f0c79bd Bumped up to op_ver 11 for a bunch of Nuphar Ops (#2025)
This change enabled op_ver 11 for a dozen of Nuphar Ops
2019-10-07 10:34:05 -07:00
Changming Sun
3c26ae5b6d
ThreadPool fix for roialign and CropAndResize (#2020) 2019-10-06 22:43:59 -07:00
Pranav Sharma
4cdb95e436
Resort to sequential execution if the inter op thread pool ptr is nullptr; (#2023) 2019-10-06 16:08:41 -07:00
stevenlix
544e53e24e Update TensorRT to version 6.0.1.5 (#1966)
* remove onnx-tensorrt submodule

* add new onnx-tensorrt submodule (experiment) for trt6

* update engine build for trt6

* update compile and compute for tensorrt6.0

* Update tensorrt_execution_provider.cc

* Update tensorrt_execution_provider.cc

* Update tensorrt_execution_provider.cc

* Update tensorrt_execution_provider.cc

* switch to onnx-tensorrt master for TensorRT6'

* Update tensorrt_execution_provider.cc

* Handle dynamic batch size and add memcpy in TensorRT EP

* update test cases

* Update tensorrt_execution_provider.cc

* update onnx-tensorrt submodule

* Update Dockerfile.ubuntu_tensorrt

* Update Dockerfile.ubuntu_tensorrt

* Update run_dockerbuild.sh

* Update run_dockerbuild.sh

* Update install_ubuntu.sh

* Update concat_op_test.cc

* Update tensorrt_execution_provider.cc

* Upgrade TensorRT to version 6.0.1.5

* Update onnxruntime_providers.cmake

* Update CMakeLists.txt

* Update reduction_ops_test.cc

* Update install_ubuntu.sh

* Update Dockerfile.ubuntu_tensorrt

* Update Dockerfile.tensorrt

* Update BUILD.md

* Update run_dockerbuild.sh

* Update install_ubuntu.sh

* Update onnxruntime_providers.cmake

* Update install_ubuntu.sh

* Update install_ubuntu.sh

* Update gemm_test.cc

* Update gather_op_test.cc

* Update CMakeLists.txt

* Removed submodule

* update onnx-tensorrt submodule

* Add Ubuntu18.04 build option

* Add Ubuntu18.04 build option

* Add Ubuntu18.04 build option

* Add Ubuntu18.04 build option

* Remove redundency

* Fix issue that it does not add memcopy node correctly if some nodes fall back to CUDA EP.
e.g. after partition, there's TRT_Node -> Cuda_node (with CPU memory expected), we still need to add memcpy node between them.

* update for Trt Windows build

* Update onnxruntime_providers.cmake

* Disable opset11 tests on TensorRT

* Update pad_test.cc

* Update build.py

* update scripts for ubuntu18.04

* Disable warning for Windows build
2019-10-06 10:40:53 -07:00
baowenlei
4bb6385dca
Weba/merge ngemm (#2021)
* save status: add tiling layout; add avx512 skylake cpuid info

* unit tests and matmul integer model passed on skylake, need to verify model

* save commit before update master

* fix check

* address comments
2019-10-05 12:09:22 -07:00
Xavier Dupré
0b5aac0a2e fix python setup (#2022) 2019-10-05 09:46:41 -07:00
Yang Chen
e8285a7996
Added GatherElements to Nuphar (#2016)
* Added GatherElements to Nuphar

This change added GatherElements (op_ver 11) to the Nuphar provider.

* address CR feedback

* create a utilify function for accessing index safely

* address more CR

* SafeIndex -> ClampIndex
2019-10-04 23:53:02 -07:00
Colin Versteeg
1ba76c5f74 add support for empty version and score route (#1995) 2019-10-04 22:53:11 -07:00
Changming Sun
a9e04a29b3
Ignore a test: ParallelExecutor.StatusPropagation (#2019) 2019-10-04 22:51:47 -07:00
Scott McKay
2a2e6e6641
Handle nullptr for NodeArg.Shape() (#2009) 2019-10-05 15:00:19 +10:00
Hariharan Seshadri
f528da35f2
Update ONNX to a newer commit (#2015)
* Update ONNX to a newer version

* PR comments
2019-10-04 19:41:00 -07:00
Dmitri Smirnov
f5a8a23951 Replace std::regex with re2 bc CentOS std::regex is broken (#2017) 2019-10-04 18:47:03 -07:00
daquexian
e071a1249b Android CI (#1600) 2019-10-04 17:39:51 -07:00
Colin Versteeg
bfa1b0e96e Fix logger regression (#2011)
* Fix regression in creating default logger from custom function

* fix model naming issue in tests

* fix version in addition to model name
2019-10-04 16:39:40 -07:00
shahasad
b322e072b9
added the overridableinitializers api (#1977) 2019-10-04 16:38:00 -07:00
ybrnathan
19873c70dc
Implement Cuda Kernel of Where Op (#1997)
* Implement Cuda Kernel of  Where Op

* Fix the template
2019-10-04 15:32:41 -07:00
Yufeng Li
a6bf1d0ad8
use mlaserf (#1999)
1. use MlasErf for Gelu. Eigen's erf is very slow.
2. change the ErfUpperAbsRange to 3.925 because MlasErf doesn't return 1 for 3.725
Motivation and Context
2019-10-04 15:17:26 -07:00
Scott McKay
fdbe365c37
Add BitShift operator (#1981)
* Add BitShift operator. Enable uint32 and uint64 support initially.
2019-10-05 07:48:58 +10:00
Colin Versteeg
d5d1719c1f Fix integration_tests/test_main.py to have correct exit code (#2010) 2019-10-04 14:25:28 -07:00
Changming Sun
ace0b2ca1c
CentOS CI (#1998) 2019-10-04 10:48:43 -07:00
Hariharan Seshadri
534660bf2c
Support opset-11 Range CPU kernel (#1980)
* Initial commit

* Update

* Update

* Update

* Add tests to MKLDNN exclsuion

* Update

* PR feedback

* Revert
2019-10-04 09:34:14 -07:00
Tracy Sharpe
a7414287a9
fix Conv/Add fusion with no bias (#1978) 2019-10-04 07:22:55 -07:00
Hariharan Seshadri
d80f324fd3 Add Pad test to NGraph specific exclusion list (#2006)
* Add Pad test to NGraph specific exclusion list
2019-10-04 22:49:24 +10:00
Scott McKay
e58827fa62
Add Unique operator. (#1900)
* Add Unique operator.
* Enable onnx tests. Disable one with incorrect expected output and add unit test to validate ORT behavior. Need onnx update to fix (will address that separately but don't want to block this checkin on that change).
2019-10-04 22:11:55 +10:00
Hariharan Seshadri
74517bb742
Support opset-11 GatherND CPU kernel (#1969)
* Initial commit

* Update

* Update

* Update

* Remove tests from exclusion

* Update

* Formatting

* Formatting

* Formatting

* Update

* Update

* Update

* Update
2019-10-03 20:52:58 -07:00
Dmitri Smirnov
627f853a44
Downgrade compiler to CentOS 4.8.5 (#1985)
Make onnxruntime CPU build and run on CentOS GCC 4.8.5
2019-10-03 15:40:46 -07:00
Hariharan Seshadri
931975e3fe
Add opset-11 Pad CPU kernel (#1963)
* Initial commit

* Update

* Update

* CUDA changes

* Update

* Update

* Update

* Update

* Update

* Update

* Fix build break

* Update

* Update

* Fix Ngraph build

* Update main.cc

* Update

* PR comments

* Update

* Update

* Update
2019-10-03 15:04:33 -07:00
Yang Chen
15138908e7
Yanchen/nuphar/scatter elems (#1992)
* Added Scatter and ScatterElements to Nuphar

Implemented Scatter (op_ver 9 - 10) and ScatterElements (op_ver 11)
nuphar.

Because TVM's compute is output-oriented, our current implementation
uses extern calls for simplicity.

* fixed build issue after rebase

* remove dead code

* Address CR

* removed dead code

* use GetAttrOrDefault

* Address more CR feedback

* add GetStrides to codegen/common/utils.h

* added a unit test for Bool input data
2019-10-03 14:58:10 -07:00
Changming Sun
c86d17754a
Dockerfile for CentOS CI build (#1986) 2019-10-03 11:46:27 -07:00
Yufeng Li
a1f8fe1463
Add layernorm operator (#1967)
* Add LayerNormalization operator
It's a fusion of bert model's layer normalization.
2019-10-03 11:32:13 -07:00
Scott McKay
6b6d0aa888 Fix the NGraph backend test exclusions (#1988)
* Add some ngraph exclusions to the backend test py

* Add '_' in pattern for some 'reduce' negative axis tests.
2019-10-03 06:50:33 -07:00
pengwa
9959e84906
Gelu fusion - kernel and transformer (#1746)
* Gelu contrib op & transformer

* Gelu kernels for CPU&cuda

* Merged PR 5034: fix a condition for gelu transformer

The ONNX models doesn't guarantee to assign an unique name to each node, so the previous condition could fail.

(cherry picked from commit e335ef5466444cb0aae45f885ea3a825ed9f1088)

* Fix builds

* remove useless comments

* fix test failure when nocontribp

* Move impelmentation under KMSdomain

* fix comments

* fix linux build

* Fix few comments

* fix linux build
2019-10-03 19:34:46 +08:00
Scott McKay
b0665262c0
Fix how failing tests are defined. Merge of old PR didn't detect conflict with some new sections in the script. (#1982) 2019-10-03 14:00:47 +10:00
Hariharan Seshadri
c79195fb3b
Add GatherElements CPU kernel (#1735)
* Support GatherElements

* More changes

* More updates

* update tests

* More changes

* Fix build

* Fix build break

* Add more tests

* PR comments

* Update
2019-10-02 18:29:03 -07:00
Scott McKay
9f633c5bd9
Update Cast op to use precision of 8 when casting floating point numbers to strings (#1210)
* Update Cast op to use precision of 8 when casting floating point numbers to strings. This matches numpy precision.

Update unit tests to include non-trivial floats in the input.

Update onnx test infrastructure to document why the test cases are disabled
2019-10-03 07:58:42 +10:00
Bowen Bao
9e975f64c3
Add scatter_nd cpu (#1603)
* Add ScatterND impl

* remove skipped tests

* add rank check for input tensor.
2019-10-02 14:34:57 -07:00
Vinitra Swamy
7c77a01ce7
OpenVINO dockerfile for ORT (#1892)
* dockerfile openvino updates
2019-10-02 14:04:04 -07:00
shahasad
103b92889e
Opset-11 support (negative axis) for reduce ops (#1929) 2019-10-02 13:45:17 -07:00
George Wu
f9bf546e3c
python session.run() fallback to CPU/CUDA provider for EP failures. (#1960)
* py fallback initial commit.

* fixes.

* update NGRAPHCustomOp::Initialize() to return Status

* fixes in session.py

* FAIL status to EP_FAIL in ngraph custom op

* disable fallback for backend api
2019-10-02 02:38:03 -07:00
baowenlei
622ea4248d
fix build instruction (#1970) 2019-10-01 20:59:53 -07:00
Hariharan Seshadri
a5e134405d
Support opset-11 Gemm kernels (#1923)
* Support optional bias in Gemm

* Fix test

* Update

* More updates

* Update

* Update

* Update gemm.cc

* Update

* Update

* Fix build break

* Update

* PR comments

* Update
2019-10-01 20:32:28 -07:00
Hariharan Seshadri
31aff686e0
Register opset-11 CPU kernel for 'If' op (#1948)
* Initial commit

* Update

* Update

* Update

* Update

* PR comments
2019-10-01 16:40:26 -07:00
Dmitri Smirnov
d1b1cdc5c4
Replace GSL with GSL-LITE submodule and fix up refs (#1920)
Remove gsl subodule and replace with a local copy of gsl-lite
  Refactor for onnxruntime::make_unique
  gsl::span size and index are now size_t
  Remove lambda auto argument type detection.
  Remove constexpr from fail_fast in gsl due to Linux not being happy.
  Comment out std::stream support due to MacOS std lib broken.
  Move make_unique into include/core/common so it is accessible for server builds.
  Relax requirements for onnxruntime/test/providers/cpu/ml/write_scores_test.cc
  due to x86 build.
  Add ONNXRUNTIME_ROOT to Server Lib includes so gsl is recognized
2019-10-01 12:43:29 -07:00
Scott McKay
983ed43f0a
Fix declarations for opset 9 versions of Gemm, MatMul and BatchNormalization (#1956)
* Fix declarations for new versions of Gemm, MatMul and BatchNormalization from opset 9.
2019-10-01 13:38:07 +10:00
shahasad
b355193841
Add Date-time stamp in NuGet package versioning for appropriate ordering of the packages (#1951) 2019-09-30 16:24:16 -07:00