Commit graph

1893 commits

Author SHA1 Message Date
Changming Sun
e38df45587
Add missing include to dnnl_conv_batchnorm.h 2020-02-21 20:17:23 -08:00
Yufeng Li
64feee1b52
Logging in framework.cc should use the session logger (#3059) 2020-02-21 17:11:14 -08:00
Changming Sun
fb871978b5
Adjust build flags for the release pipelines (#3066)
1. Add LTCG back. It was set to default OFF in my previous PR to speed up Windows build. It is only needed in release pipelines.
2. Remove --use_featurizers from all the packaging pipelines
3. Make sure all the packages have openmp
2020-02-21 16:45:42 -08:00
Adam Pocock
a7541f9753
[Java] Fix for incorrect input and output lengths in run call (#3064) 2020-02-21 13:13:02 -08:00
Scott McKay
21cc2d88b4
Move some vectors out of loops to minimize memory allocations/reallocations (#3062)
* Move some vectors out of loops to minimize memory allocations/reallocations.
Remove some unused/unnecessary vectors.
2020-02-22 06:35:13 +10:00
Changming Sun
7ffb36be44
Fix softmax cpu code for double type (#3065) 2020-02-21 12:06:13 -08:00
Changming Sun
179603775f
Use CUDA 10.1 for Linux build (#3057)
Use CUDA 10.1 for Linux build
(Windows change is already in)

Please note, cublas 10.2.1.243 is for CUDA SDK 10.1.243, not CUDA 10.2.x. CUDA 10.2.89 needs cublas 10.2.2.89. The versions match on the last group of digits.

libcublas10-10.1.0.105 won't work!!!

The cuda docker image by viswamy is already using 10.1, no need to change.
2020-02-21 11:55:32 -08:00
Ori Levari
be12fb3143
include winml x86 binaries in the drop-signed-nuget artifact (#3058) 2020-02-21 11:17:23 -08:00
Prabhat
752d74928c
Improve the efficiency of SVMRegressor code (#3054) 2020-02-21 13:14:11 +00:00
pengwa
92b8a7a2be
GPT2 Gelu Fusion & Test (#3009)
* GPT2 Gelu Fusion & Test

* change header path

* Refine code & add missing test onnx file

* Fix builds & refine float/double/fp16 compare.

* Fix builds

* Add Bias Check and UTs

* Fix build and uts

* Fuse with second formula & test

* minor change

* disable FastGelu to see whether the builds can pass

* Verify where is wrong

* disable for debugging

* Revert "disable for debugging"

This reverts commit 535c0817fb36fb95a75773a7f00c8b969dd5362c.

* Revert "Verify where is wrong"

This reverts commit ffc43ec1d136636ba2cee30df49f563a75e84676.

* disable the transformer for inference currently

* Enable FastGeluFusion and fix segmentation fault when running the bertsquad10.onnx test

* Add more unit tests covering Gelu subgraphs that use graph inputs/outputs

(cherry picked from commit 0739ab985240c6d9acdb8f0afd40c5fb316166af)

* Move Bias Fusion in BiasGelu.cc

Co-authored-by: Changming Sun <chasun@microsoft.com>
2020-02-21 18:25:43 +08:00
Scott McKay
932ecaea34
Some documentation updates. (#3060) 2020-02-21 20:07:39 +10:00
William Tambellini
c033cfee92
Check python version (#356)
Check python version in build.py
2020-02-20 23:26:25 -08:00
Changming Sun
45ba325fa6
Remove USE_NSYNC macro (#3052) 2020-02-20 13:29:19 -08:00
Scott McKay
a1db87b382
Add SafeInt bounds checking to memory allocation size calculations. (#3022)
* Add SafeInt bounds checking to memory allocation size calculations.

* Fix TensorRT library includes
2020-02-20 11:41:03 -08:00
Pranav Sharma
21f9a8bdc2
Allow creation of string tensor sequence (#3048) 2020-02-20 11:27:42 -08:00
Changming Sun
cb24e2a214 Update nsync 2020-02-20 11:25:34 -08:00
Changming Sun
85c0989e6f Fix build errors in benchmark 2020-02-20 11:22:28 -08:00
Changming Sun
5306a1241b Server: Add build options for the other EPs 2020-02-20 11:14:24 -08:00
smk2007
44d5eaf3d7
WinML exists in the nuget packages but does not publish its WinMD and headers (#3037)
* publish winmd and raw headers

* Add the lib too

* add missing conditions

* Fix copy/paste condition error
2020-02-20 10:24:29 -08:00
Scott McKay
bcbc5ee815 Improve error message for type mismatch between data in initializer and graph usage of the initializer. 2020-02-19 23:52:20 -08:00
James Yuzawa
dde4df148b Add and run spotless formatter for Java and Gradle 2020-02-19 23:51:09 -08:00
Changming Sun
ef2bba316b
CUDA 10.1 for Windows (#3049) 2020-02-19 23:26:47 -08:00
Xiang Zhang
18aa0940c2
checking condition is wrong (#3025)
* checking condition is wrong

* remove unnecessary check
2020-02-19 13:29:42 -08:00
Changming Sun
e3c27536d0
Python binding doesn't need to link to the python lib on Linux 2020-02-19 12:18:47 -08:00
Dmitri Smirnov
daf8c4bee4
Remove featurizers from CPU MLDNN and NoContribOps builds. (#3039)
The first removal is temporary. The second one is permanent.
2020-02-19 06:23:36 -08:00
James Yuzawa
411b3aa801
Java build system enhancements (#2866) 2020-02-18 15:41:49 -08:00
Tracy Sharpe
ecdcd682bb
NCHWc ReorderOutput->Transpose(NHWC) fusion (#3035)
Add support to fuse ReorderOutput+Transpose(NHWC). Converting from NCHWc to NHWC tensors is a trivial copy of data and avoids the cost of a transpose node.
2020-02-18 10:23:48 -08:00
Changming Sun
71ca43b345
Revert an op version change (#3026)
Revert an op version change that was brought in by #2999.
2020-02-18 09:43:18 -08:00
Tracy Sharpe
cc8adc87c3
fix NCHWc intermediates as graph outputs (#3033)
This fixes a customer-reported issue where the NCHWc optimizer was dropping graph outputs when an edge was used as both a graph output and an input to another NCHWc node.
2020-02-17 14:52:33 -08:00
daquexian
4ca50d9352 Update DNNLibrary to v0.9.0 and update NNAPI GetSupportedNodes 2020-02-17 13:24:10 -08:00
Tianlei Wu
aea76b0786
Bert optimization for onnx model exported by keras2onnx (#3014)
* Optimization for Bert and DistilBert model exported by keras2onnx
* Add model_type parameter for models from different export tools (pytorch, tf2onnx, keras2onnx).
* Split LayerNormalization and SkipLayerNormalization fusions
2020-02-15 23:59:49 -08:00
Tracy Sharpe
3626c46fad
Optimize Math::Im2col + ConvInteger pointwise (#3017)
Optimize the implementation of Math::Im2col that is currently used for ConvInteger/QLinearConv. Also, avoid Im2col for pointwise convolutions in ConvInteger.
2020-02-14 14:25:47 -08:00
Yulong Wang
f98f0f8717
fix type constraint name of Resize (#3019) 2020-02-14 14:13:32 -08:00
ytaous
2b77cb19bd
merge training kernels to master (#2999)
* merge training kernels to master

* merge training kernels to master

* revert two files
2020-02-13 14:52:35 -08:00
George Wu
584ba71485
TensorRT dockerfile updates (#3016)
* change npy installation

* update trt base image version

* update build instructions for arm64/jetson
2020-02-13 14:34:00 -08:00
Hariharan Seshadri
c9f18756b9
Avoid unnecessary copies of ModelProto from being made in the InferenceSession class (#3012)
* Avoid unnecessary copy creations of ModelProto

* Comment nit

* Nit

* Comment refactoring

* Comment refactoring

* Fix build break

* Fix a few more instances where copies take place
2020-02-12 18:54:11 -08:00
Changming Sun
69bc8ce3c2 Upgrade protobuf to 3.11.3 2020-02-12 14:47:00 -08:00
Changming Sun
5f29c9c366
Add some test cases (#3010)
1. Add test cases for the schema registry in onnxruntime_graph
2. Use ASSERT_STATUS_OK when appropriate
2020-02-12 14:00:50 -08:00
Dmitri Smirnov
c6b499bfc9
Fix batch mode output for TfIdfVectorizer (#3008)
Fix weights indexing in TfIdfVectorizer.
Add a test case to cover batching.
2020-02-12 10:46:20 -08:00
stevenlix
da653ccdac
Upgrade TensorRT to version 7.0.0.11 (#2973)
* update onnx-tensorrt submodule to trt7 branch

* add fp16 option for TRT7

* switch to master branch of onnx tensorrt

* update submodule

* update to TensorRT7.0.0.11

* update to onnx-tensorrt for TensorRT7.0

* switch to private branch due to issues in master branch

* remove trt_onnxify

* disable warnings c4804 for TensorRT parser

* disable warnings c4702 for TensorRT parser

* add back sanity check of shape tensor input in the parser

* disable some warnings for TensorRT7

* change fp16 threshold for TensorRT

* update onnx-tensorrt parser

* fix cycle issue in faster-rcnn and add cycle detection in GetCapability

* Update TensorRT container to v20.01

* Update TensorRT image name

* Update linux-multi-gpu-tensorrt-ci-pipeline.yml

* Update linux-gpu-tensorrt-ci-pipeline.yml

* disable rnn tests for TensorRT

* disable rnn tests for TensorRT

* disable some unit tests for TensorRT

* update onnx-tensorrt submodule

* update build scripts for TensorRT

* formatting the code

* Update TensorRT-ExecutionProvider.md

* Update BUILD.md

* Update tensorrt_execution_provider.h

* Update tensorrt_execution_provider.cc

* Update win-gpu-tensorrt-ci-pipeline.yml

* use GetEnvironmentVar function to get env variables and switch to Win-GPU-2019 agent pool for win CI build

* change tensorrt path

* change tensorrt path

* fix win ci build issue

* update code based on the reviews

* fix build issue

* roll back to cuda10.0

* add RemoveCycleTest for TensorRT

* fix windows ci build issues

* fix ci build issues

* fix file permission

* fix out of range issue for max_workspace_size_env
2020-02-12 07:03:58 -08:00
Dmitri Smirnov
273868eaa5
Disable NuGetPackaging on Linux GPU and remove DML from the pipelines (#3006) 2020-02-11 20:08:18 -08:00
Faith Xu
ed0875909e
Reorganize and update readme (#2916)
* Reorganize and update readme

* Minor fixes

* Minor fixes based on comments

* fix typo
2020-02-11 15:59:20 -08:00
Tiago Koji Castro Shibata
e5f7e04e6f
Use target_add_dml function (#3005)
* Use target_add_dml function

* Don't access DML if disabled
2020-02-11 12:54:38 -08:00
Changming Sun
abb626ff60
Provide alternative std::mutex implementation on Windows (#3000)
Provide alternative std::mutex implementation on Windows. OrtMutex is no longer an alias of std::mutex.

We do it because:

1. The new implementation is faster and much simpler.
2. Static constructors are considered harmful. We should avoid them as much as possible.
2020-02-11 11:46:08 -08:00
Dmitri Smirnov
c1997db85e Exclude featurizers from Linux NuGet packaging. 2020-02-10 22:21:52 -08:00
Dmitri Smirnov
36915b3674 Temporarily remove Featurizers from packaging-pipelines 2020-02-10 22:21:52 -08:00
smk2007
ce713823cc
enable winml in the gpu ci pipeline (#2993) 2020-02-10 22:21:13 -08:00
smk2007
5c5ac34b5c
Disable use_dml in nuget pipeline (#3001) 2020-02-10 22:09:58 -08:00
Tiago Koji Castro Shibata
fb2182f3fc
Release ARM/ARM64 Nuget packages (#2987)
* Enable ARM64 release builds

* Add ARM release

* Skip C# dll signing in ARM

* Copy ARM binaries to Nuget

* Restore nuget packages before ARM packaging

* wip

* Use host protoc at C# build

* Set ProtocDirectory on cross-compiled builds

* wip

* Fix typo
2020-02-10 16:29:27 -08:00
Hariharan Seshadri
3afb83ac3c
Support a few new C/C++ APIs (#2794)
* Initial commit

* More changes

* More changes

* Changes

* More changes

* More changes

* More changes

* More changes

* Updates

* Fix break

* PR feedback

* Nit

* Resolve conflicts

* More changes
2020-02-10 16:18:42 -08:00