Commit graph

1884 commits

Author SHA1 Message Date
pengwa
92b8a7a2be
GPT2 Gelu Fusion & Test (#3009)
* GPT2 Gelu Fusion & Test

* change header path

* Refine code & add missing test onnx file

* Fix builds & refine float/double/fp16 compare.

* Fix builds

* Add Bias Check and UTs

* Fix build and uts

* Fuse with second formula & test

* minor change

* disable FastGelu to see whether the builds can pass

* Verify where is wrong

* disable for debugging

* Revert "disable for debugging"

This reverts commit 535c0817fb36fb95a75773a7f00c8b969dd5362c.

* Revert "Verify where is wrong"

This reverts commit ffc43ec1d136636ba2cee30df49f563a75e84676.

* disable the transformer for inference currently

* Enable FastGeluFusion and fix segement fault when run bertsquad10.onnx test

* Add more Unit tests convering Gelu subgraph use graph input/output

(cherry picked from commit 0739ab985240c6d9acdb8f0afd40c5fb316166af)

* Mode Bias Fusion in BiasGelu.cc

Co-authored-by: Changming Sun <chasun@microsoft.com>
2020-02-21 18:25:43 +08:00
Scott McKay
932ecaea34
Some documentation updates. (#3060) 2020-02-21 20:07:39 +10:00
William Tambellini
c033cfee92
Check python version (#356)
Check python version in build.py
2020-02-20 23:26:25 -08:00
Changming Sun
45ba325fa6
Remove USE_NSYNC macro (#3052) 2020-02-20 13:29:19 -08:00
Scott McKay
a1db87b382
Add SafeInt bounds checking to memory allocation size calculations. (#3022)
* Add SafeInt bounds checking to memory allocation size calculations.

* Fix TensorRT library includes
2020-02-20 11:41:03 -08:00
Pranav Sharma
21f9a8bdc2
Allow creation of string tensor sequence (#3048) 2020-02-20 11:27:42 -08:00
Changming Sun
cb24e2a214 Update nsync 2020-02-20 11:25:34 -08:00
Changming Sun
85c0989e6f Fix build errors in benchmark 2020-02-20 11:22:28 -08:00
Changming Sun
5306a1241b Server: Add build options for the other EPs 2020-02-20 11:14:24 -08:00
smk2007
44d5eaf3d7
WinML exists in the nuget packages but does not publish its WinMD and headers (#3037)
* publish winmd and raw headers

* Add the lib too

* add missing conditions

* Fix copy/paste condition error
2020-02-20 10:24:29 -08:00
Scott McKay
bcbc5ee815 Improve error message for type mismatch between data in initializer and graph usage of the initializer. 2020-02-19 23:52:20 -08:00
James Yuzawa
dde4df148b Add and run spotless formatter for Java and Gradle 2020-02-19 23:51:09 -08:00
Changming Sun
ef2bba316b
CUDA 10.1 for Windows(#3049) 2020-02-19 23:26:47 -08:00
Xiang Zhang
18aa0940c2
checking condition is wrong (#3025)
* checking condition is wrong

* remove unnecessary check
2020-02-19 13:29:42 -08:00
Changming Sun
e3c27536d0
Python binding doesn't need to link to the python lib on Linux 2020-02-19 12:18:47 -08:00
Dmitri Smirnov
daf8c4bee4
Remove faturizers from CPU MLDNN and NoContribOps builds. (#3039)
The first one is temp. The second one is permanent removal.
2020-02-19 06:23:36 -08:00
James Yuzawa
411b3aa801
Java build system enhancements (#2866) 2020-02-18 15:41:49 -08:00
Tracy Sharpe
ecdcd682bb
NCHWc ReorderOutput->Transpose(NHWC) fusion (#3035)
Add support to fuse ReorderOutput+Transpose(NHWC). Converting from NCHWc to NHWC tensors is a trivial copy of data and avoids the cost of a transpose node.
2020-02-18 10:23:48 -08:00
Changming Sun
71ca43b345
Revert an op version change (#3026)
Revert an op version change, it was brought in from #2999
2020-02-18 09:43:18 -08:00
Tracy Sharpe
cc8adc87c3
fix NCHWc intermediates as graph outputs (#3033)
This fixes a customer reported issue where the NCHWc optimizer was dropping graph outputs when an edge was used as both a graph output and an input to another NCHWc node.
2020-02-17 14:52:33 -08:00
daquexian
4ca50d9352 Update DNNLibrary to v0.9.0 and update NNAPI GetSupportedNodes 2020-02-17 13:24:10 -08:00
Tianlei Wu
aea76b0786
Bert optimization for onnx model exported by keras2onnx (#3014)
* Optimization for Bert and DistilBert model exported by keras2onnx
* Add model_type parameter for models from different export tools (pytorch, tf2onnx, keras2onnx).
* Split LayerNormalization and SkipLayerNormalization fusions
2020-02-15 23:59:49 -08:00
Tracy Sharpe
3626c46fad
Optimize Math::Im2col + ConvInteger pointwise (#3017)
Optimize the implementation of Math::Im2col that is currently used for ConvInteger/QLinearConv. Also, avoid Im2col for pointwise convolutions in ConvInteger.
2020-02-14 14:25:47 -08:00
Yulong Wang
f98f0f8717
fix type constraint name of Resize (#3019) 2020-02-14 14:13:32 -08:00
ytaous
2b77cb19bd
merge training kernels to master (#2999)
* merge training kernels to master

* merge training kernels to master

* revert two files

* merge training kernels to master

* merge training kernels to master

* merge training kernels to master

* merge training kernels to master

* merge training kernels to master

* merge training kernels to master

* merge training kernels to master

* merge training kernels to master

* merge training kernels to master

* merge training kernels to master

* merge training kernels to master

* merge training kernels to master

* merge training kernels to master

* merge training kernels to master

* merge training kernels to master

* merge training kernels to master

* merge training kernels to master

* merge training kernels to master
2020-02-13 14:52:35 -08:00
George Wu
584ba71485
TensorRT dockerfile updates (#3016)
* change npy installation

* update trt base image version

* update build instructions for arm64/jetson
2020-02-13 14:34:00 -08:00
Hariharan Seshadri
c9f18756b9
Avoid unnecessary copies of ModelProto from being made in the InferenceSession class (#3012)
* Avoid unneccesary copy creations of ModelProto

* Comment nit

* Nuit

* Comment refactoring

* Comment refactoring

* Fix build break

* Fix a few more instances where copies take place
2020-02-12 18:54:11 -08:00
Changming Sun
69bc8ce3c2 Upgrade protobuf to 3.11.3 2020-02-12 14:47:00 -08:00
Changming Sun
5f29c9c366
Add some test cases (#3010)
1. Add test cases for the schema registry in onnxruntime_graph
2. Use ASSERT_STATUS_OK when appropriate
2020-02-12 14:00:50 -08:00
Dmitri Smirnov
c6b499bfc9
Fix batch mode output for TfIdfVectorizer (#3008)
Fix weights indexing in TfIdfVectorizer
  Add a test case to cover batching.
2020-02-12 10:46:20 -08:00
stevenlix
da653ccdac
Upgrade TensorRT to version 7.0.0.11 (#2973)
* update onnx-tensorrt submodule to trt7 branch

* add fp16 option for TRT7

* switch to master branch of onnx tensorrt

* update submodule

* update to TensorRT7.0.0.11

* update to onnx-tensorrt for TensorRT7.0

* switch to private branch due to issues in master branch

* remove trt_onnxify

* disable warnings c4804 for TensorRT parser

* disable warnings c4702 for TensorRT parser

* add back sanity check of shape tensort input in the parser

* disable some warnings for TensorRT7

* change fp16 threshold for TensorRT

* update onn-tensorrt parser

* fix cycle issue in faster-rcnn and add cycle detection in GetCapability

* Update TensorRT container to v20.01

* Update TensorRT image name

* Update linux-multi-gpu-tensorrt-ci-pipeline.yml

* Update linux-gpu-tensorrt-ci-pipeline.yml

* disable rnn tests for TensorRT

* disable rnn tests for TensorRT

* disabled some unit test for TensorRT

* update onnx-tensorrt submodule

* update build scripts for TensorRT

* formating the code

* Update TensorRT-ExecutionProvider.md

* Update BUILD.md

* Update tensorrt_execution_provider.h

* Update tensorrt_execution_provider.cc

* Update win-gpu-tensorrt-ci-pipeline.yml

* use GetEnvironmentVar function to get env virables and switch to Win-GPU-2019 agent pool for win CI build

* change tensorrt path

* change tensorrt path

* fix win ci build issue

* update code based on the reviews

* fix build issue

* roll back to cuda10.0

* add RemoveCycleTest for TensorRT

* fix windows ci build issues

* fix ci build issues

* fix file permission

* fix out of range issue for max_workspace_size_env
2020-02-12 07:03:58 -08:00
Dmitri Smirnov
273868eaa5
Disable NuGetPackaging on Linux GPU and remove DML from the pipelines (#3006) 2020-02-11 20:08:18 -08:00
Faith Xu
ed0875909e
Reorganize and update readme (#2916)
* Reorganize and update readme

* Minor fixes

* Minor fixes based on comments

* fix typo
2020-02-11 15:59:20 -08:00
Tiago Koji Castro Shibata
e5f7e04e6f
Use target_add_dml function (#3005)
* Use target_add_dml function

* Don't access DML if disabled
2020-02-11 12:54:38 -08:00
Changming Sun
abb626ff60
Provide alternative std::mutex implementation on Windows (#3000)
Provide alternative std::mutex implementation on Windows. OrtMutex is no longer an alias of std::mutex.

We do it because:

1. This new thing is faster and much much simpler.
2. Static constructors are considered harmful. We should avoid such thing as possible as we can.
2020-02-11 11:46:08 -08:00
Dmitri Smirnov
c1997db85e Exclude faturizers from Linux NuGet packaging. 2020-02-10 22:21:52 -08:00
Dmitri Smirnov
36915b3674 Temporarily remove Featirizers from packaging-pipelines 2020-02-10 22:21:52 -08:00
smk2007
ce713823cc
enable winml in the gpu ci pipeline (#2993) 2020-02-10 22:21:13 -08:00
smk2007
5c5ac34b5c
Disable use_dml in nuget pipeline (#3001) 2020-02-10 22:09:58 -08:00
Tiago Koji Castro Shibata
fb2182f3fc
Release ARM/ARM64 Nuget packages (#2987)
* Enable ARM64 release builds

* Add ARM release

* Skip C# dll signing in ARM

* Copy ARM binaries to Nuget

* Restore nuget packages before ARM packaging

* wip

* Use host protoc at C# build

* Set ProtocDirectory on cross-compiled builds

* wip

* Fix typo
2020-02-10 16:29:27 -08:00
Hariharan Seshadri
3afb83ac3c
Support a few new C/C++ APIs (#2794)
* Initial commit

* More changes

* More changes

* Changes

* More changes

* More changes

* More changes

* More changes

* Updates

* Fix break

* PR feedback

* Nit

* Resolve conflicts

* More changes
2020-02-10 16:18:42 -08:00
Dmitri Smirnov
7437928f47
Replace hardcoded State serialization for Featurizer kernel tests (#2992)
Use in flight serialization for transformers State instead on hard coded values.
2020-02-10 10:02:09 -08:00
Changming Sun
64deb8030f Update ABI_Dev_Notes.md (#2959) 2020-02-07 20:09:56 -08:00
Xiang Zhang
b8d7d81506
Calling WinML enables telemetry by default for 1.2 (#2988) 2020-02-07 15:52:27 -08:00
smk2007
ca7626d739
Winml should be built against v2 of the capi as that includes the new type info apis (#2991) 2020-02-07 12:53:30 -08:00
Dmitri Smirnov
c8ea154e55
Package data_frame_tool, include featurizers into Manilinux2010 (#2989)
* Package data_frame_tool, exclude featurizers from Manilinux2010 as their fail to build.
2020-02-07 11:38:42 -08:00
Tianlei Wu
62383b0328
Improve BERT optimization script: Gelu and LayerNorm for models from Tensorflow 2.* (#2957)
* Add unit test.
Add an option --use_onnxruntime to use onnxruntime to do optimization for pytorch model.
Update layer norm and gelu for tensorflow 2.1 keras bert model.
Add logging and use f-strings.
Add extra checking for tensorflow model reshape fusion.
Allow output model to json for test purpose.
update match parent path utility function to return index

* remove function not used.
2020-02-07 11:01:03 -08:00
Ori Levari
0beb75ce77
populate file metadata for onnxruntime.dll (#2978) 2020-02-07 11:00:28 -08:00
Dmitri Smirnov
4f4f4bcd92
Handle dynamic shapes and reshape the input according to the model (#2986)
Put type validation in separate method.
2020-02-06 16:46:07 -08:00
Changming Sun
ec07fdc5b0
Update ThirdPartyNotices.txt for eigen (#2985) 2020-02-06 16:09:10 -08:00