Commit graph

1913 commits

Author SHA1 Message Date
Yufeng Li
ca2ed17ba7
Bump up version number to 1.2 (#3097) 2020-02-26 17:25:16 -08:00
Yufeng Li
f1ba531d9c
Disable test_zfnet512 and test_bvlc_reference_caffenet for x86 in C# tests (#3094) 2020-02-26 14:40:55 -08:00
Changming Sun
04063aa324 optimizer shouldn't depend on inference session 2020-02-26 13:42:14 -08:00
Hariharan Seshadri
bf7afbef23
Changes in the props file to support .NET + AnyCPU configuration (#3091) 2020-02-25 20:28:36 -08:00
Ori Levari
5e0f7412cd
Properly handle downlevel and WCOS scenarios (#3075) 2020-02-25 17:47:02 -08:00
Dmitri Smirnov
57384d5077
Add disclaimer to DataFrameTool header (#3085) 2020-02-25 10:11:45 -08:00
Changming Sun
d7500b26bd
Remove Publish Build Symbols from pre-checkin CI build (#3088) 2020-02-25 08:02:36 -08:00
stevenlix
f4a5d17294
Upgrade to CUDA10.2 for TensorRT (#3084)
* Switch to CUDA10.2

* Update win-gpu-tensorrt-ci-pipeline.yml

* Update win-gpu-tensorrt-ci-pipeline.yml

* remove dynamic_shape

* update onnx-tensorrt submodule

* check if input shape is specified for TensorRT subgraph input and enable some TensorRT unit tests

* fix format issue

* add shape inference instruction for TensorRT

* update according to the reviews

* Update win-gpu-tensorrt-ci-pipeline.yml
2020-02-25 05:36:01 -08:00
Hariharan Seshadri
d7f2cdcc7e
Fix target platform of managed OnnxRuntime dll and enable x86 .NET testing (#3056)
* WIP: Re-enable x86 .NET testing in Release pipelines

Enabling x86 testing will make sure that ORT packages don’t break customers' x86 projects

* Remove setting some env variables

* Comment out a test failing on x86 builds

* More changes

* Minor fix

* More changes

* More changes

* s

* s

* s

* Revert minor change

* More changes

* More changes

* More changes 2

* explicitly set platform target

* Delete bin and obj folders

* Clean output dirs

* Add back TargetFramework

* Disable x86 .net framework tests

* Skip x86 tests in MKLML pipeline
2020-02-24 23:02:59 -08:00
Adam Pocock
b23b7f0fea
[java] Adds the provider compile-time flags where the JNI code expects them. (#3082) 2020-02-24 15:47:26 -08:00
Ryan Lai
a506911208
No need to create a copy of graph proto when checking to see if there is fp16 input (#3061)
* Don't create a copy of model proto when checking to see if there is fp16 input

* PR comments about making functions const

* Loop through nodeargs in graph object to see if there are fp16 datatypes

* Rename check to checking only inputs
2020-02-24 15:14:29 -08:00
Dmitri Smirnov
dae9a31719
Introduce new Featurizers packaging pipeline. (#3068)
Introduce new Featurizers packaging pipeline.
2020-02-24 13:57:38 -08:00
Dmitri Smirnov
b8628404f3
Replace hardcoded include path value with the advertised setting. (#3083) 2020-02-24 13:55:00 -08:00
Faith Xu
fb7317173d
Doc updates for 1.2 release (#3069)
* Update version info

* Updates 

Add winML API, update GPU dependency section

* Updates Windows API section

* Minor update
2020-02-24 11:48:13 -08:00
Changming Sun
0d30b42c7b
Fix python gpu build (#3081)
The problem was introduced in #2999. After that, we can't publish our python wheels to pypi because the package size is too big.
2020-02-24 09:51:40 -08:00
kile0
f367fd921c
Use a custom allocator for temporary buffers in reduction_ops.cc (#2775)
* port the mimalloc allocator

* hook mimalloc opt into common.h and reduction ops

* repurpose USE_MIMALLOC to only denote subbing in of default allocator with mimalloc and some refactoring

* fix unintended cherry pick diffs

* polish allocator_mimalloc

* explicitly disable mimalloc where it already had been disabled

* update mimalloc to pull in stl allocator

* switch mimalloc stl allocator to use mimalloc library version

* turn mimalloc on by default (only the stl changes are enabled, the python interacting ones are off already and shall remain so)

* move FastAllocVector into cpu specific code

* separate out defines into arena and stl changes

* the rest of the define renames

* bfc arena allocator

* some typos and rename the bfc arena allocator to fit existing class naming conventions

* adjustments in response to comments

* different template instantiations are friends
2020-02-23 16:04:30 +10:00
Changming Sun
61ae134469
Fix binary size report (#3080) 2020-02-22 21:01:06 -08:00
Changming Sun
3e371575ea
Relax testing tolerance of NchwcOptimizerTests::BatchNormalization (#3078) 2020-02-22 17:06:42 -08:00
Changming Sun
ae1f35fb9f
Ignore GCC no-deprecated-copy warnings (#3074) 2020-02-22 11:48:27 -08:00
Changming Sun
a0c9f679fc
Add logging to nchwc_optimizer_test.cc (#3073) 2020-02-22 08:15:00 -08:00
Changming Sun
e38df45587
Add missing include to dnnl_conv_batchnorm.h 2020-02-21 20:17:23 -08:00
Yufeng Li
64feee1b52
Logging in framework.cc should use the session logger (#3059) 2020-02-21 17:11:14 -08:00
Changming Sun
fb871978b5
Adjust build flags for the release pipelines (#3066)
1. Add LTCG back. It was set to default OFF in my previous PR to speed up Windows build. It is only needed in release pipelines.
2. Remove --use_featurizers from all the packaging pipelines
3. Make sure all the packages have openmp
2020-02-21 16:45:42 -08:00
Adam Pocock
a7541f9753
[Java] Fix for incorrect input and output lengths in run call (#3064) 2020-02-21 13:13:02 -08:00
Scott McKay
21cc2d88b4
Move some vectors out of loops to minimize memory allocations/reallocations (#3062)
* Move some vectors out of loops to minimize memory allocations/reallocations.
Remove some unused/unnecessary vectors.
2020-02-22 06:35:13 +10:00
Changming Sun
7ffb36be44
Fix softmax cpu code for double type (#3065) 2020-02-21 12:06:13 -08:00
Changming Sun
179603775f
Use CUDA 10.1 for Linux build (#3057)
Use CUDA 10.1 for Linux build
(Windows change is already in)

Please note, cublas 10.2.1.243 is for CUDA SDK 10.1.243, not CUDA 10.2.x. CUDA 10.2.89 needs cublas 10.2.2.89. They match on the last part of the digits.

libcublas10-10.1.0.105 won't work!!!

The cuda docker image by viswamy is already using 10.1, no need to change.
2020-02-21 11:55:32 -08:00
Ori Levari
be12fb3143
include winml x86 binaries in the drop-signed-nuget artifact (#3058) 2020-02-21 11:17:23 -08:00
Prabhat
752d74928c
Improve the efficiency of SVMRegressor code (#3054) 2020-02-21 13:14:11 +00:00
pengwa
92b8a7a2be
GPT2 Gelu Fusion & Test (#3009)
* GPT2 Gelu Fusion & Test

* change header path

* Refine code & add missing test onnx file

* Fix builds & refine float/double/fp16 compare.

* Fix builds

* Add Bias Check and UTs

* Fix build and uts

* Fuse with second formula & test

* minor change

* disable FastGelu to see whether the builds can pass

* Verify where is wrong

* disable for debugging

* Revert "disable for debugging"

This reverts commit 535c0817fb36fb95a75773a7f00c8b969dd5362c.

* Revert "Verify where is wrong"

This reverts commit ffc43ec1d136636ba2cee30df49f563a75e84676.

* disable the transformer for inference currently

* Enable FastGeluFusion and fix segmentation fault when running the bertsquad10.onnx test

* Add more unit tests covering Gelu subgraphs that use graph inputs/outputs

(cherry picked from commit 0739ab985240c6d9acdb8f0afd40c5fb316166af)

* Move Bias Fusion in BiasGelu.cc

Co-authored-by: Changming Sun <chasun@microsoft.com>
2020-02-21 18:25:43 +08:00
Scott McKay
932ecaea34
Some documentation updates. (#3060) 2020-02-21 20:07:39 +10:00
William Tambellini
c033cfee92
Check python version (#356)
Check python version in build.py
2020-02-20 23:26:25 -08:00
Changming Sun
45ba325fa6
Remove USE_NSYNC macro (#3052) 2020-02-20 13:29:19 -08:00
Scott McKay
a1db87b382
Add SafeInt bounds checking to memory allocation size calculations. (#3022)
* Add SafeInt bounds checking to memory allocation size calculations.

* Fix TensorRT library includes
2020-02-20 11:41:03 -08:00
Pranav Sharma
21f9a8bdc2
Allow creation of string tensor sequence (#3048) 2020-02-20 11:27:42 -08:00
Changming Sun
cb24e2a214 Update nsync 2020-02-20 11:25:34 -08:00
Changming Sun
85c0989e6f Fix build errors in benchmark 2020-02-20 11:22:28 -08:00
Changming Sun
5306a1241b Server: Add build options for the other EPs 2020-02-20 11:14:24 -08:00
smk2007
44d5eaf3d7
WinML exists in the nuget packages but does not publish its WinMD and headers (#3037)
* publish winmd and raw headers

* Add the lib too

* add missing conditions

* Fix copy/paste condition error
2020-02-20 10:24:29 -08:00
Scott McKay
bcbc5ee815 Improve error message for type mismatch between data in initializer and graph usage of the initializer. 2020-02-19 23:52:20 -08:00
James Yuzawa
dde4df148b Add and run spotless formatter for Java and Gradle 2020-02-19 23:51:09 -08:00
Changming Sun
ef2bba316b
CUDA 10.1 for Windows (#3049) 2020-02-19 23:26:47 -08:00
Xiang Zhang
18aa0940c2
checking condition is wrong (#3025)
* checking condition is wrong

* remove unnecessary check
2020-02-19 13:29:42 -08:00
Changming Sun
e3c27536d0
Python binding doesn't need to link to the python lib on Linux 2020-02-19 12:18:47 -08:00
Dmitri Smirnov
daf8c4bee4
Remove featurizers from CPU MLDNN and NoContribOps builds. (#3039)
The first removal is temporary. The second is permanent.
2020-02-19 06:23:36 -08:00
James Yuzawa
411b3aa801
Java build system enhancements (#2866) 2020-02-18 15:41:49 -08:00
Tracy Sharpe
ecdcd682bb
NCHWc ReorderOutput->Transpose(NHWC) fusion (#3035)
Add support to fuse ReorderOutput+Transpose(NHWC). Converting from NCHWc to NHWC tensors is a trivial copy of data and avoids the cost of a transpose node.
2020-02-18 10:23:48 -08:00
Changming Sun
71ca43b345
Revert an op version change (#3026)
Revert an op version change, it was brought in from #2999
2020-02-18 09:43:18 -08:00
Tracy Sharpe
cc8adc87c3
fix NCHWc intermediates as graph outputs (#3033)
This fixes a customer-reported issue where the NCHWc optimizer was dropping graph outputs when an edge was used as both a graph output and an input to another NCHWc node.
2020-02-17 14:52:33 -08:00
daquexian
4ca50d9352 Update DNNLibrary to v0.9.0 and update NNAPI GetSupportedNodes 2020-02-17 13:24:10 -08:00