Commit graph

1934 commits

Author SHA1 Message Date
Dmitri Smirnov
e2894c5ffb
Fix package name overrides (#3150)
Add env var with the package name.
2020-03-05 17:10:55 -08:00
Yufeng Li
1d2b8115e2
Support u8u8 in quantization tool (#3140) 2020-03-05 14:42:46 -08:00
KeDengMS
ade4fa108f
Disable delayload for cuda dlls (#3147)
This change fixes #3129. When running onnxruntime as dll on Windows, CUDA does some internal cleanups when process exits. After this, any call to CUDA would cause crash. Delayload makes thread_local destructor to happen after CUDA cleanup, thus the crash.
2020-03-05 14:40:22 -08:00
Dmitri Smirnov
2c446a7f2f
Add push to ORT-NIGHTLY. (#3146) 2020-03-05 11:38:22 -08:00
Yufeng Li
fbb658e603
Implement QuantizeLinear and DequantizeLinear (#3098)
* Implement QuantizeLinear and DequantizeLinear
2020-03-04 13:30:20 -08:00
take-cheeze
83753bcbe3 Suppress maybe uninitialized warning in gcc-9 2020-03-04 11:52:40 -08:00
Dmitri Smirnov
ef8768a53f
Override native package name. Preserve managed package name the same. (#3133)
Override native package name. Preserve managed package name the same.
  Specify pckage name for validation purposes.
 Fix up validation package name parameter.
2020-03-04 10:12:55 -08:00
Prabhat
a2eeb126b9
Optimised kernel_dot() in SVM op (#3135) 2020-03-04 16:30:40 +00:00
Tianlei Wu
9d874c1225
Add bert performance and correctness test tools (#3108)
(1) Add performance test tool for bert model.
(2) Add accuracy test tool to compare inference results of original and optimized bert models.
(3) Add test data generator tool to create test data for onnxruntime_perf_test.exe
(4) Improve bert optimization script: Verify model producer for model_type; Add warning if model is not fully optimized.
(5) Add shape optimizer tool to assist developing optimization script.
(6) Update readme.
2020-03-03 23:18:08 -08:00
Yufeng Li
84ad4eda8b
Implement MatmulInteger on GPU (#3070)
* Implement MatmulInteger
2020-03-03 16:36:33 -08:00
Changming Sun
12605f05d1
Fix CUDA PATH (#3131)
Previously, we put the "bin" folder of all the CUDA verions in the system PATH. And 10.2 is in the front. It's a mess.
So I've removed all of them from the system PATH env. But I need to add one of them back through build scripts.

(The problem only affect the C# test, not the C/C++ tests that forked from build.py).
2020-03-03 14:34:19 -08:00
smk2007
6cdd2b4934
Enable DML Nuget Package for x64 or x86 architectures (#3120)
* add dml gpu pipelines

* add x86 to the gpu dml dev build pipeline

* Enable DML x86 builds

* Fix uint64_t -> size_t warning

* fix warnings

* enable dml on x86 ci builds

* operatorHelper 773 error uint32_t vs uint64_t

* operatorHelper 773 error uint32_t vs uint64_t

* make x86 pipeline use the gpu pool

* more warnings

* fix x86 directml path

* make dml nuget package

* disable tf_pnasnet_large

* disable zfnet512

* make validation use wildcards

* disable x86 dml gpu tests

* add args.

* update gpu.yml

* change nupkg wildcard

* add debug statements

* package x86 dml nupkg

* dont drop managed nuget again from dml pipeline build

* Add DML EULA

* directml license should be renamed to not clobber the existing license

* casing on dml package....

* {} to ()

* fix license name

* disable dml from x86 ci

* typo and cr feedback

* remove featurizers

* ship the dml pdb as well
2020-03-02 20:18:46 -08:00
Dmitri Smirnov
e45326b5df
Create NuGet packaging pipeline for ORT Featurizers (#3125)
Create a new pipeline to publish ORT with Featurizers
  Update pipeline for two separate packages.
  Change package names.
2020-03-02 17:00:56 -08:00
Tracy Sharpe
b538cb7e46
NCHWc Upsample/Mul optimizations (#3116)
Extend the NCHWc layout optimizer to handle Resize(mode=nearest) and Mul.
2020-03-02 14:40:49 -08:00
Hariharan Seshadri
4188b1111a
Add a summary for each ExecutionProviderAppend methods in SessionOptions.cs (#3111)
* Add a summary for each ExecutionProviderAppend methods in SessionOptions.cs

OnnxRuntime managed dll is EP agnostic meaning it will expose all methods pertaining to all possible EPs supported by OnnxRuntime in general. Not all these methods are really "available" to use for a .NET developer unless they have the correpsonding native onnxruntime shared library. Adding a summary line so that intellisense points that out.

* remove empty line
2020-02-28 21:46:57 -08:00
edgchen1
37f5fd8fb8
Add support for loading TensorProtos with external data from optimizer Initializer (#3045)
- Added support for loading TensorProtos with external data from the optimizer Initializer class.
- Added some file path utilities.
2020-02-28 13:19:16 -08:00
Changming Sun
c6ed077441
Add d2FH4- flag to cuda (#3105) 2020-02-27 20:22:07 -08:00
Hariharan Seshadri
86b755774f
Create a separate Nuget hosting just managed assemblies (#3020)
* Initial commit

* More changes

* More changes

* More changes 3

* More changes 4

* More changes 5

* More changes 5

* More changes 6

* More changes 7

* More changes 8

* Remove C# ifdefs

* More changes 10

* More changes 11

* YAML changes for other release pipelines

* Add release notes metadata

* Props and Targets change

* Add CSHarp proj

* More changes 12

* More changes

* Minor fix

* Minor fix

* Fix yaml

* Some missing logic for winml

* Minor update

* Fix casing for winmd file

* Fix casing

* Add targets and props for managed section into native nuget

* revert file

* a
2020-02-27 18:00:17 -08:00
Dmitri Smirnov
5008fc5b00
Featurizers: Import fix for Linux build adjust linkage (#3089)
Advance FeaturizersLibrary
  SetAbsError on Output
2020-02-27 15:49:18 -08:00
Changming Sun
d72639ef77
Fix CUDA 10.1 DLL names (#3102) 2020-02-27 14:43:16 -08:00
daquexian
37a905f557
Make Java API available on Android (#3030) 2020-02-27 08:23:50 -08:00
Yufeng Li
ca2ed17ba7
Bump up version number to 1.2 (#3097) 2020-02-26 17:25:16 -08:00
Yufeng Li
f1ba531d9c
Disable test_zfnet512 and test_bvlc_reference_caffenet for x86 in C# tests (#3094) 2020-02-26 14:40:55 -08:00
Changming Sun
04063aa324 optimizer shouldn't depend on inference session 2020-02-26 13:42:14 -08:00
Hariharan Seshadri
bf7afbef23
Changes in the props file to support .NET + AnyCPU configuration (#3091) 2020-02-25 20:28:36 -08:00
Ori Levari
5e0f7412cd
Properly handle downlevel and WCOS scenarios (#3075) 2020-02-25 17:47:02 -08:00
Dmitri Smirnov
57384d5077
Add disclaimer to DataFrameTool header (#3085) 2020-02-25 10:11:45 -08:00
Changming Sun
d7500b26bd
Remove Publish Build Symbols from pre-checkin CI build (#3088) 2020-02-25 08:02:36 -08:00
stevenlix
f4a5d17294
Upgrade to CUDA10.2 for TensorRT (#3084)
* Switch to CUDA10.2

* Update win-gpu-tensorrt-ci-pipeline.yml

* Update win-gpu-tensorrt-ci-pipeline.yml

* remove dynamic_shape

* update onnx-tensorrt submodule

* check if input shape is specified for TensorRT subgraph input and enable some TensorRT unit tests

* fix format issue

* add shape inference instruction for TensorRT

* update according to the reviews

* Update win-gpu-tensorrt-ci-pipeline.yml
2020-02-25 05:36:01 -08:00
Hariharan Seshadri
d7f2cdcc7e
Fix target platform of managed OnnxRuntime dll and enable x86 .NET testing (#3056)
* WIP: Re-enable x86 .NET testing in Release pipelines

Enabling x86 testing will make sure that ORT packages doesn’t break x86 projects of customers

* Remove setting some env variables

* Comment out a test failing on x86 builds

* More changes

* Minor fix

* More changes

* More changes

* s

* s

* s

* Revert minor change

* More changes

* More changes

* More changes 2

* explicitly set platform target

* Delete bin and obj folders

* Clean output dirs

* Add back TargetFramwork

* Disable x86 .net framework tests

* Skip x86 tests in MKLML pipeline
2020-02-24 23:02:59 -08:00
Adam Pocock
b23b7f0fea
[java] Adds the provider compile-time flags where the JNI code expects them. (#3082) 2020-02-24 15:47:26 -08:00
Ryan Lai
a506911208
No need to create a copy of graph proto when checking to see if there is fp16 input (#3061)
* Don't create a copy of model proto when checking to see if there is fp16 input

* PRcomments about making functions const

* Loop through nodeargs in graph object to see if there are fp16 datatypes

* Rename check to checking only inputs
2020-02-24 15:14:29 -08:00
Dmitri Smirnov
dae9a31719
Introduce new Featurizers packaging pipeline. (#3068)
Introduce new Featruizers packaging pipeline.
2020-02-24 13:57:38 -08:00
Dmitri Smirnov
b8628404f3
Replace hardcoded include path value with the advertised setting. (#3083) 2020-02-24 13:55:00 -08:00
Faith Xu
fb7317173d
Doc updates for 1.2 release (#3069)
* Update version info

* Updates 

Add winML API, update GPU dependency section

* Updates Windows API section

* Minor update
2020-02-24 11:48:13 -08:00
Changming Sun
0d30b42c7b
Fix python gpu build (#3081)
The problem was introduced in #2999. After that, we can't publish our python wheels to pypi because the package size is too big.
2020-02-24 09:51:40 -08:00
kile0
f367fd921c
Use a custom allocator for temporary buffers in reduction_ops.cc (#2775)
* port the mimalloc allocator

* hook mimalloc opt into common.h and reduction ops

* repurpose USE_MIMALLOC to only denote subbing in of default allocator with mimalloc and some refactoring

* fix unintended cherry pick diffs

* polish alloctor_mimalloc

* explicitly disable mimalloc where it already had been disabled

* update mimalloc to pull in stl allocator

* switch mimalloc stl allocator to use mimalloc library version

* turn mimalloc on by default (only the stl changes are enabled, the python interacting ones are off already and shall remain so)

* move FastAllocVector into cpu specific code

* separate out defines into arena and stl changes

* the rest of the define renames

* bfc arena allocator

* some typos and rename the bfc arena allocator to fit existing class naming conventions

* adjustments in response to comments

* different template instantiations are friends
2020-02-23 16:04:30 +10:00
Changming Sun
61ae134469
Fix binary size report (#3080) 2020-02-22 21:01:06 -08:00
Changming Sun
3e371575ea
Relax testing tolerance of NchwcOptimizerTests::BatchNormalization (#3078) 2020-02-22 17:06:42 -08:00
Changming Sun
ae1f35fb9f
Ignore GCC no-deprecated-copy warnings (#3074) 2020-02-22 11:48:27 -08:00
Changming Sun
a0c9f679fc
Add logging to nchwc_optimizer_test.cc (#3073) 2020-02-22 08:15:00 -08:00
Changming Sun
e38df45587
Add missing include to dnnl_conv_batchnorm.h 2020-02-21 20:17:23 -08:00
Yufeng Li
64feee1b52
Logging in framework.cc should use the session logger (#3059) 2020-02-21 17:11:14 -08:00
Changming Sun
fb871978b5
Adjust build flags for the release pipelines (#3066)
1. Add LTCG back. It was set to default OFF in my previous PR to speed up Windows build. It is only needed in release pipelines.
2. Remove --use_featurizers from all the packaging pipelines
3. Make sure all the packages have openmp
2020-02-21 16:45:42 -08:00
Adam Pocock
a7541f9753
[Java] Fix for incorrect input and output lengths in run call (#3064) 2020-02-21 13:13:02 -08:00
Scott McKay
21cc2d88b4
Move some vectors out of loops to minimize memory allocations/reallocations (#3062)
* Move some vectors out of loops to minimize memory allocations/reallocations.
Remove some unused/unnecessary vectors.
2020-02-22 06:35:13 +10:00
Changming Sun
7ffb36be44
Fix softmax cpu code for double type (#3065) 2020-02-21 12:06:13 -08:00
Changming Sun
179603775f
Use CUDA 10.1 for Linux build (#3057)
Use CUDA 10.1 for Linux build
(Windows change is already in)

Please note, cublas 10.2.1.243 is for CUDA SDK 10.1.243, not CUDA 10.2.x. CUDA 10.2.89 need cublas 10.2.2.89. They match on the last part of the digits.

libcublas10-10.1.0.105 won't work!!!

The cuda docker image by viswamy is already using 10.1, no need to change.
2020-02-21 11:55:32 -08:00
Ori Levari
be12fb3143
include winml x86 binaries in the drop-signed-nuget artifact (#3058) 2020-02-21 11:17:23 -08:00
Prabhat
752d74928c
Improve the efficiency of SVMRegressor code (#3054) 2020-02-21 13:14:11 +00:00