Commit graph

1828 commits

Author SHA1 Message Date
Maxim Kalinin
d5efbcb8d8
MLAS: Apply 'small-M' optimization for column-vectors (#2971)
Apply 'small-M' optimization for column-vectors in MlasSgemmOperation
2020-02-04 14:51:20 -08:00
Changming Sun
7ff5c0e5a3
CMake changes (#2961)
1. Add support for vstest. 
2. Add support for vcpkg. To use it:
  ```bat
   vcpkg install zlib:x64-windows benchmark:x64-windows gtest:x64-windows protobuf:x64-windows pybind11:x64-windows re2:x64-windows
   mkdir build
   cmake ..\cmake -DCMAKE_BUILD_TYPE=Debug -A x64 -T host=x64 -DCMAKE_TOOLCHAIN_FILE=C:\vcpkg\scripts\buildsystems\vcpkg.cmake -DVCPKG_TARGET_TRIPLET=x64-windows -Donnxruntime_PREFER_SYSTEM_LIB=ON
  ```
3. New cmake option: onnxruntime_PREFER_SYSTEM_LIB, which allows user using the preinstall libs instead of the things in onnxruntime submodule.
4. New cmake option: onnxruntime_ENABLE_MEMLEAK_CHECKER, which allows user turn on/off the memory leak checker by @RyanUnderhill in Windows Debug Build. The checker doesn't work with vstest.
4. Fix the post merge pipeline(Mainly for test coverage report).
5. Ignore the compile warning from the Featurizer library code
6. Apply "/utf-8" VC compile flag to our code. Without this, you can't build onnxruntime on Chinese Windows.
7. Remove the SingleUnitTestProject cmake option because it's deprecated more than one year and nobody is using it.
8. Move opaque api tests to onnxruntime_test_all
9. Enable "/W4" on CUDA ep's C++ code(Not the *.cu files), and fix some warnings, add some extra checks.
10. Delete the onnxruntime::test::TestEnvironment class.
11. Add a DLLmain for onnxruntime.dll. 
12. Allow dynamic link to libprotobuf
2020-02-03 19:33:14 -08:00
Yufeng Li
1d79926d27
Add int64_t support for split (#2944)
* Add int64_t support for split
2020-01-31 15:25:42 +10:00
edgchen1
79f1756d8e Removed build.py unused test_data_url and test_data_checksum options. 2020-01-30 16:14:14 -08:00
Changming Sun
1e4080061b
Added support for double in batch norm (#2941) 2020-01-30 13:54:38 -08:00
Scott McKay
51595d6a4a Add Int64 support to Neg 2020-01-30 10:32:43 -08:00
Adam Pocock
c23bd93ea5 The closed variable in OrtEnvironment should be volatile to ensure threads see changes to it. 2020-01-29 23:39:37 -08:00
Adam Pocock
4cc0031177 Java - Fixed a reference counting bug in the OrtEnvironment close method. Added a unit test for the bug. 2020-01-29 23:39:37 -08:00
Adam Pocock
7bb5c357a8 Adding some unicode to the String inference test to make sure it doesn't fall prey to #2690. 2020-01-29 23:39:37 -08:00
Yufeng Li
f33759883c
keep output of bert model as float32 when converting to float16 (#2913) 2020-01-29 18:21:43 -08:00
Mina Asham
a319ad29b3 Do not link onnxruntime jni output with JNI_LIBRARIES
- Linking onnxruntime with JNI_LIBRARIES includes some unnecessary links to native libraries (e.g. libawt) which are not actually used or required by the output onnx library. This causes unsatisfied link exceptions when trying to load the onnx library without including these libraries.
2020-01-29 17:07:07 -08:00
Adam Pocock
8a1de1a582 Java - requesting JNI_VERSION_1_6 rather than JNI_VERSION_1_8 to support Android. 2020-01-29 16:09:40 -08:00
Ryan Hill
f60badc1f3
Documentation for API Breaking Changes (#2927)
* Documentation for API Breaking Changes
* Add version 2 of the API, plus update documentation
* Add a static assert to ensure version 1 of API we shipped does not change in size.
2020-01-29 15:55:29 -08:00
Hariharan Seshadri
1239de3efe
Fix output shape computation in ConvTranspose op (#2688)
* Fix output shape computation in ConvTranspose op

* Update test
2020-01-29 15:42:43 -08:00
niklas-mohr-jdas
e03e6676c0
Add support for int64 to int64 LabelEncoder (#2926) 2020-01-29 20:00:38 +10:00
jignparm
645d8fb213
Jignparm/upgrade macos vm image (#2928)
* update MacOS image to 10.14

* Update to macos 10.14
2020-01-28 20:24:45 -08:00
daquexian
653ce36ccf Disable stacktrace on android 2020-01-28 11:06:46 -08:00
James Yuzawa
49ce4891bc Add noexecstack linker flag 2020-01-28 10:28:56 -08:00
Changming Sun
0279682147 Add document for onnxruntime server. 2020-01-27 13:39:41 -08:00
Maher Jendoubi
546d8f71ab Contributing: fix typos (#2905) 2020-01-27 13:39:08 -08:00
Saquib Nadeem Hashmi
daff4240f0 Updated README.md (#2910)
Corrected spelling mistake.
2020-01-27 13:37:22 -08:00
Yufeng Li
cd876720d9
Only fuse when output count of add is 1 (#2884)
* Only fuse when output count of add is 1

* add unit test for add with multi output
2020-01-24 13:47:34 -08:00
Scott McKay
a92e924ab2
Revert "Use IArenaAllocator::Reserve for initializers and mem pattern planner blocks (#2835)" (#2904)
This reverts commit 724ff0753b.
2020-01-24 14:02:30 +10:00
Changming Sun
e0c9cdaa73
Fix the nuget pipelines (#2901) 2020-01-23 20:02:18 -08:00
Tracy Sharpe
17b72d5578
Fix NCHWc BatchNormalization regression (#2903)
Fix the BatchNormalization optimization in the NCHWC optimizer. If the node has the optional training outputs specified, then skip the transform.
2020-01-23 18:54:11 -08:00
Jeff
ba336b5583
Disable DML EP on software adapter, fix float16 fallback bug, re-enable DML in CI (#2896)
* Re-enable DML in CI pipeline

* Fix bug with float16 fallback + fusion, and disallow DML EP with software adapter

* Address PR comments
2020-01-23 15:18:28 -08:00
Changming Sun
201b089a36
Fix some warnings on Windows (#2560)
1. Enable warning "4503" # Decorated name length exceeded.
2. Enable warning "4146" # unary minus operator applied to unsigned type.
3. Enable float64 support for the Softmax operator
4. Enable compliance checks for Windows x86 32bits build
5. Use TryBatchParallelFor to replace some fallback code in mlas pooling.cc
6. Fix Android CI pipeline.
2020-01-22 15:59:11 -08:00
Pranav Sharma
49725f896c
Disable openmp for the nocontribops pipeline. (#2888) 2020-01-22 12:07:44 -08:00
Scott McKay
fc51473b09
Update BFCArena logic to use backoff if cudaMalloc fails. Makes behaviour equivalent to when a CPU allocation fails. Add unit test. (#2748)
Clear error when throwing an exception for a failed CUDA call so that there is only one error mechanism being used at a time.
Minor improvements to logging to aid debugging of BFCArena behaviour.
2020-01-22 14:21:21 +10:00
edgchen1
061f10fcd5 Fixed typo in ORT_RETURN_IF_NOT() message. (#2862) 2020-01-21 20:03:41 -08:00
Scott McKay
9f5e8c4ae8
InferenceSession::Run needs to call OnRunEnd for any EP that OnRunStart was called for so they can cleanup. Currently it only calls OnRunEnd if the Status is OK. Due to this the CUDA EP will throw during shutdown as the per-thread information has not been cleaned up prior to the CUDA library shutting down. (#2881)
Also update onnxruntime_perf_test to catch the exception from the call to Run and return a Status. Otherwise it exits with an 'unknown exception' error.
2020-01-22 12:17:52 +10:00
RandySheriffH
38b34babe0
Rashuai/boost cuda TopK performance (#2826)
* Implement Bitonic and Radix TopK

* remove needless print out

* fix com err

* add negative support

* fix comments

Co-authored-by: Randy <45701928+RandyShuai@users.noreply.github.com>
2020-01-21 13:40:38 -08:00
Tracy Sharpe
08113b80cc
Optimize BatchNormalization to NCHWc Conv (#2855)
Update the NCHWc transformer to convert BatchNormalization ops to NCHWc convolutions when the input tensor is already in NCHWc.
2020-01-20 16:35:03 -08:00
Ashwini Khade
807a59c55d
Add calibration tool (#2845)
* add calibration tool

* add model for e2e example

* format readme

* some more formatting updates

* plus a few more updates

* plus review comments

* plus updates

* more updates
2020-01-20 14:49:35 -08:00
Xavier Dupré
22d9f3998e
Fix positive raw scores for TreeEnsembleClassifier (#2824)
Fix positive raw scores for TreeEnsembleClassifier
2020-01-20 16:48:37 +01:00
Hariharan Seshadri
b21576eeb0 Support non-sequence tensor fed through as a python list (#2782)
* Support list feeds in Python
2020-01-20 09:45:10 +10:00
KeDengMS
f9f25ec047
Fix spurious component detection warning (#2857)
Fix spurious component detection warning
Use component detection template for all pipelines
2020-01-18 20:10:35 -08:00
Yufeng Li
25d7ad187f
Add float16 support back in the bert fusion script (#2870)
* Add float16 support back in the bert fusion script

* update readme
2020-01-17 20:00:39 -08:00
Yufeng Li
95f3eb6aeb
Bert fusion script for Tensorflow squad (#2858) 2020-01-17 15:27:04 -08:00
Tracy Sharpe
01f3a33c38
update protoc path to match protobuf version (#2865) 2020-01-17 14:48:39 -08:00
Changming Sun
e6f7658ade Update Windows GPU build to use cudnn 7.6 2020-01-17 12:23:13 -08:00
Pranav Sharma
3853ddf9c7
Fix topk type handling to accommodate more types. (#2842)
* Fix topk type handling to accommodate more types + add unit test for int64_t.

* Fix Linux build
2020-01-17 11:57:29 -08:00
Changming Sun
47e27ec9a1
Disable DML in Windows GPU CI build (#2856)
Disable DML in Windows GPU CI build for now, because there are some wired model test failure and I don't know how to fix it. Will seek help from WinML team.
2020-01-16 18:47:30 -08:00
Scott McKay
724ff0753b
Use IArenaAllocator::Reserve for initializers and mem pattern planner blocks (#2835)
* Use IArenaAllocator::Reserve for initializers and mem pattern planner blocks.
2020-01-17 07:41:48 +10:00
Tracy Sharpe
928b6bb210
MLAS: enable threading for quantized GEMMs (#2844) 2020-01-15 19:25:40 -08:00
Tianlei Wu
5db8543018
update optimization doc for BERT related fusions (#2819)
* Add bert related transformers to doc
* Add execution provider and comment for bert optimizations
* Add comment about accuracy impact of approximation
2020-01-15 16:01:11 -08:00
Changming Sun
56030f8d74 Fix Linux CUDA nuget packaging pipeline break 2020-01-14 21:13:41 -08:00
Tiago Koji Castro Shibata
cff266e1b9 Fix cgmanifest.json generating script (#2770)
* Fix protobuf submodule name

* Workaround pygit2 bug
2020-01-14 14:59:07 -08:00
Ori Levari
db05436fc0 User/orilevari/32bit comparison warning (#2800)
* use correct type for for loop

* explicitly specify void for parameters of OrtGetApiBase because the function is defined in c, so when the function is just (), it is interpreted as having an unknown number of parameters. This was causing compiler warning C4276.
2020-01-14 14:59:07 -08:00
Ashwini Khade
8643f3ebbb
add domain check for nodes + update documentation (#2831) 2020-01-14 11:15:50 -08:00