Commit graph

896 commits

Author SHA1 Message Date
Hariharan Seshadri
fc9a895b46
Add shape inference logic for Crop (contrib) op (#1157)
* Add shape inference logic for Crop contrib op

* Fix build break

* More refinements

* PR feedback

* PR feedback 2
2019-06-13 12:45:09 -07:00
Ryan Hill
6c17567d7b
Add C++ headers to nuget package (#1218) 2019-06-13 11:38:19 -07:00
Scott McKay
065e9dc1ba
Block size mismatches are expected if sequence length varies or there are NonZero ops. Reduce log severity of message due to that. (#1211) 2019-06-13 19:13:31 +10:00
Scott McKay
fa2eea7339
Use cmath instead of math.h for fabs (#1217)
* Use cmath instead of math.h so qualify some fabs calls with std:: due to change.

* Remove accidental text
2019-06-13 10:58:49 +10:00
Scott McKay
c1a34a8ba6
Add ability to dump node input/output (#1202)
Address #1155

Add debug helper methods to be able to dump input name and shape information for node inputs, and the data from node outputs.

As the input data comes from graph inputs, initializers or node outputs we don't dump it.

Must be manually enabled by building with '--cmake_extra_defines onnxruntime_DEBUG_NODE_INPUTS_OUTPUTS=ON'
2019-06-13 06:47:50 +10:00
Hector Li
bbdd1d658b
use cudnnRNNForwardInferenceEx for unpacked (padded) layout case (#899)
1. Use cudnnRNNForwardInferenceEx for unpacked (padded) layout case, place the sequence_lens data on CPU
2. Fix hard code device ID issue. In cuda kernel, it should get the device id from provider.
2019-06-12 09:51:15 -07:00
Ryan Hill
38963d81eb
Simplify some CustomOp code (#1206)
test_inference.cc reformatted with clang-format
2019-06-11 17:00:04 -07:00
Scott McKay
87d65389e6
Add ability to change the logging severity of the default logger. (#1165)
Add ability to set the session and run logger severity via SessionOptions and RunOptions
Inherit severity from the next logger up if logger severity isn't specified in SessionOptions or RunOptions
Expose ability to set default logger severity in python bindings.
2019-06-12 08:54:03 +10:00
Changming Sun
15bcde5053
Fix a build break in tf_test_session.h (#1205)
This file need tensorflow C API package to build, so it is not part of the CI.
2019-06-11 14:05:24 -07:00
Changming Sun
6d5ea08936
Add nsync lib to onnxruntime_mlas_test's deps (#1199)
* Add nsync lib to onnxruntime_mlas_test's deps
2019-06-11 13:01:16 -07:00
edgchen1
0b9b429fe1
CPU GRU and LSTM Ops: Address corner case where output is uninitialized (#1193)
* Updated CPU GRU to zero Y output between max specified sequence length and max sequence length implied by input shape.

* Updated CPU LSTM to zero Y output between max specified sequence length and max sequence length implied by input shape.

* Disabled LSTMTest.ONNXRuntime_TestLSTMSequenceLengthShorterThanInputSequenceLength for nGraph execution provider and added TODO to investigate failure.
2019-06-11 09:58:34 -07:00
Sreekanth Yalachigere
24d6b0f5c4 MKL-DNN Subgraphs (#1116)
* subgraph with memcpy fix

* Linux compile errors fix

* Linux compile errors fix

* subgraph with memcpy fix

* Linux compile errors fix

* Linux compile errors fix

* memcpy (PR1020) fix implemented

* check graph viewer GetNode for nullptr at other plances

* documents

* Review changes (UseSubgraph simplified)

* static_cast<int> removed

* static_cast<int> removed 2

* fall back to CPU implementation in GetCapability()

* check shape for null. fall back to CPU implementation in GetCapability()

* backend data errors fixed

* PR review changes

* disable Opset10 tests

* removed tests from main.cc of test runner. added a check at GetCapability()

* backend data and Model-Zoo related fixes

* subgraph with memcpy fix

* Linux compile errors fix

* Linux compile errors fix

* subgraph with memcpy fix

* Linux compile errors fix

* memcpy (PR1020) fix implemented

* documents

* Review changes (UseSubgraph simplified)

* static_cast<int> removed

* fall back to CPU implementation in GetCapability()

* check shape for null. fall back to CPU implementation in GetCapability()

* backend data errors fixed

* PR review changes

* disable Opset10 tests

* removed tests from main.cc of test runner. added a check at GetCapability()

* backend data and Model-Zoo related fixes

* patch to run tests and models separatly
2019-06-10 20:18:56 -07:00
Ryan Hill
3c3186c761
Convert more C APIs to return OrtStatus (#1194)
* Change SessionOptions APIs to always return a status, for consistency and ease of use (a couple returned 0 or -1 for success/failure)
2019-06-10 18:36:04 -07:00
RandySheriffH
011957995e
add immutable exclude list (#1181)
* add immutable exclude list

* trigger callback

* switch to set for failed cases

* tweak conditions

* restore space

* cancel judgement

* fix comment

* restore onnx install sequence

* better comment and set const string

* set set const

* revert change on install_onnx.sh
2019-06-10 17:32:42 -07:00
Dmitri Smirnov
a92998c235
Uncomment ConstantOfShape tests. (#1059)
Advance ONNX submodule to 5c51f0dbbe88ee1536f17ee7bd462b2ab3772c52
  This commit in ONNX contains a fix to ConstantOfShape test data.
  Uncomment ConstantOfShape.
  Update test script, make sure exclusions are uniform.
2019-06-10 14:36:36 -07:00
ybrnathan
f72c1501c5
Optimize concat PrepareForCompute perf by reserve vector size. (#1198) 2019-06-10 12:54:17 -07:00
shahasad
97dfd5ee21
Add code coverage (#1192)
* added the runcoverage powershell script

* updated the run coverage script. added installation to the windows CI for trying

* exclude other parts of win ci

* fix in the download script

* fix in the download script

* fix in the download script

* fix in the download script

* fix in the download script

* fix in the download script

* fix in the download script

* fix in the download script

* fix in the download script

* added the runtestcoverage script to the pipeline

* some typo fix

* formatting

* re-commenting previously commented block

* cleaned up the powershell script

* fix path in pipeline

* fix path in pipeline

* fixed model path

* some fixes

* excluded long running tests

* add the publish job

* uncomment other tasks

* fixed excluded tests

* some format correction

* stopped running the test debug

* try placing the tes-all at the beginning

* try running the failing test only

* edit run_coverage

* some fix

* skip onnx_model_test

* Added memory size log in powershell script

* try running the onnxruntime_test_all.exe separately from codecov

* enable error reporting, and double memory size in powershell

* corrected the set-item

* remove memory resize, since we are already at max 2 GB

* fixed the tvm.dll issue

* added back the onnx tests in codecov. added back the regular test run

* cleanup

* remove * from the the module path

* add junction target resolution for modules dir

* remove junction-resolution

* reduced tests

* added target extraction for the junction paths in build machine

* added the appropriate change in win ci pipeline to call the updated ps script

* fix typo

* added back all the tests that were disabled

* try fixing the source root

* cleanup and enable all tests

* increase timeout for windows CPU CI due to codecoverage

* templatized the code coverage steps. Conitnue on error with any codecoverage step

* change quote marks
2019-06-09 22:30:41 -07:00
Changming Sun
280ab9a2d0
Let mlas use the session threadpool for gemm functions (#1196) 2019-06-09 19:57:04 -07:00
Changming Sun
be36385a8c
Delete docker/scripts/install_deps_x86.sh and enable onnx tests for x86 (#1191) 2019-06-08 16:17:18 -07:00
Raymond Yang
6b586bc041 Avoid warning status in python release pipeline (#1195) 2019-06-08 00:22:32 -07:00
gapar2
71466fc805 Fixes two broken links. (#1081) 2019-06-07 20:20:02 -07:00
Changming Sun
ccab8165eb
Delete scripts/install_ubuntu_x86.sh (#1189)
* Delete scripts/install_ubuntu_x86.sh to reduce duplicated code
2019-06-07 15:48:52 -07:00
Konstantinos Karanasos
32c6c71e86 Convert Shape operator to initializer (#1159)
This PR introduces a rewrite rule that replaces a Shape node with an initializer when the shape of the input is statically known through shape inference.
2019-06-07 14:15:19 -07:00
RandySheriffH
cdb27de090
implement python opeartor (#1045)
* implement python opeartor

* format code

* remove dup

* limit type

* format code

* cancel default logging func

* add comment

* fix compile err

* fix comments

* switch to c++ style cast

* implement interop framework

* fix format

* move includes

* cancel needless linking

* fix comment

* add UT

* exclude def
2019-06-07 11:50:23 -07:00
Hector Li
8d68098c20
ConstantOfShape CUDA implementation (#1168)
* ConstantOfShape CUDA implementation

* Enhance the fallback logic, so the case that Shape -> ... -> ConstantOfShape won't fallback ConstantOfShape to CPU provider

* move shared code to cpu implementation

* do the fill based on sizeof(data_type)

* update method access level
2019-06-07 11:41:58 -07:00
Dmitri Smirnov
e43e64bf84
Implement Equal for CUDA. (#1183) 2019-06-07 11:11:50 -07:00
Xavier Dupré
d33dbb23b2
replace onnxmltools by keras-onnx in one example (#1151) 2019-06-07 12:03:46 +02:00
Changming Sun
d8ac0d64d0
Make C API capable of defining CUDA custom ops (#1178)
Recreate the PR on behalf of Rui Xia, for #779
2019-06-06 13:45:32 -07:00
Ryan Hill
b68bb51dd0
Change SessionOptions APIs to always return a status (#1171)
* Change SessionOptions APIs to always return a status, for consistency and ease of use (a couple returned 0 or -1 for success/failure)
2019-06-06 13:24:24 -07:00
G. Ramalingam
b23ab6a06e
Implementation of sparse tensor (#1121)
* Initial implementation of sparse tensor

* minor cleanup

* minor cleanup (remove empty line)

* simplify template usage in test-case

* address linux build error

* fix constructor order to address compiler warning

* Address PR comments

* handle allocation in optimizer execution frame

* address compiler warning message and PR feedback comment

* address gcc unused warning for protobuf code

* address PR comment
2019-06-06 11:50:38 -07:00
Dmitri Smirnov
7a80770b52
Remove filtering from the backend scripts runner (#1169)
Remove filtering from the backend scripts runner as we only run it
  on the tip of onnx.
2019-06-06 10:04:55 -07:00
Klein Hu
1a86421aff Create a syslog sink for logging in !Win32 env (#1163)
* Create a syslog sink for logging in !Win32 env

* Move syslog level logic to syslog_sink.c
2019-06-06 16:35:06 +10:00
Changming Sun
88ea58a383
Add double tensor support for Div and Sub (#1172)
Needed by the preprocessing step of inception models.

TF source code: https://github.com/tensorflow/models/blob/master/research/slim/preprocessing/inception_preprocessing.py
2019-06-05 21:10:09 -07:00
Scott McKay
c6abb17b8d
Use shared threadpool in LSTM (#1167) 2019-06-06 07:16:31 +10:00
Ryan Hill
7cd2d9f3c4
Change Path_lib to not use Shlwapi.dll if compiled as WINAPI_PARTITION_APP (#1161) 2019-06-05 11:05:33 -07:00
KeDengMS
7c4494a0bc
Fix CUDA thread_local to allow multiple CUDA execution providers (#1149)
This fixes #1034: Can't Create Model Sessions on Different GPU
The root cause of the bug is that CUDA execution provider uses thread_local to save per-thread-context and allocator, and when two CUDA execution providers are running on the same thread there's a conflict. The fix is to add a std::unordered_map to differentiate EPs in the same thread.
2019-06-04 16:32:27 -07:00
Hariharan Seshadri
a863c67ef8
Fix error message for legacy opset (6 and lower) models at model load time (#1147)
* Initial commit

* Resolve comments and revert changes in files that are not needed for this change

* Single line addition revert

* Resolve comments
2019-06-04 11:56:49 -07:00
jywu-msft
7316e54153 re-enable disabled tests on nGraph after fixing remaining subgraph resolve error (#1158)
* fix subgraph resolve error for nGraph.

* input_args need to be processed in order.
2019-06-04 09:40:24 -07:00
Ryan Hill
148141dd5f Change Compute function to return a status code instead of an integer. (#1139) 2019-06-04 08:34:32 -07:00
Changming Sun
c18de6817b
Rename MLValue to OrtValue (#1154) 2019-06-03 17:29:55 -07:00
Hector Li
b8a699f70b
Update MaxPool & AveragePool to support opset 10 (#1141)
* Update MaxPool & AveragePool to support opset 10

* fix build issue

* still use cudnn for MaxPool if dilation is not set or are default 1.

* fix build issue
2019-06-02 20:42:57 -07:00
Torkel
10ea77a3d1 add details aboud adding execution providers in the C api to comments and docs (i.e. need OrtSessionOptionsAppendExecutionProvider_CUDA to get CUDA) 2019-06-02 17:38:36 -07:00
Changming Sun
6c9d815de5 Revert "Remove openmp flag (#1140)" (#1146)
This reverts commit a7137a0f9d.
2019-05-31 18:48:14 -07:00
RandySheriffH
71cbd7679c
implement cuda slice opset 10 (#1137)
* implement cuda slice opset 10

* remove needless delaration
2019-05-31 13:38:19 -07:00
Changming Sun
a7137a0f9d
Remove openmp flag (#1140)
* Remove openmp flag
2019-05-31 12:41:08 -07:00
Du Li
05110a6558
Adding custom op ConvTransposeWithDynamicPads. (#638)
* Adding custom op ConvTransposeWithDynamicPads.

* Adding custom op ConvTransposeWithDynamicPads.

* adding cuda kernels

* fix a bug

* fix build issue.

* Integrate PR comments.
2019-05-31 11:48:43 -07:00
Klein Hu
6c408c3a75 Simplify ONNX Runtime Server CI build (#1136) 2019-05-30 17:31:41 -07:00
Hariharan Seshadri
facdf77f84
Fix shape inference bug in GatherND contrib op (#1132) 2019-05-30 17:05:44 -07:00
RandySheriffH
4757933afe
Exclude test by onnx version tag (#1073)
* add version filter to failed tests

* exclude test from backend

* exclude shrink from opset 9

* fix compile err

* exclude certain version of constant shape

* enable flatten test

* fix compile err

* comment mvn test

* disable constantofshape test in x86

* disable x86 test

* get model version from imported opset

* test linux x86 case

* disable nonzero opset 10

* make mutex const

* test filter by commit id

* adjust substr offset

* Limit test platform

* remove change impacting TFModleInfo.h

* refactoring

* refactoring

* test x86 pipeline with filter

* add comment

* restrict version extraction on non-win

* restrict version extraction on non-win

* add tag

* exclude case from backend test

* remove dup

* remove dup

* make script runnable

* hard code adsolute path

* refactor log

* fix x86 compile err

* fix x86 compile err

* fix x86 compile err

* sync with latest tensorrt

* switch to regex

* fix cpu pipeline err

* test filter

* disable nonzero from all versions
2019-05-30 16:19:06 -07:00
Ryan Hill
f9f6818e4c
Add comments and organize the C++ header into the main header plus a separate one of the inline methods. (#1130) 2019-05-30 14:24:25 -07:00