Commit graph

1498 commits

Author SHA1 Message Date
Dmitri Smirnov
88c58c19d4
Improve code readability and performance. (#2257)
  Remove one time checks from loops.
  Move out GetType<>() calls from loop as they
  go through local function statics.
  Get rid of index calculations from input and output
  so we can simply advance ptrs and potentially do better pre-fetch.
  Improve code readability.
2019-10-25 16:19:59 -07:00
Changming Sun
ce14b07b1c
Fix the GPU nuget pipeline failure (#2255) 2019-10-25 13:55:38 -07:00
Sreekanth Yalachigere
c7599d0705 subgraph parse error fix (#2254) 2019-10-24 22:14:30 -07:00
shahasad
6a0ee7eff6
Fix model path marshalling in csharp, and re-enable the pretrained model tests (#2236) 2019-10-24 20:39:16 -07:00
edgchen1
8be48f47dd
Added Graph::ReplaceInitializedTensor() function. (#2230)
Added Graph::ReplaceInitializedTensor() function to support updating existing initializers.
2019-10-24 19:30:24 -07:00
Ryan Hill
7b4e6636f8 Add default params to Env C++ constructor (#2246) 2019-10-24 17:20:11 -07:00
Tomasz Dołbniak
63acd4e89b Adjust the nGraph EP to the newest CI test data (#2180)
* Adjust the nGraph EP to the newest CI test data

* Increase the linux pipeline timeout for nGraph
2019-10-24 16:44:03 -07:00
Ashwini Khade
e250e51e65
clear cache when moving to next channel (#2253) 2019-10-24 16:07:45 -07:00
Ryan Hill
7494500221 Fix csharp CXX sample (#2251) 2019-10-24 15:47:51 -07:00
suryasidd
f7b4bc15e1 Updated documentation for VAD-F (#2248)
Signed-off-by: suryasidd <surya.siddharth.pemmaraju@intel.com>
2019-10-24 14:31:44 -07:00
Scott McKay
20e6a2b6da
Disable optimizers for OpTester operator unit tests (#2237)
* Disable optimizers for operator unit tests as they're intended to test the operator directly rather than something that could have been modified by an optimizer.

Disable TensorRT for Scan9 unit tests that fail when optimizers are enabled. Bug 525222 tracks that.

* Disable TRT for the lenient shape inferencing test as it uses Unsqueeze and TRT doesn't cope with that op.
2019-10-24 11:37:09 -07:00
Ryan Hill
77d8d6f767
Remove the OrtApiBase base_ member from OrtApi (#2242)
* Remove the OrtApiBase base_ member from OrtApi

* Forgot about C#
2019-10-24 11:36:23 -07:00
edgchen1
6a27cb5ad6 Fixed tensor reference to const data and cleaned up Env API. (#1979) 2019-10-24 10:28:13 -07:00
Scott McKay
557243dce9
Add handling of zero dim to broadcasting and a few other ops (#2217)
Handle dim value of zero in broadcasting, reduce, pool, conv, onehot, clip, *softmax, transpose ops on CPU and GPU. Add unit tests for all changes.
2019-10-24 10:17:14 -07:00
Dmitri Smirnov
fef721c4f2
Update build instructions. Make samples build and run. (#2244)
Making samples build. Update build instructions.
2019-10-24 10:06:47 -07:00
kile0
bede664af7 mimalloc allocator (#2071) 2019-10-23 22:34:00 -07:00
Tracy Sharpe
90858b732a handle Concat-11 (#2240) 2019-10-23 19:12:40 -07:00
Pranav Sharma
eb75388cd5
Minor optimization: don't log all node names if they've all been assigned to a single provider. (#2229) 2019-10-23 15:16:15 -07:00
Scott McKay
41d55ea274
Update the GraphProto for subgraphs when saving the Graph. (#647)
* Update the GraphProto for subgraphs when saving the Graph. This is required to produce a valid overall Graph if the Graph has been optimized.
2019-10-23 15:14:06 -07:00
Ryan Hill
6fca8b0a94 Move CXX API global into the header (#2228) 2019-10-23 14:15:53 -07:00
Nathan
90b6ff927f Implement EyeLike Cuda Kernel to improve performance (#2139) 2019-10-23 13:48:39 -07:00
Klein Hu
ac9425c119 Update ORTSRV integration test model path (#2225)
* Update ORTSRV integration test model path
2019-10-23 13:48:20 -07:00
Changming Sun
4b62241c77
Update ONNX to 1.6.1 (#2235) 2019-10-23 13:47:45 -07:00
Ashwini Khade
81d901cb60 remove nuphar scripts (#2233) 2019-10-23 13:47:26 -07:00
Jeff
ab39f7ec99
Jeffbloo/fix dml rnn failures (#2234)
* Address a possible cause of incorrect DML kernel registrations and re-enable tests

* Re-enable DML build
2019-10-23 13:46:16 -07:00
Sreekanth Yalachigere
041a1cab54 update libmkldnn.so version (#2220) 2019-10-22 07:23:41 -07:00
Pranav Sharma
6ac3e31047 Avoid printing a misleading message exposed by a previous commit + register float type for Slice cuda op. (#2219) 2019-10-22 07:23:05 -07:00
Pranav Sharma
04b8a4bb34 Log node placement info in verbose logging mode + turn on verbose mode in onnx_test_runner -vv switch. (#2221) 2019-10-22 07:22:18 -07:00
Pranav Sharma
384c686f40 Update README and other files with the correct cuda version used for 1.0 release. (#2222) 2019-10-22 07:21:48 -07:00
Negin Raoof
ff2c4a4d7e Added more input data types for pad (#2179)
* added more input data types for pad

* replacing the comments

* replacing the comments

* added first set of tests

* added tests

* added more tests

* keep NGRAPH test

* avoid type cast

* avoid type conversion for value float to T

* fixed tabs

* Update tests exclusion list

* Nits

* comments fix

* Format files

* Nit updates

* rebased
2019-10-22 00:41:35 -07:00
Ashwini Khade
b23cc04f34 Handle the case when scales is constant but size is 0 (#2218) 2019-10-21 23:28:43 -07:00
edgchen1
856c6cae0a Edgchen1/endian utils (#2181) 2019-10-21 22:28:35 -07:00
Scott McKay
3507197599 The Conv+BN fusion is missing a check that the BN node does not have optional outputs. If it does, it cannot be fused with the Conv node. (#2216)
Add the necessary checks.
Trim the output defs in case there are any optional outputs (already checked they don't exist so known to be unused) before finalizing fusion as we copy those to the Conv node to maintain the output names.

Add unit tests for both cases.
2019-10-21 21:32:23 -07:00
manashgoswami
b344670351 Updated Perf Test readme (#2212) 2019-10-21 21:28:29 -07:00
Hariharan Seshadri
5eb42f4452
Add uint8 support for BitShift operator (#2214)
* Add uint8 support for BitShift operator

* Remove more tests from exclusion

* Updates
2019-10-21 21:10:29 -07:00
Scott McKay
91122a2cf5 Fix GELU fusion (#2213)
* Split graph_utils methods for finalization of fusion in order to support more than 2 nodes being fused into one.
Update GELU fusion to use graph_utils to set up the input/output edges for the fused node, and to remove nodes that are being replaced.
2019-10-21 20:18:45 -07:00
Changming Sun
aef055ebe8
Update nuget pipeline to use CentOS6 (#2211) 2019-10-21 17:55:36 -07:00
Faith Xu
303a78c301 Update Python documentation (#2210) 2019-10-21 16:56:31 -07:00
Hariharan Seshadri
e8ec52814f Remove some node tests from exclusion (#2168) 2019-10-21 16:22:41 -07:00
Hector Li
cdcca43aa3
Skip GemmNoTrans_f16 test for CUDA if the hardware does not support fp16

Motivation and Context
Unblock the multi_gpu build pipeline. The build agent uses Nvidia K80 GPU which doesn't have fp16 support.
2019-10-21 13:15:20 -07:00
Ashwini Khade
d197079473
quantization script updates (#2208) 2019-10-21 12:25:52 -07:00
Scott McKay
b6b44c90ac Fix MaxUnpool behaviour when output_shape is provided and doesn't match the inferred shape (#2193)
* MaxUnpool should reconstitute what was pooled by MaxPool. The kernel_shape, pads and strides inputs are purely to infer an output shape, if output_shape is not explicitly provided.

The unpool should not be adding new padding, so output_shape is not about auto generating pad values and inserting pads.

The current ORT implementation misinterprets the usage of output_shape and inserts pads instead of just allocating an output of the specified shape, and directly unpooling into it.

Update to simply find the correct output shape to use, and simply unpool into that.

Update unit tests to reflect this.

* Exclude maxunpool_export_with_output_shape which has invalid data in the output.

* Fix test name in backend test series exclusion
2019-10-21 11:56:13 -07:00
Scott McKay
6699c19010 Add script to find if optimizer updates are required due to onnx operator updates. (#1957)
* Add script to find calls to graph_utils::IsSupportedOptypeVersionAndDomain where the latest supported version is prior to the latest defined version.
2019-10-21 11:38:30 -07:00
Ashwini Khade
dc5efbf5ce
enable exclude outside for resize op (#2203)
* enable exclude outside for resize mode

* fix centos error

* updates per review, plus more data types for resize

* fix typo in error message

* reset wrong fix
2019-10-21 11:18:33 -07:00
Scott McKay
3cda9f717b Relax shape inferencing error handling if model uses an old opset (#2199) 2019-10-21 10:51:22 -07:00
Paul McDaniel
02dc3a9dcb build break for arm64, adding advapi32.lib (#2206) 2019-10-21 08:48:28 -07:00
shahasad
fcf50ca081 Fix nuget mklml pipeline (#2204)
* some fixes on nuget CPU pipeline

* revert d738c89536

* fix for MKLML package

* fix if else
2019-10-21 08:46:28 -07:00
Scott McKay
5c86889beb
Fix linux build issue with debug dump of shapes and data. (#2202)
Add option to dump just shapes or shapes and data.
2019-10-20 20:35:48 -07:00
Hariharan Seshadri
07e9f500da
Add back Keras Mask RCNN to CUDA test exclusion (#2201) 2019-10-20 17:13:02 -07:00
Nathan
aae18a3fe3 Upgrade onehot to OpSet 11 (#2185)
* Upgrade onehot to OpSet 11

* Move Onehot test out of blacklist

* Add negative indices support besides negative axis.

* PR comments - 1

* PR comments-2
2019-10-20 10:44:20 -07:00