Commit graph

1485 commits

Author SHA1 Message Date
Scott McKay
557243dce9
Add handling of zero dim to broadcasting and a few other ops (#2217)
Handle dim value of zero in broadcasting, reduce, pool, conv, onehot, clip, *softmax, transpose ops on CPU and GPU. Add unit tests for all changes.
2019-10-24 10:17:14 -07:00
Dmitri Smirnov
fef721c4f2
Update build instructions. Make samples build and run. (#2244)
Making samples build. Update build instructions.
2019-10-24 10:06:47 -07:00
kile0
bede664af7 mimalloc allocator (#2071) 2019-10-23 22:34:00 -07:00
Tracy Sharpe
90858b732a handle Concat-11 (#2240) 2019-10-23 19:12:40 -07:00
Pranav Sharma
eb75388cd5
Minor optimization: don't log all node names if they've all been assigned to a single provider. (#2229) 2019-10-23 15:16:15 -07:00
Scott McKay
41d55ea274
Update the GraphProto for subgraphs when saving the Graph. (#647)
* Update the GraphProto for subgraphs when saving the Graph. This is required to produce a valid overall Graph if the Graph has been optimized.
2019-10-23 15:14:06 -07:00
Ryan Hill
6fca8b0a94 Move CXX API global into the header (#2228) 2019-10-23 14:15:53 -07:00
Nathan
90b6ff927f Implement EyeLike Cuda Kernel to improve performance (#2139) 2019-10-23 13:48:39 -07:00
Klein Hu
ac9425c119 Update ORTSRV integration test model path (#2225)
* Update ORTSRV integration test model path
2019-10-23 13:48:20 -07:00
Changming Sun
4b62241c77
Update ONNX to 1.6.1 (#2235) 2019-10-23 13:47:45 -07:00
Ashwini Khade
81d901cb60 remove nuphar scripts (#2233) 2019-10-23 13:47:26 -07:00
Jeff
ab39f7ec99
Jeffbloo/fix dml rnn failures (#2234)
* Address a possible cause of incorrect DML kernel registrations and re-enable tests

* Re-enable DML build
2019-10-23 13:46:16 -07:00
Sreekanth Yalachigere
041a1cab54 update libmkldnn.so version (#2220) 2019-10-22 07:23:41 -07:00
Pranav Sharma
6ac3e31047 Avoid printing a misleading message exposed by a previous commit + register float type for Slice cuda op. (#2219) 2019-10-22 07:23:05 -07:00
Pranav Sharma
04b8a4bb34 Log node placement info in verbose logging mode + turn on verbose mode in onnx_test_runner -vv switch. (#2221) 2019-10-22 07:22:18 -07:00
Pranav Sharma
384c686f40 Update README and other files with the correct cuda version used for 1.0 release. (#2222) 2019-10-22 07:21:48 -07:00
Negin Raoof
ff2c4a4d7e Added more input data types for pad (#2179)
* added more input data types for pad

* replacing the comments

* replacing the comments

* added first set of tests

* added tests

* added more tests

* keep NGRAPH test

* avoid type cast

* avoid type conversion for value float to T

* fixed tabs

* Update tests exclusion list

* Nits

* comments fix

* Format files

* Nit updates

* rebased
2019-10-22 00:41:35 -07:00
Ashwini Khade
b23cc04f34 Handle the case when scales is constant but size is 0 (#2218) 2019-10-21 23:28:43 -07:00
edgchen1
856c6cae0a Edgchen1/endian utils (#2181) 2019-10-21 22:28:35 -07:00
Scott McKay
3507197599 The Conv+BN fusion is missing a check that the BN node does not have optional outputs. If it does, it cannot be fused with the Conv node. (#2216)
Add the necessary checks.
Trim the output defs in case there are any optional outputs (already checked they don't exist so known to be unused) before finalizing fusion as we copy those to the Conv node to maintain the output names.

Add unit tests for both cases.
2019-10-21 21:32:23 -07:00
manashgoswami
b344670351 Updated Perf Test readme (#2212) 2019-10-21 21:28:29 -07:00
Hariharan Seshadri
5eb42f4452
Add uint8 support for BitShift operator (#2214)
* Add uint8 support for BitShift operator

* Remove more tests from exclusion

* Updates
2019-10-21 21:10:29 -07:00
Scott McKay
91122a2cf5 Fix GELU fusion (#2213)
* Split graph_utils methods for finalization of fusion in order to support more than 2 nodes being fused into one.
Update GELU fusion to use graph_utils to set up the input/output edges for the fused node, and removing nodes that are being replaced.
2019-10-21 20:18:45 -07:00
Changming Sun
aef055ebe8
Update nuget pipeline to use CentOS6 (#2211) 2019-10-21 17:55:36 -07:00
Faith Xu
303a78c301 Update Python documentation (#2210) 2019-10-21 16:56:31 -07:00
Hariharan Seshadri
e8ec52814f Remove some node tests from exclusion (#2168) 2019-10-21 16:22:41 -07:00
Hector Li
cdcca43aa3
Skip GemmNoTrans_f16 test for CUDA if the hardware does not support fp16
Skip GemmNoTrans_f16 test for CUDA if the hardware does not support fp16

Motivation and Context
Unblock the multi_gpu build pipeline. The build agent uses Nvidia K80 GPU which doesn't have fp16 support.
2019-10-21 13:15:20 -07:00
Ashwini Khade
d197079473
quantization script updates (#2208) 2019-10-21 12:25:52 -07:00
Scott McKay
b6b44c90ac Fix MaxUnpool behaviour when output_shape is provided and doesn't match the inferred shape (#2193)
* MaxUnpool should reconstitute what was pooled by MaxPool. The kernel_shape, pads and strides inputs are purely to infer an output shape, if output_shape is not explicitly provided.

The unpool should not be adding new padding, so output_shape is not about auto generating pad values and inserting pads.

The current ORT implementation misinterprets the usage of output_shape and inserts pads instead of just allocating an output of the specified shape, and directly unpooling into it.

Update to simply find the correct output shape to use, and simply unpool into that.

Update unit tests to reflect this.

* Exclude maxunpool_export_with_output_shape which has invalid data in the output.

* Fix test name in backend test series exclusion
2019-10-21 11:56:13 -07:00
Scott McKay
6699c19010 Add script to find if optimizer updates are required due to onnx operator updates. (#1957)
* Add script to find calls to graph_utils::IsSupportedOptypeVersionAndDomain where the latest supported version is prior to the latest defined version.
2019-10-21 11:38:30 -07:00
Ashwini Khade
dc5efbf5ce
enable exclude outside for resize op (#2203)
* enable exclude outside for resize mode

* fix centos error

* updates per review + plus more data types for resize

* fix typo in error message

* reset wrong fix
2019-10-21 11:18:33 -07:00
Scott McKay
3cda9f717b Relax shape inferencing error handling if model uses an old opset (#2199) 2019-10-21 10:51:22 -07:00
Paul McDaniel
02dc3a9dcb build break for arm64, adding advapi32.lib (#2206) 2019-10-21 08:48:28 -07:00
shahasad
fcf50ca081 Fix nuget mklml pipeline (#2204)
* some fixes on nuget CPU pipeline

* revert d738c89536

* fix for MKLML package

* fix if else
2019-10-21 08:46:28 -07:00
Scott McKay
5c86889beb
Fix linux build issue with debug dump of shapes and data. (#2202)
Add option to dump just shapes or shapes and data.
2019-10-20 20:35:48 -07:00
Hariharan Seshadri
07e9f500da
Add back Keras Mask RCNN to CUDA test exclusion (#2201) 2019-10-20 17:13:02 -07:00
Nathan
aae18a3fe3 Upgrade onehot to OpSet 11 (#2185)
* Upgrade onehot to OpSet 11

* Move Onehot  test out of  blacklist

* Add negative indices support besides negative axis.

* PR comments - 1

* PR comments-2
2019-10-20 10:44:20 -07:00
Pranav Sharma
69970d1f2a
Include the new Privacy.md file in all release packages. (#2200) 2019-10-20 07:58:36 -07:00
Scott McKay
cf7ee5f6e8 Support negative axis in unsqueeze elimination (#2158)
* Handle negative axes in UnsqueezeElimination.

* Address PR comments

* Trigger rebuild. Some CI builds are unable to be retried
2019-10-20 01:15:27 -07:00
Konstantinos Karanasos
33c639a022 Slice elimination support for opsets 10 and 11 (#2171)
* work on slice elimination for opset 10

* more work on slice elimination

* first working version

* adding python notebook for building models; fixing test

* fixing build error in macOS
2019-10-20 01:14:55 -07:00
Changming Sun
cff7879d89
Update C API pipeline to use CentOS 6 (#2198) 2019-10-19 22:25:42 -07:00
Nathan
0dd781fd57 Perf tuning doc update with latest API (#2128)
* Update perf tuning md

* Remove AppendExecutionProvider
2019-10-19 21:03:09 -07:00
stevenlix
a9f01a5f29
Fixed node index remapping issue in TensorRT graph partitioning (#2155)
* Fixed node index mapping issue during graph partitioning

* add test for node index mapping

* Update BUILD.md

* Update TensorRT-ExecutionProvider.md
2019-10-19 20:31:56 -07:00
Dmitri Smirnov
7b18bd563f Commit formatting and end of lines (#2161) 2019-10-19 19:49:40 -07:00
Hariharan Seshadri
763af5d42a Remove Keras Mask RCNN from CUDA test exclusion list (#2196) 2019-10-19 18:31:50 -07:00
Tianlei Wu
18b192a45b Implement Range Cuda Kernel to improve performance (#2148) 2019-10-19 18:28:52 -07:00
shahasad
7efc9bdcc7 Some condition fixes on nuget pipeline, to get it green (#2195) 2019-10-19 18:28:12 -07:00
Xavier Dupré
836d22cd4c Update readme.rst for pypi, change documentation style (#1663) 2019-10-19 18:26:34 -07:00
Hariharan Seshadri
ac3d2ad897 Implement ConcatFromSequence (#2106) 2019-10-19 18:26:10 -07:00
Paul McDaniel
d1159b7008 Adding platform telemetry (#2109) 2019-10-19 18:25:57 -07:00