Commit graph

1463 commits

Author SHA1 Message Date
Scott McKay
91122a2cf5 Fix GELU fusion (#2213)
* Split graph_utils methods for finalization of fusion in order to support more than 2 nodes being fused into one.
Update GELU fusion to use graph_utils to set up the input/output edges for the fused node and to remove the nodes that are being replaced.
2019-10-21 20:18:45 -07:00
Changming Sun
aef055ebe8
Update nuget pipeline to use CentOS6 (#2211) 2019-10-21 17:55:36 -07:00
Faith Xu
303a78c301 Update Python documentation (#2210) 2019-10-21 16:56:31 -07:00
Hariharan Seshadri
e8ec52814f Remove some node tests from exclusion (#2168) 2019-10-21 16:22:41 -07:00
Hector Li
cdcca43aa3
Skip GemmNoTrans_f16 test for CUDA if the hardware does not support fp16

Motivation and Context
Unblock the multi_gpu build pipeline. The build agent uses an Nvidia K80 GPU, which doesn't have fp16 support.
2019-10-21 13:15:20 -07:00
Ashwini Khade
d197079473
quantization script updates (#2208) 2019-10-21 12:25:52 -07:00
Scott McKay
b6b44c90ac Fix MaxUnpool behaviour when output_shape is provided and doesn't match the inferred shape (#2193)
* MaxUnpool should reconstitute what was pooled by MaxPool. The kernel_shape, pads and strides inputs are purely to infer an output shape, if output_shape is not explicitly provided.

The unpool should not be adding new padding, so output_shape is not about auto generating pad values and inserting pads.

The current ORT implementation misinterprets the usage of output_shape and inserts pads instead of just allocating an output of the specified shape, and directly unpooling into it.

Update to simply find the correct output shape to use, and simply unpool into that.

Update unit tests to reflect this.

* Exclude maxunpool_export_with_output_shape which has invalid data in the output.

* Fix test name in backend test series exclusion
2019-10-21 11:56:13 -07:00
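The behaviour described in the MaxUnpool commit above can be illustrated with a minimal 1-D sketch. It is not the ORT kernel: the function names `InferUnpoolSize` and `MaxUnpool1D` are hypothetical, and the inference formula is an assumption based on how the ONNX MaxUnpool operator is commonly described. The point is only that the output is allocated at the chosen size and values are scattered into it directly, with no extra padding step.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper: infers the unpooled length for one spatial dimension.
// Assumed formula: (input - 1) * stride - (pad_begin + pad_end) + kernel.
int64_t InferUnpoolSize(int64_t input, int64_t kernel, int64_t stride,
                        int64_t pad_begin, int64_t pad_end) {
  return (input - 1) * stride - (pad_begin + pad_end) + kernel;
}

// Hypothetical 1-D unpool: allocates a zero-filled output of `output_size`
// (either the explicit output_shape or the inferred one) and writes each
// pooled value at the flat index recorded by the pooling step.
std::vector<float> MaxUnpool1D(const std::vector<float>& pooled,
                               const std::vector<int64_t>& indices,
                               std::size_t output_size) {
  assert(pooled.size() == indices.size());
  std::vector<float> output(output_size, 0.0f);
  for (std::size_t i = 0; i < pooled.size(); ++i) {
    const int64_t idx = indices[i];
    assert(idx >= 0 && static_cast<std::size_t>(idx) < output_size);
    output[static_cast<std::size_t>(idx)] = pooled[i];
  }
  return output;
}
```

When an explicit output size is supplied, a caller would pass it straight to `MaxUnpool1D` in place of the inferred value. The scatter loop is the same in both cases.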
Scott McKay
6699c19010 Add script to find if optimizer updates are required due to onnx operator updates. (#1957)
* Add script to find calls to graph_utils::IsSupportedOptypeVersionAndDomain where the latest supported version is prior to the latest defined version.
2019-10-21 11:38:30 -07:00
Ashwini Khade
dc5efbf5ce
enable exclude outside for resize op (#2203)
* enable exclude outside for resize mode

* fix centos error

* updates per review plus more data types for resize

* fix typo in error message

* reset wrong fix
2019-10-21 11:18:33 -07:00
Scott McKay
3cda9f717b Relax shape inferencing error handling if model uses an old opset (#2199) 2019-10-21 10:51:22 -07:00
Paul McDaniel
02dc3a9dcb build break for arm64, adding advapi32.lib (#2206) 2019-10-21 08:48:28 -07:00
shahasad
fcf50ca081 Fix nuget mklml pipeline (#2204)
* some fixes on nuget CPU pipeline

* revert d738c89536

* fix for MKLML package

* fix if else
2019-10-21 08:46:28 -07:00
Scott McKay
5c86889beb
Fix linux build issue with debug dump of shapes and data. (#2202)
Add option to dump just shapes or shapes and data.
2019-10-20 20:35:48 -07:00
Hariharan Seshadri
07e9f500da
Add back Keras Mask RCNN to CUDA test exclusion (#2201) 2019-10-20 17:13:02 -07:00
Nathan
aae18a3fe3 Upgrade onehot to OpSet 11 (#2185)
* Upgrade onehot to OpSet 11

* Move Onehot test out of blacklist

* Add negative indices support besides negative axis.

* PR comments - 1

* PR comments-2
2019-10-20 10:44:20 -07:00
Pranav Sharma
69970d1f2a
Include the new Privacy.md file in all release packages. (#2200) 2019-10-20 07:58:36 -07:00
Scott McKay
cf7ee5f6e8 Support negative axis in unsqueeze elimination (#2158)
* Handle negative axes in UnsqueezeElimination.

* Address PR comments

* Trigger rebuild. Some CI builds are unable to be retried
2019-10-20 01:15:27 -07:00
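Handling negative axes, as in the UnsqueezeElimination commit above, usually comes down to normalizing each axis before use. The sketch below is an illustration only, not the optimizer's code: `NormalizeAxis` and `NormalizeUnsqueezeAxes` are hypothetical names, and it assumes that negative Unsqueeze axes count back from the rank of the output (input rank plus the number of inserted axes).

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Maps an axis in [-rank, rank - 1] to its non-negative equivalent.
int64_t NormalizeAxis(int64_t axis, int64_t rank) {
  assert(axis >= -rank && axis < rank);
  return axis < 0 ? axis + rank : axis;
}

// Normalizes Unsqueeze axes against the output rank (assumption: negative
// values are relative to the rank after the new dimensions are inserted).
std::vector<int64_t> NormalizeUnsqueezeAxes(const std::vector<int64_t>& axes,
                                            int64_t input_rank) {
  const int64_t output_rank = input_rank + static_cast<int64_t>(axes.size());
  std::vector<int64_t> normalized;
  normalized.reserve(axes.size());
  for (int64_t axis : axes) {
    normalized.push_back(NormalizeAxis(axis, output_rank));
  }
  return normalized;
}
```

For a rank-2 input and axes {-1, 0}, the output rank is 4, so the axes normalize to {3, 0}.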
Konstantinos Karanasos
33c639a022 Slice elimination support for opsets 10 and 11 (#2171)
* work on slice elimination for opset 10

* more work on slice elimination

* first working version

* adding python notebook for building models; fixing test

* fixing build error in macOS
2019-10-20 01:14:55 -07:00
Changming Sun
cff7879d89
Update C API pipeline to use CentOS 6 (#2198) 2019-10-19 22:25:42 -07:00
Nathan
0dd781fd57 Perf tuning doc update with latest API (#2128)
* Update perf tuning md

* Remove AppendExecutionProvider
2019-10-19 21:03:09 -07:00
stevenlix
a9f01a5f29
Fixed node index remapping issue in TensorRT graph partitioning (#2155)
* Fixed node index mapping issue during graph partitioning

* add test for node index mapping

* Update BUILD.md

* Update TensorRT-ExecutionProvider.md
2019-10-19 20:31:56 -07:00
Dmitri Smirnov
7b18bd563f Commit formatting and end of lines (#2161) 2019-10-19 19:49:40 -07:00
Hariharan Seshadri
763af5d42a Remove Keras Mask RCNN from CUDA test exclusion list (#2196) 2019-10-19 18:31:50 -07:00
Tianlei Wu
18b192a45b Implement Range Cuda Kernel to improve performance (#2148) 2019-10-19 18:28:52 -07:00
shahasad
7efc9bdcc7 Some condition fixes on nuget pipeline, to get it green (#2195) 2019-10-19 18:28:12 -07:00
Xavier Dupré
836d22cd4c Update readme.rst for pypi, change documentation style (#1663) 2019-10-19 18:26:34 -07:00
Hariharan Seshadri
ac3d2ad897 Implement ConcatFromSequence (#2106) 2019-10-19 18:26:10 -07:00
Paul McDaniel
d1159b7008 Adding platform telemetry (#2109) 2019-10-19 18:25:57 -07:00
George Wu
b1096424f0
mkldnn cleanup. add EP_FAIL paths for unsupported mkldnn nodes. (#2182)
* mkldnn cleanup. add EP_FAIL paths for unsupported mkldnn modes.

* disable flaky dml test.
2019-10-19 17:06:16 -07:00
Dmitri Smirnov
acec4b446f Make CentOS 6 CUDA build and run (#2159)
* Add manylinux1 source code changes

* Disable a python test
2019-10-19 15:33:31 -07:00
Pranav Sharma
96b33f4597
Add support for double to SplitToSequence op to allow test_sequence_model7 to pass. (#2173) 2019-10-19 14:11:19 -07:00
Pranav Sharma
f8c30b8aa9
Disable DML builds for now until further investigation since the tests are very flaky. (#2194) 2019-10-19 12:13:25 -07:00
Ashwini Khade
6d4682581e resize opset11 (#2088) 2019-10-19 08:10:37 -07:00
Changming Sun
021073b5e5
Update python packaging pipelines (#2167) 2019-10-19 07:42:54 -07:00
Pranav Sharma
d15021f42b Disable some tests for DML and Android EP pending investigation. (#2187)
* Disable some tests for DML and Android EP pending investigation.

* Add lstm_with_peephole as well

* Disable gru_defaults as well
2019-10-19 00:07:51 -07:00
shahasad
35dae992f1
Fix nuget gpu ci test error (#2164)
* fix nuget version extraction script for Gpu packages
* fix cuda version in gpu end-to-end test
2019-10-18 23:01:26 -07:00
Ashwini Khade
ecf5ae8b76 Askhade/disable csharptests (#2172)
Disable flaky c# test
  For agility
2019-10-18 11:00:50 -07:00
Patrice Vignola
62281ff3b9 Add empty tensor support to the DML EP (#2163) 2019-10-18 10:17:57 -07:00
Ashwini Khade
5eb4e81f80
move some optimizers to level1 (#1566)
* move some optimizers to level1

* move matmul add fusion to level 1

* bug fix in the test code

* fix make_uniques + add test exceptions

* add exception for tests in c# too
2019-10-18 09:29:31 -07:00
Zhang Lei
47bc351265 Fix wrong usage of GetScratchBuffer, like: (#2101)
auto p = GetScratchBuffer<T>(size).get();
which causes the CUDA memory to be freed immediately, even though p is used later.
This also causes random errors.
2019-10-18 08:15:38 -07:00
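The lifetime problem described in the GetScratchBuffer commit above can be reproduced with any owning smart pointer. The sketch below uses `std::unique_ptr` and a hypothetical `MakeScratch` helper as a stand-in. It assumes, as the commit message implies, that GetScratchBuffer returns an owning pointer that frees the buffer when destroyed. The real ORT signature is not shown here.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>

// Hypothetical stand-in for a scratch allocator: returns an owning pointer,
// so the memory is released when the returned unique_ptr is destroyed.
template <typename T>
std::unique_ptr<T[]> MakeScratch(std::size_t count) {
  assert(count > 0);
  return std::make_unique<T[]>(count);
}

// Wrong pattern (left as a comment on purpose): the temporary owner is
// destroyed at the end of the full expression, so `p` dangles immediately.
//   float* p = MakeScratch<float>(n).get();

// Safer pattern: keep the owner alive for as long as the raw pointer is used.
float SumOfFilledScratch(std::size_t n, float value) {
  auto buffer = MakeScratch<float>(n);  // owner lives until function exit
  float* p = buffer.get();              // valid while `buffer` is in scope
  float sum = 0.0f;
  for (std::size_t i = 0; i < n; ++i) {
    p[i] = value;
    sum += p[i];
  }
  return sum;
}
```

The design choice is simply to bind the returned owner to a named variable whose scope covers every use of the raw pointer.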
Ashwini Khade
fc3c168402
Graph Optimizations Doc (#2050)
* Initial draft

* updates per review

* fix link

* plus one more link fix

* small changes to the optimizer documentation

* some more changes

* done

* update C_API with doc link
2019-10-18 08:03:40 -07:00
Changming Sun
13f8b49d58
Fix kernel registry bug (#2137) 2019-10-17 23:10:54 -07:00
Klein Hu
2bf1778a5c Fix ORTSRV docker build (#2160)
* Fix ORTSRV docker build

* More Dockerfile changes
2019-10-17 21:10:21 -07:00
Changming Sun
00e2d1c604 update (#2140) 2019-10-17 19:28:10 -07:00
Clarke Rahrig
0e6ac2961e Adding a line break to BUILD.md (#2156)
Adding a line break between the DirectML and NUPHAR sections of build guidance.
2019-10-17 15:18:06 -07:00
daquexian
f6c82358e9 Use built-in wstring_convert on Android (#2095) 2019-10-17 15:09:56 -07:00
Faith Xu
86af54ded8
Add roadmap file (#2127)
* Add roadmap file

* Minor updates

* fixes based on feedback

* Add IOT section
2019-10-17 13:03:25 -07:00
Tomasz Dołbniak
72110d3508 Patch for the MKLDNN v1 segfaults (#2145) 2019-10-17 12:10:00 -07:00
Pranav Sharma
4006e29601
Implement SplitToSequence op. (#2131)
* Implement SplitToSequence op.

* Fix bug in uneven splits, add tests for axis > 0.
2019-10-17 12:02:22 -07:00
Pranav Sharma
70e7eaf1e8 Update DML transformers with the new Graph API and re-enable DML in the GPU CI build. (#2147) 2019-10-17 11:46:14 -07:00