onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-16 18:31:27 +00:00

Author	SHA1	Message	Date
Scott McKay	557243dce9	Add handling of zero dim to broadcasting and a few other ops (#2217) Handle dim value of zero in broadcasting, reduce, pool, conv, onehot, clip, *softmax, transpose ops on CPU and GPU. Add unit tests for all changes.	2019-10-24 10:17:14 -07:00
Dmitri Smirnov	fef721c4f2	Update build instructions. Make samples build and run. (#2244) Making samples build. Update build instructions.	2019-10-24 10:06:47 -07:00
kile0	bede664af7	mimalloc allocator (#2071)	2019-10-23 22:34:00 -07:00
Tracy Sharpe	90858b732a	handle Concat-11 (#2240)	2019-10-23 19:12:40 -07:00
Pranav Sharma	eb75388cd5	Minor optimization: don't log all node names if they've all been assigned to a single provider. (#2229)	2019-10-23 15:16:15 -07:00
Scott McKay	41d55ea274	Update the GraphProto for subgraphs when saving the Graph. (#647) * Update the GraphProto for subgraphs when saving the Graph. This is required to produce a valid overall Graph if the Graph has been optimized.	2019-10-23 15:14:06 -07:00
Ryan Hill	6fca8b0a94	Move CXX API global into the header (#2228)	2019-10-23 14:15:53 -07:00
Nathan	90b6ff927f	Implement EyeLike Cuda Kernel to improve performance (#2139)	2019-10-23 13:48:39 -07:00
Klein Hu	ac9425c119	Update ORTSRV integration test model path (#2225) * Update ORTSRV integration test model path	2019-10-23 13:48:20 -07:00
Changming Sun	4b62241c77	Update ONNX to 1.6.1 (#2235)	2019-10-23 13:47:45 -07:00
Ashwini Khade	81d901cb60	remove nuphar scripts (#2233)	2019-10-23 13:47:26 -07:00
Jeff	ab39f7ec99	Jeffbloo/fix dml rnn failures (#2234) * Address a possible cause of incorrect DML kernel registrations and re-enable tests * Re-enable DML build	2019-10-23 13:46:16 -07:00
Sreekanth Yalachigere	041a1cab54	update libmkldnn.so version (#2220)	2019-10-22 07:23:41 -07:00
Pranav Sharma	6ac3e31047	Avoid printing a misleading message exposed by a previous commit + register float type for Slice cuda op. (#2219)	2019-10-22 07:23:05 -07:00
Pranav Sharma	04b8a4bb34	Log node placement info in verbose logging mode + turn on verbose mode in onnx_test_runner -vv switch. (#2221)	2019-10-22 07:22:18 -07:00
Pranav Sharma	384c686f40	Update README and other files with the correct cuda version used for 1.0 release. (#2222)	2019-10-22 07:21:48 -07:00
Negin Raoof	ff2c4a4d7e	Added more input data types for pad (#2179) * added more input data types for pad * replacing the comments * replacing the comments * added first set of tests * added tests * added more tests * keep NGRAPH test * avoid type cast * avoid type conversion for value float to T * fixed tabs * Update tests exclusion list * Nits * comments fix * Format files * Nit updates * rebased	2019-10-22 00:41:35 -07:00
Ashwini Khade	b23cc04f34	Handle the case when scales is constant but size is 0 (#2218)	2019-10-21 23:28:43 -07:00
edgchen1	856c6cae0a	Edgchen1/endian utils (#2181)	2019-10-21 22:28:35 -07:00
Scott McKay	3507197599	The Conv+BN fusion is missing a check that the BN node does not have optional outputs. If it does, it cannot be fused with the Conv node. (#2216) Add the necessary checks. Trim the output defs in case there are any optional outputs (already checked they don't exist so known to be unused) before finalizing fusion as we copy those to the Conv node to maintain the output names. Add unit tests for both cases.	2019-10-21 21:32:23 -07:00
manashgoswami	b344670351	Updated Perf Test readme (#2212)	2019-10-21 21:28:29 -07:00
Hariharan Seshadri	5eb42f4452	Add uint8 support for BitShift operator (#2214) * Add uint8 support for BitShift operator * Remove more tests from exclusion * Updates	2019-10-21 21:10:29 -07:00
Scott McKay	91122a2cf5	Fix GELU fusion (#2213) * Split graph_utils methods for finalization of fusion in order to support more than 2 nodes being fused into one. Update GELU fusion to use graph_utils to set up the input/output edges for the fused node, and removing nodes that are being replaced.	2019-10-21 20:18:45 -07:00
Changming Sun	aef055ebe8	Update nuget pipeline to use CentOS6 (#2211)	2019-10-21 17:55:36 -07:00
Faith Xu	303a78c301	Update Python documentation (#2210)	2019-10-21 16:56:31 -07:00
Hariharan Seshadri	e8ec52814f	Remove some node tests from exclusion (#2168)	2019-10-21 16:22:41 -07:00
Hector Li	cdcca43aa3	Skip GemmNoTrans_f16 test for CUDA if the hardware does not support fp16. Motivation and context: unblock the multi_gpu build pipeline. The build agent uses an Nvidia K80 GPU, which doesn't have fp16 support.	2019-10-21 13:15:20 -07:00
Ashwini Khade	d197079473	quantization script updates (#2208)	2019-10-21 12:25:52 -07:00
Scott McKay	b6b44c90ac	Fix MaxUnpool behaviour when output_shape is provided and doesn't match the inferred shape (#2193) * MaxUnpool should reconstitute what was pooled by MaxPool. The kernel_shape, pads and strides inputs are purely to infer an output shape, if output_shape is not explicitly provided. The unpool should not be adding new padding, so output_shape is not about auto generating pad values and inserting pads. The current ORT implementation misinterprets the usage of output_shape and inserts pads instead of just allocating an output of the specified shape, and directly unpooling into it. Update to simply find the correct output shape to use, and simply unpool into that. Update unit tests to reflect this. * Exclude maxunpool_export_with_output_shape which has invalid data in the output. * Fix test name in backend test series exclusion	2019-10-21 11:56:13 -07:00
Scott McKay	6699c19010	Add script to find if optimizer updates are required due to onnx operator updates. (#1957) * Add script to find calls to graph_utils::IsSupportedOptypeVersionAndDomain where the latest supported version is prior to the latest defined version.	2019-10-21 11:38:30 -07:00
Ashwini Khade	dc5efbf5ce	enable exclude outside for resize op (#2203) * enable exclude outside for resize mode * fix centos error * updates per review + plus more data types for resize * fix typo in error message * reset wrong fix	2019-10-21 11:18:33 -07:00
Scott McKay	3cda9f717b	Relax shape inferencing error handling if model uses an old opset (#2199)	2019-10-21 10:51:22 -07:00
Paul McDaniel	02dc3a9dcb	build break for arm64, adding advapi32.lib (#2206)	2019-10-21 08:48:28 -07:00
shahasad	fcf50ca081	Fix nuget mklml pipeline (#2204) * some fixes on nuget CPU pipeline * revert `d738c89536` * fix for MKLML package * fix if else	2019-10-21 08:46:28 -07:00
Scott McKay	5c86889beb	Fix linux build issue with debug dump of shapes and data. (#2202) Add option to dump just shapes or shapes and data.	2019-10-20 20:35:48 -07:00
Hariharan Seshadri	07e9f500da	Add back Keras Mask RCNN to CUDA test exclusion (#2201)	2019-10-20 17:13:02 -07:00
Nathan	aae18a3fe3	Upgrade onehot to OpSet 11 (#2185) * Upgrade onehot to OpSet 11 * Move Onehot test out of blacklist * Add negative indices support besides negative axis. * PR comments - 1 * PR comments-2	2019-10-20 10:44:20 -07:00
Pranav Sharma	69970d1f2a	Include the new Privacy.md file in all release packages. (#2200)	2019-10-20 07:58:36 -07:00
Scott McKay	cf7ee5f6e8	Support negative axis in unsqueeze elimination (#2158) * Handle negative axes in UnsqueezeElimination. * Address PR comments * Trigger rebuild. Some CI builds are unable to be retried	2019-10-20 01:15:27 -07:00
Konstantinos Karanasos	33c639a022	Slice elimination support for opsets 10 and 11 (#2171) * work on slice elimination for opset 10 * more work on slice elimination * first working version * adding python notebook for building models; fixing test * fixing build error in macOS	2019-10-20 01:14:55 -07:00
Changming Sun	cff7879d89	Update C API pipeline to use CentOS 6 (#2198)	2019-10-19 22:25:42 -07:00
Nathan	0dd781fd57	Perf tuning doc update with latest API (#2128) * Update perf tuning md * Remove AppendExecutionProvider	2019-10-19 21:03:09 -07:00
stevenlix	a9f01a5f29	Fixed node index remapping issue in TensorRT graph partitioning (#2155) * Fixed node index mapping issue during graph partitioning * add test for node index mapping * Update BUILD.md * Update TensorRT-ExecutionProvider.md	2019-10-19 20:31:56 -07:00
Dmitri Smirnov	7b18bd563f	Commit formatting and end of lines (#2161)	2019-10-19 19:49:40 -07:00
Hariharan Seshadri	763af5d42a	Remove Keras Mask RCNN from CUDA test exclusion list (#2196)	2019-10-19 18:31:50 -07:00
Tianlei Wu	18b192a45b	Implement Range Cuda Kernel to improve performance (#2148)	2019-10-19 18:28:52 -07:00
shahasad	7efc9bdcc7	Some condition fixes on nuget pipeline, to get it green (#2195)	2019-10-19 18:28:12 -07:00
Xavier Dupré	836d22cd4c	Update readme.rst for pypi, change documentation style (#1663)	2019-10-19 18:26:34 -07:00
Hariharan Seshadri	ac3d2ad897	Implement ConcatFromSequence (#2106)	2019-10-19 18:26:10 -07:00
Paul McDaniel	d1159b7008	Adding platform telemetry (#2109)	2019-10-19 18:25:57 -07:00

1485 commits