onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-18 18:52:16 +00:00

Author	SHA1	Message	Date
Hariharan Seshadri	07e9f500da	Add back Keras Mask RCNN to CUDA test exclusion (#2201 )	2019-10-20 17:13:02 -07:00
Nathan	aae18a3fe3	Upgrade onehot to OpSet 11 (#2185 ) * Upgrade onehot to OpSet 11 * Move Onehot test out of blacklist * Add negative indices support besides negative axis. * PR comments - 1 * PR comments-2	2019-10-20 10:44:20 -07:00
Pranav Sharma	69970d1f2a	Include the new Privacy.md file in all release packages. (#2200 )	2019-10-20 07:58:36 -07:00
Scott McKay	cf7ee5f6e8	Support negative axis in unsqueeze elimination (#2158 ) * Handle negative axes in UnsqueezeElimination. * Address PR comments * Trigger rebuild. Some CI builds are unable to be retried	2019-10-20 01:15:27 -07:00
Konstantinos Karanasos	33c639a022	Slice elimination support for opsets 10 and 11 (#2171 ) * work on slice elimination for opset 10 * more work on slice elimination * first working version * adding python notebook for building models; fixing test * fixing build error in macOS	2019-10-20 01:14:55 -07:00
Changming Sun	cff7879d89	Update C API pipeline to use CentOS 6 (#2198 )	2019-10-19 22:25:42 -07:00
Nathan	0dd781fd57	Perf tuning doc update with latest API (#2128 ) * Update perf tuning md * Remove AppendExecutionProvider	2019-10-19 21:03:09 -07:00
stevenlix	a9f01a5f29	Fixed node index remapping issue in TensorRT graph partitioning (#2155 ) * Fixed node index mapping issue during graph partitioning * add test for node index mapping * Update BUILD.md * Update TensorRT-ExecutionProvider.md	2019-10-19 20:31:56 -07:00
Dmitri Smirnov	7b18bd563f	Commit formatting and end of lines (#2161 )	2019-10-19 19:49:40 -07:00
Hariharan Seshadri	763af5d42a	Remove Keras Mask RCNN from CUDA test exclusion list (#2196 )	2019-10-19 18:31:50 -07:00
Tianlei Wu	18b192a45b	Implement Range Cuda Kernel to improve performance (#2148 )	2019-10-19 18:28:52 -07:00
shahasad	7efc9bdcc7	Some condition fixes on nuget pipeline, to get it green (#2195 )	2019-10-19 18:28:12 -07:00
Xavier Dupré	836d22cd4c	Update readme.rst for pypi, change documentation style (#1663 )	2019-10-19 18:26:34 -07:00
Hariharan Seshadri	ac3d2ad897	Implement ConcatFromSequence (#2106 )	2019-10-19 18:26:10 -07:00
Paul McDaniel	d1159b7008	Adding platform telemetry (#2109 )	2019-10-19 18:25:57 -07:00
George Wu	b1096424f0	mkldnn cleanup. add EP_FAIL paths for unsupported mkldnn nodes. (#2182 ) * mkldnn cleanup. add EP_FAIL paths for unsupported mkldnn modes. * disable flaky dml test.	2019-10-19 17:06:16 -07:00
Dmitri Smirnov	acec4b446f	Make CentOS 6 CUDA build and run (#2159 ) * Add manylinux1 source code changes * Disable a python test	2019-10-19 15:33:31 -07:00
Pranav Sharma	96b33f4597	Add support for double to SplitToSequence op to allow test_sequence_model7 to pass. (#2173 )	2019-10-19 14:11:19 -07:00
Pranav Sharma	f8c30b8aa9	Disable DML builds for now until further investigation since the tests are very flaky. (#2194 )	2019-10-19 12:13:25 -07:00
Ashwini Khade	6d4682581e	resize opset11 (#2088 )	2019-10-19 08:10:37 -07:00
Changming Sun	021073b5e5	Update python packaging pipelines (#2167 )	2019-10-19 07:42:54 -07:00
Pranav Sharma	d15021f42b	Disable some tests for DML and Android EP pending investigation. (#2187 ) * Disable some tests for DML and Android EP pending investigation. * Add lstm_with_peephole as well * Disable gru_defaults as well	2019-10-19 00:07:51 -07:00
shahasad	35dae992f1	Fix nuget gpu ci test error (#2164 ) * fix nuget version extraction script for Gpu packages * fix cuda version in gpu end-to-end test	2019-10-18 23:01:26 -07:00
Ashwini Khade	ecf5ae8b76	Askhade/disable csharptests (#2172 ) Disable flaky c# test For agility	2019-10-18 11:00:50 -07:00
Patrice Vignola	62281ff3b9	Add empty tensor support to the DML EP (#2163 )	2019-10-18 10:17:57 -07:00
Ashwini Khade	5eb4e81f80	move some optimizers to level1 (#1566 ) * move some optimizers to level1 * move matmul add fusion to level 1 * bug fix in the test code * fix make_uniques + add test exceptions * add exception for tests in c# too	2019-10-18 09:29:31 -07:00
Zhang Lei	47bc351265	Fix wrong usage of GetScratchBuffer, like: (#2101 ) auto p = GetScratchBuffer<T>(size).get(); which cause cuda memory freed immediately, but p will be used later. This also cause random error.	2019-10-18 08:15:38 -07:00
Ashwini Khade	fc3c168402	Graph Optimizations Doc (#2050 ) * Initial draft * updates per review * fix link * plus one more link fix * small changes to the optimizer documentation * some more changes * done * update C_API with doc link	2019-10-18 08:03:40 -07:00
Changming Sun	13f8b49d58	Fix kernel registry bug (#2137 )	2019-10-17 23:10:54 -07:00
Klein Hu	2bf1778a5c	Fix ORTSRV docker build (#2160 ) * Fix ORTSRV docker build * More Dockerfile changes	2019-10-17 21:10:21 -07:00
Changming Sun	00e2d1c604	update (#2140 )	2019-10-17 19:28:10 -07:00
Clarke Rahrig	0e6ac2961e	Adding a line beak to BUILD.md (#2156 ) Adding a line break between the DirectML and NUPHAR sections of build guidance.	2019-10-17 15:18:06 -07:00
daquexian	f6c82358e9	Use built-in wstring_convert on Android (#2095 )	2019-10-17 15:09:56 -07:00
Faith Xu	86af54ded8	Add roadmap file (#2127 ) * Add roadmap file * Minor updates * fixes based on feedback * Add IOT section	2019-10-17 13:03:25 -07:00
Tomasz Dołbniak	72110d3508	Patch for the MKLDNN v1 segfaults (#2145 )	2019-10-17 12:10:00 -07:00
Pranav Sharma	4006e29601	Implement SplitToSequence op. (#2131 ) * Implement SplitToSequence op. * Fix bug in uneven splits, add tests for axis > 0.	2019-10-17 12:02:22 -07:00
Pranav Sharma	70e7eaf1e8	Update DML transformers with the new Graph API and re-enable DML in the GPU CI build. (#2147 )	2019-10-17 11:46:14 -07:00
Sreekanth Yalachigere	95fef56dc8	MKL-DNN EP (#2149 ) * make_unique, make_shared * make_unique, make_shared	2019-10-16 22:49:45 -07:00
Pranav Sharma	6445e7182c	Disable DML build temporarily until we fix the removal of the IsNodeOutputsInGraphOutputs Graph API. (#2144 )	2019-10-16 15:04:40 -07:00
Tianlei Wu	b0f8ec7a7d	Script for converting BERT model for performance optimization (#2037 ) * script for converting ONNX model for BERT performance optimization * Remove code that not needed anymore. * refine the script * Support BERT model exported from PyTorch 1.3 Keep opset version Exact match in Attention, Layer normalziation fusions. * read batch_size from input model directly	2019-10-15 16:50:08 -07:00
Faith Xu	ec136ac60f	Documentation Refresh (#1990 ) Various documentation updates, primarily for EP and main readme page	2019-10-15 15:58:02 -07:00
Scott McKay	3fcb4ee7d4	Refine optimizers (#1407 ) * Refine optimizers * Address PR comments * Changes from PR comments and discussion. * Fixed signed/unsigned mismatch * Address PR comments * Address PR comments * Fix linux build * Fix issue with mkldnn logic. * Turn off optimizers by default for operator unit tests. * Handle edge case of graph with no nodes in partitioner so all execution providers don't need to. * Comment out change to turn off optimizers for unit tests. Add details on what needs to be done to re-enable.	2019-10-15 14:49:59 -07:00
Sreekanth Yalachigere	485c24b62d	MKL-DNN 1.0 (#2134 ) * MKL-DNN 1.0 * changed libmkldnn version to 1	2019-10-15 12:06:34 -07:00
Hariharan Seshadri	6857bb8aba	Fix bug in GatherElements (#2130 ) * Fix bug in GatherElements * Uncomment some tests * Updates * Nits * Nits * Nits	2019-10-15 07:54:42 -07:00
shahasad	7ef02f14d2	Add missing test model file for symbolic dimensions (#2123 )	2019-10-15 06:55:51 -07:00
Adrian Tsai	4090d0d0de	Add DirectML Execution Provider (#2057 ) This change adds a new execution provider powered by [DirectML](https://aka.ms/DirectML). DirectML is a high-performance, hardware-accelerated DirectX 12 library for machine learning on Windows. DirectML provides GPU acceleration for common machine learning tasks across a broad range of supported hardware and drivers. The DirectML execution provider is capable of greatly improving evaluation time of models using commodity GPU hardware, without sacrificing broad hardware support or requiring vendor-specific extensions to be installed. Note that the DML EP code was moved verbatim from the existing WindowsAI project, which is why it doesn't yet conform to the onnxruntime coding style. This is something that can be fixed later; we would like to keep formatting/whitespace changes to a minimum for the time being to make it easier to port fixes from WindowsAI to ORT during this transition. Summary of changes: * Initial commit of DML EP files under onnxruntime/core/providers/dml * Add cmake entries for building the DML EP and for pulling down the DirectML redist using nuget * Add a submodule dependency on the Windows Implementation Library (WIL) * Add docs under docs/execution_providers/DirectML-ExecutionProvider.md * Add support for DML EP to provider tests and perf tests * Add support for DML EP to fns_candy_style_transfer sample * Add entries to the C ABI for instantiating the DML EP	2019-10-15 06:13:07 -07:00
KeDengMS	b101f1bcee	Nuphar: Fix a bug in weight layout where read may go out of bound (#2129 )	2019-10-15 00:11:41 -07:00
Yang Chen	5c2803f2d5	various fixes for shape inference script (#2124 ) * use dilations for computing effective kernel shape for conv/pool ops * when auto_pad is 'VALID', total_pads should be empty * added support for ArrayFeatureExtractor and ZipMap * check out_shape only if the output has shape, i.e. output is of TensorType or SparseTensorType	2019-10-14 19:44:29 -07:00
Hariharan Seshadri	95ab5ad39f	Support non-spatial mode in BatchNormalization (#2092 ) * Initial commit * Update * Update * Fix build break * Update * More changes * Update type * Exclude Nuphar for non-spatial tests * Update * Resolve PR comments	2019-10-14 18:14:14 -07:00
Yufeng Li	2536553136	use cublasHgemm for Volta GPU (#2074 ) * use cublasHgemm for Volta GPU	2019-10-14 17:29:13 -07:00

1 2 3 4 5 ...

1450 commits