onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-11 17:48:34 +00:00

Author	SHA1	Message	Date
Dmitri Smirnov	e43e64bf84	Implement Equal for CUDA. (#1183 )	2019-06-07 11:11:50 -07:00
Xavier Dupré	d33dbb23b2	replace onnxmltools by keras-onnx in one example (#1151 )	2019-06-07 12:03:46 +02:00
Changming Sun	d8ac0d64d0	Make C API capable of defining CUDA custom ops (#1178 ) Recreate the PR on behalf of Rui Xia, for #779	2019-06-06 13:45:32 -07:00
Ryan Hill	b68bb51dd0	Change SessionOptions APIs to always return a status (#1171 ) * Change SessionOptions APIs to always return a status, for consistency and ease of use (a couple returned 0 or -1 for success/failure)	2019-06-06 13:24:24 -07:00
G. Ramalingam	b23ab6a06e	Implementation of sparse tensor (#1121 ) * Initial implementation of sparse tensor * minor cleanup * minor cleanup (remove empty line) * simplify template usage in test-case * address linux build error * fix constructor order to address compiler warning * Address PR comments * handle allocation in optimizer execution frame * address compiler warning message and PR feedback comment * address gcc unused warning for protobuf code * address PR comment	2019-06-06 11:50:38 -07:00
Dmitri Smirnov	7a80770b52	Remove filtering from the backend scripts runner (#1169 ) Remove filtering from the backend scripts runner as we only run it on the tip of onnx.	2019-06-06 10:04:55 -07:00
Klein Hu	1a86421aff	Create a syslog sink for logging in !Win32 env (#1163 ) * Create a syslog sink for logging in !Win32 env * Move syslog level logic to syslog_sink.c	2019-06-06 16:35:06 +10:00
Changming Sun	88ea58a383	Add double tensor support for Div and Sub (#1172 ) Needed by the preprocessing step of inception models. TF source code: https://github.com/tensorflow/models/blob/master/research/slim/preprocessing/inception_preprocessing.py	2019-06-05 21:10:09 -07:00
Scott McKay	c6abb17b8d	Use shared threadpool in LSTM (#1167 )	2019-06-06 07:16:31 +10:00
Ryan Hill	7cd2d9f3c4	Change Path_lib to not use Shlwapi.dll if compiled as WINAPI_PARTITION_APP (#1161 )	2019-06-05 11:05:33 -07:00
KeDengMS	7c4494a0bc	Fix CUDA thread_local to allow multiple CUDA execution providers (#1149 ) This fixes #1034: Can't Create Model Sessions on Different GPU The root cause of the bug is that CUDA execution provider uses thread_local to save per-thread-context and allocator, and when two CUDA execution providers are running on the same thread there's a conflict. The fix is to add a std::unordered_map to differentiate EPs in the same thread.	2019-06-04 16:32:27 -07:00
Hariharan Seshadri	a863c67ef8	Fix error message for legacy opset (6 and lower) models at model load time (#1147 ) * Initial commit * Resolve comments and revert changes in files that are not needed for this change * Single line addition revert * Resolve comments	2019-06-04 11:56:49 -07:00
jywu-msft	7316e54153	re-enable disabled tests on nGraph after fixing remaining subgraph resolve error (#1158 ) * fix subgraph resolve error for nGraph. * input_args need to be processed in order.	2019-06-04 09:40:24 -07:00
Ryan Hill	148141dd5f	Change Compute function to return a status code instead of an integer. (#1139 )	2019-06-04 08:34:32 -07:00
Changming Sun	c18de6817b	Rename MLValue to OrtValue (#1154 )	2019-06-03 17:29:55 -07:00
Hector Li	b8a699f70b	Update MaxPool & AveragePool to support opset 10 (#1141 ) * Update MaxPool & AveragePool to support opset 10 * fix build issue * still use cudnn for MaxPool if dilation is not set or are default 1. * fix build issue	2019-06-02 20:42:57 -07:00
Torkel	10ea77a3d1	add details aboud adding execution providers in the C api to comments and docs (i.e. need OrtSessionOptionsAppendExecutionProvider_CUDA to get CUDA)	2019-06-02 17:38:36 -07:00
Changming Sun	6c9d815de5	Revert "Remove openmp flag (#1140 )" (#1146 ) This reverts commit `a7137a0f9d`.	2019-05-31 18:48:14 -07:00
RandySheriffH	71cbd7679c	implement cuda slice opset 10 (#1137 ) * implement cuda slice opset 10 * remove needless delaration	2019-05-31 13:38:19 -07:00
Changming Sun	a7137a0f9d	Remove openmp flag (#1140 ) * Remove openmp flag	2019-05-31 12:41:08 -07:00
Du Li	05110a6558	Adding custom op ConvTransposeWithDynamicPads. (#638 ) * Adding custom op ConvTransposeWithDynamicPads. * Adding custom op ConvTransposeWithDynamicPads. * adding cuda kernels * fix a bug * fix build issue. * Integrate PR comments.	2019-05-31 11:48:43 -07:00
Klein Hu	6c408c3a75	Simplify ONNX Runtime Server CI build (#1136 )	2019-05-30 17:31:41 -07:00
Hariharan Seshadri	facdf77f84	Fix shape inference bug in GatherND contrib op (#1132 )	2019-05-30 17:05:44 -07:00
RandySheriffH	4757933afe	Exclude test by onnx version tag (#1073 ) * add version filter to failed tests * exclude test from backend * exclude shrink from opset 9 * fix compile err * exclude certain version of constant shape * enable flatten test * fix compile err * comment mvn test * disable constantofshape test in x86 * disable x86 test * get model version from imported opset * test linux x86 case * disable nonzero opset 10 * make mutex const * test filter by commit id * adjust substr offset * Limit test platform * remove change impacting TFModleInfo.h * refactoring * refactoring * test x86 pipeline with filter * add comment * restrict version extraction on non-win * restrict version extraction on non-win * add tag * exclude case from backend test * remove dup * remove dup * make script runnable * hard code adsolute path * refactor log * fix x86 compile err * fix x86 compile err * fix x86 compile err * sync with latest tensorrt * switch to regex * fix cpu pipeline err * test filter * disable nonzero from all versions	2019-05-30 16:19:06 -07:00
Ryan Hill	f9f6818e4c	Add comments and organize the C++ header into the main header plus a separate one of the inline methods. (#1130 )	2019-05-30 14:24:25 -07:00
Changming Sun	66a6f2b0e3	Add a check for onnxruntime_USE_MKLML AND onnxruntime_USE_OPENMP (#1131 ) * Add a check for onnxruntime_USE_MKLML AND onnxruntime_USE_OPENMP	2019-05-30 10:23:18 -07:00
Klein Hu	4e231ad907	Split binary/symbol and then upload ortsrv nightly build to blob storage (#1120 ) * Upload ortsrv nightly build to blob storage * Fix the binary directory * Temporarily disable binary split * Split binary in build container * Update azcopy command * Update getopts * Pass blob sas url as string * Avoid binary split on Windows * Update build_server logic in build.py	2019-05-29 17:46:07 -07:00
Konstantinos Karanasos	2072d34d15	Fixing broken links for graph transformations in High Level Design doc (#1128 )	2019-05-29 15:36:59 -07:00
Changming Sun	f51b081ec2	Revert "fixing clang build failure" (#1055 ) Not needed any more. Because we don't build the date library. And Sheil says: "It’s a little bit intrusive for callers to be forced onto cpp14 just because they are consuming onnxruntime."	2019-05-29 14:03:22 -07:00
Ke Zhang	ef66395060	remove unnecessary lock and code clean up (#1114 ) * refactoring the ep codes. * remove unnecessary lock. * fix the comment to claim KernelRegistryManager is not thread safe. * clarify that APIs to add custom op in inferencesession is not thread safe.	2019-05-29 13:20:38 -07:00
jywu-msft	eda4c5cb47	fix ngraph build (#1124 ) * disable repeat_dim_overflow test for ngraph only. * fix concat error.	2019-05-28 19:41:47 -07:00
Dmitri Smirnov	8c7e4eb3fb	Fix run_model api. (#1111 )	2019-05-28 16:36:57 -07:00
Hariharan Seshadri	e19bc2d074	Raise max request size in ORT server (#1119 )	2019-05-28 15:50:23 -07:00
KeDengMS	1ea3e8633c	CUDA opset9: Update Cast/MatMul version, add Erf (#1106 ) * CUDA opset9: Update Cast/MatMul version, add Erf * Address CR * More fixes on node placement logic * Fix typo * Update CUDA ops Gemm and BatchNormalization to be registered in opset10	2019-05-28 13:52:13 -07:00
Pranav Sharma	71560843f8	Add Boost dependency for onnxruntime server. (#1112 )	2019-05-28 11:37:49 -07:00
jignparm	2cf56639ed	Minor update to NuGet package tests -- allow model download in separate step (#1115 ) * Update docker scripts to not fetch model data * Update related files	2019-05-28 03:01:10 -07:00
Scott McKay	b9e8aac5f6	Enable some more tests (#1101 ) Restructure a little so permanently disabled tests are clearer.	2019-05-25 08:41:42 -07:00
Ryan Hill	8808efd9e3	Add zero size check before setting thread pool so that the zero default value won't cause a failure. (#1109 )	2019-05-24 23:27:36 -07:00
Scott McKay	f6df36b68b	Add rewrite rule to handle Relu + Clip (#1105 ) * Remove Relu if followed by Clip. Update Clip 'min' if necessary. Add unit test. * Rename to match behaviour a little better. * Update to match latest RewriteRule interface	2019-05-24 19:42:02 -07:00
Klein Hu	b54a292ba2	Add version and latest commit id to ONNX Runtime Server (#1078 ) * Add version and latest commit id to ORT Server * Update cmake * Change build id to build number * Use target_compile_definitions instead of add_definitions	2019-05-24 17:30:40 -07:00
daquexian	ea29a664cd	Fix android build when API<23, fix android test, update build doc and pipeline (#884 )	2019-05-24 16:26:32 -07:00
stevenlix	2d92b95729	Enable Unsqueeze elimination (#1104 )	2019-05-24 16:15:42 -07:00
Konstantinos Karanasos	ee6217972b	Fix when rewrite rule gets registered to multiple op types; update constness of rule methods; enable dropout elimination (#1098 )	2019-05-24 13:47:55 -07:00
Ryan Hill	9129a652c5	Ryanunderhill/cxx api2 (#1091 ) More C++ API improvements and cleanup Add templates to tensor creation Add run method that allows preallocated outputs Simplify CreateTensor<T> to multiply by sizeof(T) Convert io_types code Optimize away vector copies in Session::Run	2019-05-24 11:15:51 -07:00
stevenlix	723d5c782a	Improve TensorRT GetCapability to Enable More Models (#1012 ) * Improve TensorRT GetCapability Accuracy * Update onnxruntime_providers.cmake * made changes based on feedback * update unit tests for TensorRT * update onnx-tensorrt submodule to v5.0 branch * remove uncessary comments * convert int32 to int64 at inferencing output * add more data types in compute * change returns in compute * use StatusCode as return in compute	2019-05-24 10:12:55 -07:00
KeDengMS	b44a30bca7	CUDA CPU/GPU sync optimization (#1100 ) * CUDA CPU/GPU sync optimization Even though CUDA device is capable of handling certain ops, it may be better to leave them on CPU especially for dynamic shape computations starting from Shape. * Fix TensorRT test crash when fused graph may have null node in topological sort	2019-05-24 09:25:48 -07:00
KeDengMS	cd9b9e6102	Accelerate CUDA transpose (#1102 ) For NCHW <-> NHWC and 2D case	2019-05-23 21:10:50 -07:00
R. G. Esteves	f4a9ccae99	Enable nGraph Debug ci test. (#1000 ) * Enable nGraph Debug ci test. * nGraph doesn't work with stack trace. * Fix corrupt patch.	2019-05-23 19:58:35 -07:00
Scott McKay	4a8d75386b	Clarify/state expected usage of non-const references and 'auto' in coding conventions (#1096 ) * As we consistently use non-const reference for modifiable arguments that cannot be null, update the conventions to reflect that. Add a note on qualifying 'auto' to make the intent clearer and it easier to notice accidental copies. * Address PR comment by adding a statement around disabling copy/assignment/move for new classes until needed.	2019-05-23 19:28:01 -07:00
Pranav Sharma	11243253f2	Disable mvn model test when contrib ops are disabled since this model uses a contrib op (#1097 )	2019-05-23 17:01:35 -07:00

1 2 3 4 5 ...

871 commits