onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-16 18:31:27 +00:00

Author	SHA1	Message	Date
Changming Sun	104f3b2a59	Exclude candy from CUDA tests	2019-11-05 15:22:09 -08:00
Changming Sun	143ae98a37	Fix a bug in onnxruntime_pybind_state.cc when TENSORRT is enabled (#2326 )	2019-11-05 15:04:50 -08:00
George	8a102c6e99	apply eigen patch only for ACL.	2019-11-05 13:53:53 -08:00
Changming Sun	5ce4d4fc49	Fix a test failure when it runs on FreeBSD	2019-11-04 23:47:37 -08:00
Yufeng Li	035913d42f	Support int32_t for Reduction (#2317 )	2019-11-04 20:52:01 -08:00
manashgoswami	d5c36bfff2	Updated links in docs (#2303 ) * Update README.md * Update README.md * Update README.md	2019-11-03 09:10:56 -08:00
Faith Xu	556bae17a5	Fix versions table (#2309 ) * Update table values * Fix onnxml opset version	2019-11-03 08:58:21 -08:00
Yulong Wang	cba93f7c8d	fix Gelu CPU: remove MayInplace() declaration (#2306 )	2019-11-01 18:10:05 -07:00
Yulong Wang	204a6872d3	remove unused param 'input_count' in ConcatImpl (#2304 )	2019-11-01 15:50:11 -07:00
Tianlei Wu	a6b2c9fc09	Fix mask in EmbedLayerNormalization (#2300 )	2019-11-01 13:49:55 -07:00
KeDengMS	6e65dcf588	[NupharEP] symbolic_shape_infer improvements (#2299 ) - Improves symbolic shape inference in the following ways: 1. Extend suggested merge to map to literals with --auto_merge. For example, MatMul of ['ax1', 'ax2'] x [128, 256] would now map 'ax2' to 128. 2. Add --int_max option to simplify computations like Min(100000, 'dim') to be 'dim'. This helps ops like Slice to generate the correct shape, i.e. start=0, end=Min(100000, dim - 2) on dim. It was previously treated as equal, since sympy cannot determine Min(100000, dim - 2) < dim. - Fix a bug in the create_shared script on Windows where the AOT dll was not generated because linking failed when there were too many obj files. - Fix a bug for Split since TOPI does not support split on symbolic dimension. - Some build warning fixes for NupharEP.	2019-11-01 11:34:52 -07:00
Tianlei Wu	bc85d43809	Dump cuda tensor data (#2243 ) * dump cuda tensor * move data_type definition * Dump cuda tensors for cuda build only. Output tensor location (if it is not in CPU or pinned) * update for cuda build * Update for code review feedback * update for CR feedback * use data transfer manager for tensor copy	2019-10-31 21:09:10 -07:00
Scott McKay	7a5de9c958	Add a python script with a number of helper actions for creating/editing/dumping onnx test runner format pb files (#2294 ) * Add a python script with a number of helper actions for creating/editing/dumping onnx test running format pb files.	2019-11-01 06:39:14 +10:00
mikecaraman	358b517d49	[v2] Add ACL (Arm Compute Library) execution provider (#2258 ) * Guard unused parameter Guard unused parameter for Linux Arm and other cases. * Add ACL (Arm Compute Library) execution provider Add a new execution provider targeting Arm architecture based on Arm Compute Library. Validated on NXP i.MX8QM CPU with ResNet50, MobileNetv2 and VGG models. All unit tests are passing. Comparative performance improvements for ResNet50v1 model obtained with onnxruntime_perf_test (ACL vs CPU): A72 16%, 2xA72 9%, A53 21%, 4xA53 13%. Usage documentation available in ACL-ExecutionProvider. * Fix eigen unused parameter Fix eigen unused parameter error for Arm cross-compilation.	2019-10-31 12:25:36 -07:00
Yulong Wang	bf7fa091cc	NonMaxSuppression cuda implementation (#2082 )	2019-10-31 11:53:22 -07:00
Changming Sun	67755adfd8	Bug Fix: NodeArg class has a move constructor but doesn't have a move assignment operator	2019-10-31 10:29:54 -07:00
RandySheriffH	d6849bd26c	Rashuai/cuda top k (#1919 ) * implement cuda topk * implement heap * add type support * refactor interface * add support for sorting by index * add test case * use cub device radix sort * register for opset 9 and 10 * add opset 9/10 declaration * refactor code * refactor code * fix comment * fix comment * switch to scratched mem	2019-10-31 10:26:00 -07:00
Hariharan Seshadri	4bcd8bfca1	Fix CUDA Reduce ops (#2268 ) * Add some tests for Reduction ops * Exclude tensorrt for new tests * Fix bug in CUDA Reduce ops * Fix nit	2019-10-31 10:11:59 -07:00
Changming Sun	a5da5ff6f4	Remove onnxruntime_USE_EIGEN_THREADPOOL cmake option	2019-10-30 21:51:54 -07:00
KeDengMS	ff64d1f55b	Relax check for optimized model saving (#2291 ) So user may save model with layout optimization.	2019-10-30 21:48:49 -07:00
Maik Riechert	ecfbb1bb99	Add missing guards to profiling calls (#1374 ) * guard remaining profiler calls * enforce proper usage of profile class	2019-10-31 14:28:49 +10:00
George Wu	aa041026e3	update Dockerfile.openvino (#2286 ) * install miniconda before openvino installation. add networkx, defusedxml dependencies. * apt-get update * apt-get update * merge Intel changes.	2019-10-30 13:58:24 -07:00
Tomasz Dołbniak	427e627805	Support for the Expand op with constant shape inputs (#2278 ) * Disable the Expand op for non-const shape inputs * Check if an input is constant with IsConstantInitializer	2019-10-30 13:22:45 -07:00
KeDengMS	e18c9582a8	[NupharEP] performance improvements (#2283 ) * [Nuphar EP] performance improvements 1. Add new ops: Shape, Expand 2. Add support for steps in Slice 3. Simplify Gather 4. Always inline alias nodes 5. Transpose nodes with inner loop being symbolic falls back to CPU provider when vectorization is not possible 6. Add opt_inproj option to model_editor to extract MatMuls inside Scan for input projection to outside	2019-10-30 10:15:04 -07:00
zhijxu	63e9961637	fix typo	2019-10-30 09:57:56 -07:00
zhijxu	8dabe0502b	merge two RUN instructions to avoid making the docker image too large	2019-10-30 09:57:56 -07:00
Changming Sun	7b11f05a97	Update version number	2019-10-30 08:13:09 -07:00
Changming Sun	2172a9e5ed	Fix an issue in the nuget run tests scripts	2019-10-30 08:13:09 -07:00
Changming Sun	138a7f194e	Add cleanup step	2019-10-30 08:13:09 -07:00
Scott McKay	002d9c335b	Add node and op type info to error message if there's a type or shape inferencing exception thrown by the ONNX checker (#2277 ) * Add node and op type info to error message if there's a type or shape inferencing exception thrown by the ONNX checker. * Fix line break from auto format * Remove unused param from unit test code.	2019-10-29 23:42:13 -07:00
Ashwini Khade	8d231a32f2	Remove the libc version check in C# code (#2282 )	2019-10-29 21:31:38 -07:00
Scott McKay	983a616bda	Revert to using opset 7 as the default for OpTester. Add explanation as to why that is: (#2256 )	2019-10-30 09:42:21 +10:00
Scott McKay	47f40ca204	Cleanup add/remove of initializer (#2274 ) * Cleanup add/remove of initializer * Address PR comments	2019-10-30 06:50:41 +10:00
Dmitri Smirnov	09eb8ff8b8	make sure samples run (#2276 ) Adjust instructions for imagenet. Fix compile warning. Fix input/output names. Update build/run instructions.	2019-10-29 10:07:35 -07:00
Changming Sun	d1096b524f	Fix fns candy build error on Linux	2019-10-29 02:54:25 -07:00
Nathan	b0359b5622	register where kernel (#2245 )	2019-10-28 15:59:29 -07:00
KeDengMS	5611a528f6	onnxruntime_perf_test: Add -u option to save optimized model (#2227 )	2019-10-28 12:36:31 -07:00
Ryan Hill	3ecdd985cb	MNIST Sample Fix (#2259 ) * Fix Global variable initialization order * Remove static initialization, and add error messages	2019-10-28 11:22:45 -07:00
zhijxu	0b88eff43a	add docker file to build onnxruntime with different execution providers	2019-10-28 10:03:18 -07:00
zhijxu	3926ce33f4	add --parallel to speed up compiling source code	2019-10-28 10:03:18 -07:00
zhijxu	4eda08534f	make smaller step in Dockerfile so image layers can be reused as much as possible	2019-10-28 10:03:18 -07:00
zhijxu	ce23d628a5	fix bug in cmake/onnxruntime_server.cmake	2019-10-28 10:03:18 -07:00
zhijxu	be7c24247f	register execution provider when onnxruntime server creating sessions	2019-10-28 10:03:18 -07:00
pulkittomar	1fa956fb3f	Undo integration test skip (#1917 )	2019-10-27 09:47:31 -07:00
Yuri	a2596b706b	FreeBSD compatibility patch. * Treat the 'amd64' architecture the same way as 'x86_64' * Use thr_self() instead of gettid() on FreeBSD	2019-10-26 12:44:12 -07:00
Dmitri Smirnov	88c58c19d4	Improve code readability and performance. (#2257 ) Remove one time checks from loops. Move out GetType<>() calls from loop as they go through local function statics. Get rid of index calculations from input and output so we can simply advance ptrs and potentially do better pre-fetch. Improve code readability.	2019-10-25 16:19:59 -07:00
Changming Sun	ce14b07b1c	Fix the GPU nuget pipeline failure (#2255 )	2019-10-25 13:55:38 -07:00
Sreekanth Yalachigere	c7599d0705	subgraph parse error fix (#2254 )	2019-10-24 22:14:30 -07:00
shahasad	6a0ee7eff6	Fix model path marshalling in csharp, and re-enable the pretrained model tests (#2236 )	2019-10-24 20:39:16 -07:00
edgchen1	8be48f47dd	Added Graph::ReplaceInitializedTensor() function. (#2230 ) Added Graph::ReplaceInitializedTensor() function to support updating existing initializers.	2019-10-24 19:30:24 -07:00


1543 commits