onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-06-24 02:47:54 +00:00

Author	SHA1	Message	Date
pengwa	6c271c63ac	add test cases for commit c019bb9355a511f471e55e7302b26e1d370ed46a (#1556 )	2019-08-04 17:18:45 +08:00
jywu-msft	8a6bfe00af	roll back model test update for ngraph provider. (#1551 )	2019-08-02 15:53:32 -07:00
Yufeng Li	a098be12ba	Register kernel for Greater int64 (#1546 ) Register int64 for Greater and refactor the register code	2019-08-02 14:01:43 -07:00
Ke Zhang	cb71c69d5e	checking execution provider logic updated. (#1547 )	2019-08-02 13:29:39 -07:00
daquexian	93cb29f958	[WIP] NNAPI EP Update (#1540 )	2019-08-01 22:25:56 -07:00
Scott McKay	9fb8867a24	Don't create implicit input for outer scope value if there is a subgraph input with the same name. (#1186 ) * If there is an outer scope value that matches a subgraph input, don't create an implicit input from the outer scope value. Minor unrelated change for issue noticed while debugging: Use unordered_set for implicit inputs so we don't add them multiple times. * Add unit test based on onnx issue.	2019-08-02 07:23:41 +10:00
Ke Zhang	1cf5ebc4c5	copyfromhost/copytohost are not needed for mkldnn ep (#1532 ) * memcpy is not necessary for mkldnn ep to copy from/to host. * update	2019-08-01 13:22:15 -07:00
Hariharan Seshadri	624411bb69	Upload correct ESRP signed package (#1531 ) (#1534 )	2019-08-01 10:56:18 -07:00
Changming Sun	3045a5f88b	Update test data (#1512 ) * Update test data	2019-08-01 10:42:08 -07:00
Hariharan Seshadri	465b30e3ca	Bug fix for shape of optional output in Dropout op (#1507 ) * Bug fix for shape of optional output in Dropout op * Exclude new test from NGraph EP * Account for the fact that mask could be of different type in different opset variants of the op * Make accompanying Cuda changes * Fix build break * Exclude Opset 7 test for tensorRT EP * PR comments	2019-07-31 22:37:11 -07:00
Hector Li	57e2482089	Fix a bug in Expand cuda op implementation. (#1528 ) Description: crash if the output shape has 0 in it, because the code divides by output_shape[i]. Fix: if the output shape has a 0, output_shape.Size() is 0, so the output should be null.	2019-07-31 21:21:49 -07:00
Ashwini Khade	b599360014	enable sse4.1 optimizations for gemmlowp (#1529 )	2019-07-31 18:44:02 -07:00
Hariharan Seshadri	28a6f6b11b	Add back MacOS leg of the Python packaging job (#1523 ) (#1526 ) * Add MacOS leg of Python packaging job * Update copy files source directory for Mac OS leg * Add a task to display the binaries directories contents after build wheel creation * Revert some changes * Add task to log * Update * Remove unnecessary logs	2019-07-31 15:57:26 -07:00
Hariharan Seshadri	4d768b3a0f	Fix inclusion of ARM binary in the release pkg (#1513 ) (#1521 ) * Fix inclusion of ARM binary in the release pkg * Add lib and pdb as well	2019-07-31 15:57:03 -07:00
shahasad	fb5d0fc538	Publish nuget package to azure blob store (#1525 ) Publish daily build NuGet package to Azure blob store for sharing among internal partners	2019-07-31 14:17:54 -07:00
Tracy Sharpe	0b0e32909a	NCHWc: Enable Conv/Add fusion for stride=2 convolutions (#1518 ) Update the NCHWc graph transformer to allow Conv/Add fusion for convolutions where stride=2.	2019-07-31 12:30:05 -07:00
Scott McKay	14d46ee890	Init prev_Ht for zero length sequence to avoid valgrind warning. (#1516 ) Couple of performance cleanups - don't create debug label string unless dumping matrixes - use raw pointer in fill_n calls	2019-07-31 14:46:00 +10:00
Jorgen Thelin	fb7bdd177b	Profiler-IsEnabled (#1503 ) Avoid use of Hungarian naming convention for cross-platform API code. I'm taking my cue here from the "ONNX Runtime coding conventions and standard" document, which says we use the "Google C++ style guide", and that says "Do not use Hungarian notation" https://github.com/microsoft/onnxruntime/blob/master/docs/Coding_Conventions_and_Standards.md https://google.github.io/styleguide/cppguide.html#Windows_Code X-ref: internal PR 4824	2019-07-30 13:32:01 -07:00
shahasad	a86486ab7f	Post binary sizes to dashboard database (#1517 ) Python script and necessary changes in the azure-pipelines yaml file to post the binary size data from NuGet package build. Currently only posted from CPU pipeline. GPU and other pipelines may be added as necessary.	2019-07-30 08:59:43 -07:00
Pranav Sharma	44ab301586	More C API changes. (#1519 ) * Mention OrtCreateSessionFromArray in C API doc * Cleanup a few inconsistencies in the C API. * updates * More updates	2019-07-29 18:35:28 -07:00
Dwayne Robinson	cf73f63cb9	Enable float16 MatMul+Add -> GEMM fusion for performance boost (#1506 )	2019-07-29 15:18:02 -07:00
Ke Zhang	cf5a4b5856	remove the GetStream from cuda ep. (#1514 ) * remove the GetStream from cuda ep. * fix comments	2019-07-29 15:01:29 -07:00
Yufeng Li	d6a30485be	Rename Tensor.Size() to Tensor.SizeInBytes() (#1502 ) Rename Tensor.Size() to Tensor.SizeInBytes()	2019-07-26 14:15:53 -07:00
Hariharan Seshadri	6f538dc861	Support missing optional attribute in Squeeze operator (#1505 ) * Make Squeeze operator support no axes attribute cases * Fix build break * Resolve PR comments and exclude tensorrt for the new tests	2019-07-26 11:16:35 -07:00
Hector Li	717e764e8e	Move Class CudnnDropout to cudnn_common.h (#1492 ) 1. Move non_max_suppression_test.cc to object_detection folder 2. Move Class CudnnDropout to cudnn_common.h so that it can be shared with other ops. Move the cuda memory allocation part out of CudnnDropout to avoid memory leak.	2019-07-26 10:41:13 -07:00
Emma Yu	8589be69b2	Organized build instructions (#1504 )	2019-07-26 09:12:24 -07:00
xkszltl	33ae28ccb1	An empty double-quoted string `""` is passed to `find_package(Thread)`, causing the test command `gcc ... "" ...` to fail while trying to compile a source file with an empty name. (#1508 ) ``` [user@******** /]# gcc "" gcc: error: : No such file or directory gcc: fatal error: no input files compilation terminated. ```	2019-07-26 03:11:37 -07:00
xkszltl	be16b274fc	Upgrade mklml and set march with official option. (#1469 ) 1. There's a formal way to set march. 2. Upgrade to new MKLML. Besides, the mem patch can be dropped for v1.0.0 since it's fixed upstream.	2019-07-25 19:37:59 -07:00
Changming Sun	1a115ed8cb	Fix buffer overrun bug in CPU upsample op (#1501 )	2019-07-25 17:05:05 -07:00
Hariharan Seshadri	6df4bc2ebe	Update scripts to access pipeline variables correctly (#1499 ) * Update scripts to access IsReleaseBuild pipeline variable correctly * Correct access of PACKAGENAME pipeline variable * Fix Linux CUDA 10 package tests * Enable C# GPU test * Update	2019-07-25 15:30:32 -07:00
Changming Sun	e0829b2b13	Revert the last changes on tree ensemble classifier (#1498 ) Revert PR #1015 and PR #1276 . Because PR #1015 is causing test failures, but I can't revert it individually. I have to revert it together with #1276	2019-07-25 13:55:10 -07:00
Changming Sun	4ace393bea	Fix sign-compare warnings with gcc	2019-07-25 13:00:51 -07:00
Changming Sun	6625eecd09	Temp fix for a crash in fused graph (#1488 )	2019-07-25 13:00:25 -07:00
Changming Sun	a7223ed801	Fix android build (#1489 )	2019-07-25 13:00:00 -07:00
Scott McKay	f052966972	Remove special casing of "None" as a dim_param (#1482 ) * Remove special casing of "None" as a dim_param	2019-07-25 17:18:14 +10:00
Yufeng Li	a8e3ff47fd	Add no scale check for resize and upsample (#1484 )	2019-07-24 22:29:52 -07:00
shahasad	258ff06e42	Revert "publish nuget package to azure blob (#1309 )" (#1485 ) This reverts commit `1601650161`.	2019-07-24 18:07:33 -07:00
daquexian	ec3c553501	NNAPI EP Update (#1483 ) * Update DNNLibrary * Allow fp16 by default * Add nnapi build in ci * Fix nnapi ep after #1268 * Remove unused variables * Support nnapi in onnx_test_runner * Update DNNLibrary to fix tests * Update build.py for android build support, solve conflict of tools/ci_build/build.py * Support non-ARM Android build, solve conflict of tools/ci_build/build.py * Enable android test by x86_64 android emulator * Add dnnlibrary/NNAPI support in build.py * suppress the verbose adb output * Remove debug logs * Install cmake by pip * Fix undefined host_protoc_path * cmake==3.13.2 in pypi is actually 3.12.2, so install 3.13.2.post1 instead * Fix Android ARM64 build * Use android ndk r20 instead of r19c, fix conflicts in install_deps_android.sh	2019-07-24 13:20:05 -07:00
avidiyal	c0f927c57c	docker updated to support openvino R1.1 (#1475 ) * docker updated to support openvino R1.1 * Update README.md Updated Readme to download openvino R1.1	2019-07-24 13:16:23 -07:00
Changming Sun	91d32c9060	Add docs for the fns candy demo (#1479 )	2019-07-24 08:38:04 -07:00
jignparm	a8e9e1878e	Reduce artifacts size (#1477 ) * Update wildcard pattern to match only relevant archives * Update TensorRT build to add CUDA VS extensions	2019-07-23 22:23:51 -07:00
Yufeng Li	bb26865758	Optimize the resize and upsample (#1426 ) Optimize the resize and upsample operators. For an input with shape [1, 128, 267, 200] and scales [1, 1, 1.97, 2], Resize and Upsample get a 15x gain (without: 1020ms, with: 71ms on my local box). It should benefit other scenarios at a similar level.	2019-07-23 21:39:54 -07:00
Pranav Sharma	4aa4ca1502	Relax shape validation checks. Log a warning instead of returning an error. (#1476 ) * Mention OrtCreateSessionFromArray in C API doc * Relax shape validation checks. Log a warning instead of returning an error.	2019-07-23 18:47:46 -07:00
Hariharan Seshadri	c5f2f0f15b	Upgrade version number for ORT in preparation for release (#1468 ) * Update version number to 0.5.0 in preparation for release * Update to README.md to direct to Versioning doc * Resolve PR comment * Remove incorrect line generation * Minor updates to update version script * Minor comment update	2019-07-23 16:33:06 -07:00
shahasad	1601650161	publish nuget package to azure blob (#1309 )	2019-07-23 11:07:35 -07:00
Changming Sun	be02214a17	Add a comment to onnxruntime_cxx_inline.h (#1466 )	2019-07-23 08:45:37 -07:00
jignparm	b41f6eef52	Jignparm/copy cuda extensions (#1462 ) * Add CUDA extensions for v 10.0 * Add CUDA extensions for v 10.0 * update path * change 'vsts' to 'github'	2019-07-22 23:27:48 -07:00
Pranav Sharma	818c023535	Add/correct missing SAL annotations + avoid using unsigned types (except where counts are involved). (#1451 ) * Add/correct missing SAL annotations + other cosmetic changes. * Add Outptr * Don't use unsigned types	2019-07-22 23:25:53 -07:00
Scott McKay	387d4c72bb	Strip invalid dim_param and dim_value values out. Allow re-use in event of shape mismatch if buffer is large enough (#1439 ) * Remove invalid dim_param and dim_value values when creating a NodeArg. * Allow re-use of a large enough buffer if there's a shape mismatch. * Update handling in python to treat unset dimension the same as a dim_param (equivalent to None). * Fix GetTensorShapeFromTensorShapeProto to handle neither dim_param nor dim_value being set.	2019-07-23 14:55:54 +10:00
Jesse Benson	bbe92035c6	OStreamSink produces interleaved output due to issuing multiple operator<< calls to the underlying stream. (#1465 )	2019-07-23 14:54:24 +10:00


11997 commits