onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-01 03:45:06 +00:00

Author	SHA1	Message	Date
Ke Zhang	eb67eadbc6	add initializer for sub-graph. (#269 )	2019-01-02 13:10:33 -08:00
Raymond Yang	8f3a492682	Remove static linked cuda libs (#239 )	2019-01-02 10:39:37 -08:00
Yufeng Li	e2746513ab	Avoid to run profiling code completely if there is no need (#245 ) * add option to build essentials feature for optimization * Check if profiler enabled before calling it * Refine the profiler	2019-01-01 09:34:24 -08:00
Du Li	1e9be01a49	Fix a bug in Conv+Activation fusion (#236 ) * fix a bug * fix a bug * remove node in reverse topologic order. * replace vector with deque * fix bugs in conv+activation fusion * Integrating PR comments.	2018-12-31 16:56:11 -08:00
Du Li	4f49a4ab1b	Maxpool_With_Mask (#267 ) * Adding Op ConvMaxpool * Adding MaxpoolWithMask Op. * Skip length check for the first element * Fix build errors. * Fix build errors.	2018-12-29 20:15:02 -08:00
KeDengMS	b508835e7a	Update CUDA Gemm to opset 9 (#266 )	2018-12-29 19:57:45 -08:00
Tang, Cheng	126c1fd3df	do replacement based on node arg usage, instead of provider type (#263 )	2018-12-29 12:10:08 -08:00
Ryan Hill	6a090985fb	More C API changes (#259 ) * More API changes, remove 'Inference' from function names. Remove enum values. Make Status match other types. * Switch to bool instead of int, and remove stdbool	2018-12-28 14:53:19 -08:00
shahasad	29d03ffb08	Add end-to-end test to run on the nuget package (#252 ) * added end-to-end nuget package test * reset the changes in OnnxRuntime.CSharp.proj * revert the testdata directory path * revert inference tests proj file * added script for running end-to-end tests * fix in the runtest.bat * added error checking in runtest * fixed paths in the test project * added runtest.sh * fix protoc path * updated executable attributes for the runtest scripts * added some log to debug protoc failures * removed the protoc and duplicate test code, reuse unit-test code for end-to-end test * copy always * fix working dir paths in runtest.sh * added a build.py flag to download test data without running the c++ tests * added a script for running the test under docker * added script for docker run of the test	2018-12-27 15:22:46 -08:00
Du Li	8380e56409	Optimizing Gather op. (#262 )	2018-12-27 11:31:43 -08:00
Darío Hereñú	928acc32f1	Minor formatting proposals	2018-12-26 13:46:32 -08:00
Changming Sun	698ebf13a1	Add a missing header (#243 )	2018-12-21 19:06:09 -08:00
Changming Sun	79c623bece	Add more test models	2018-12-21 19:03:43 -08:00
Changming Sun	f3df7e5d32	nullptr	2018-12-21 18:31:22 -08:00
Sreekanth Yalachigere	b93eba17c7	mkldnn: relu, sum and batch norm (#238 ) * mkldnn sum derives cpu sum_6. removed final * MKLDNN Relu, Sum and BatchNorm * PR review changes * more review changes * relu primitive typo corrected	2018-12-21 15:25:10 -08:00
Pranav Sharma	3875511f9e	Fix inefficiencies in the mkldnn kernels. Some of these were (unfortunately) getting replicated in the new kernels. (#241 )	2018-12-21 15:12:23 -08:00
Dmitri Smirnov	7af1887b33	Introduce basic BFloat16 runtime support (#235 ) * Add basic support for BFloat16 type. * Advance onnx submodule for bfloat16 support. * Update install_deps for linux. * Address review comments.	2018-12-21 12:40:59 -08:00
Yufeng Li	4e74ffba91	Add word conv embedding custom op (#229 ) * run bw and fw sequentially for GRU if using MKLDNN * word conv embedding custom op * run bw and fw sequentially for GRU if using MKLDNN * Add word conv embedding custom op * fix build break in linux * fix macos build break * resolve the comments * refine the comments * remove unnessary comment * rename the function to calculate the length of eache word in a sequence * add license info and fix typo	2018-12-21 10:48:51 -08:00
Ryan Hill	a37887cfa1	More intuitive ordering to the API functions (#233 ) * More intuitive ordering to the API functions * Rename TCHAR_T	2018-12-20 13:47:48 -08:00
Tang, Cheng	c453b48b71	update kernel memory type interface (#225 ) * refactor the kernel memory type interface * remove useless change * fix comments in PR	2018-12-20 11:11:50 -08:00
jignparm	a43382e390	Jignparm/csharp gpu (#221 ) * Minor updates to exception message * update models folder to new location * update copy to preservenewest * reenable pretrained test * added some debugging info for build * update pretrained test, and tensor proto definition	2018-12-20 09:58:03 -08:00
ashku-ms	a19b624302	MaxUnpool Operator - CPU Implementation (#177 ) * Initial commit Maxunpool operator * fix gpu build failure * remove op test from excluded list * Change to ORT	2018-12-20 09:44:34 -08:00
jywu-msft	eb867be331	update mkldnn to 0.17.2 (#231 )	2018-12-20 07:32:49 -08:00
KeDengMS	abce6041c1	Print hex value for float compare when test failed (#228 ) This helps identify fp accuracy issues	2018-12-19 21:31:15 -08:00
Tang, Cheng	0dca080238	remove useless internal schema file (#226 ) * placeholder for internal contrib ops * remove useless internal file * fix build break	2018-12-19 18:17:20 -08:00
Weixian	1d95c93987	[optimization] avoid vector copy and reduce allocation. (#203 ) * few. * fix.	2018-12-19 18:16:39 -08:00
Dmitri Smirnov	255ee39af6	Fix memory leak by improper handling of std::string typed (#227 ) output buffer. Tensor returns a buffer to fully constructed std::strings and we should treat them as such.	2018-12-19 17:46:21 -08:00
Changming Sun	e97caa7787	change mkldnn so path (#210 )	2018-12-19 14:45:57 -08:00
ashku-ms	84231ba003	support hyperbolic ops (#223 ) * support hyperbolic fns This commit adds support for sinh and cosh. Support for hyperbolic inverses is not available in Eigen yet. * Make constructors explicit * remove tests from exclude list * Revert "remove tests from exclude list" This reverts commit 2112a30b57d5a899991de4847e948e700a44e85d. * remove test names from excluded list * remove tanh since its already implemented	2018-12-19 14:23:09 -08:00
Edward Chen	e63572c1f3	Updated ArrayFeatureExtractor op to retain old output shape behavior.	2018-12-19 14:12:55 -08:00
Tang, Cheng	94f8f2b05c	placeholder for internal contrib ops (#219 )	2018-12-19 13:54:12 -08:00
KeDengMS	b9cc134576	Make sure tensor sizes are 64-byte aligned (#222 ) This helps reduce misaligned access violation	2018-12-19 13:45:04 -08:00
Bowen Bao	4d010fb1ea	Add null check before calling node.op_->Deprecated(). (#211 )	2018-12-19 11:43:36 -08:00
Changming Sun	ac3a081ec5	Enable release build in Windows CI pipelines (#220 )	2018-12-19 11:12:43 -08:00
Jesse Benson	0248390e4d	Add support for checking for F16C support (https://en.wikipedia.org/wiki/F16C ). (#212 )	2018-12-19 09:25:25 -08:00
Scott McKay	ab350fa4c7	Re-structure the inference session initialization to (#217 ) - apply any transforms to the main graph and any subgraphs first - call Graph::Resolve() once on the main graph, which will recurse into the subgraphs - previously it was called after the transform on each subgraph, which results in it traversing up to the main graph to call resolve, and that resolve call recursing into all subgraphs every time. This avoids lots of unnecessary Graph::Resolve calls, and prevents subgraphs from being broken by SessionStateInitializer::InitializeAndSave calling graph_.CleanAllInitializedTensors() prior to final Graph::Resolve call. If a subgraph has optional inputs the backing initializers were removed by CleanAllInitializedTensors causing the next Resolve to incorrectly turn them into required inputs.	2018-12-19 18:56:35 +10:00
Scott McKay	334e329642	Increment/decrement UseCount for outputs so that we don't prematurely free a re-used output that is used for a dead output (output with zero users). (#214 )	2018-12-19 15:34:37 +10:00
Pranav Sharma	39f47f86ee	Adding the include folder for the C Windows pkg. (#198 ) * Adding the include folder for the C Windows pkg. * Add import lib to the pkg * Disable csharp pretrained tests temporarily	2018-12-18 21:04:42 -08:00
Scott McKay	beb326f00e	Simplify logic around creating relationship between nodes for implicit NodeArg usage. Allows using an initializer from multiple levels up to not fail. We would need to accumulate a list of initializers from all levels up otherwise, and doing so doesn't add any value. (#200 ) Improve a comment to clarify when the parent graph NodeArg lookup kicks in.	2018-12-19 09:25:42 +10:00
Changming Sun	dc8b37f4c4	update onnx (#209 ) * update onnx	2018-12-18 14:50:28 -08:00
Ke Zhang	37b74c771a	add gemmlowp as submodule. (#206 )	2018-12-18 13:57:53 -08:00
Changming Sun	c0ec7d56c4	Clean up garbage files (#208 )	2018-12-18 13:51:01 -08:00
jignparm	d4131a31d9	Disable csharp pretrained tests temporarily (#207 )	2018-12-18 21:35:23 +00:00
edgchen1	0aa1b54aaa	Updated build.py to support relative CMake/CTest paths and did some minor cleanup. (#205 )	2018-12-18 13:23:32 -08:00
Ryan Hill	773114a4f1	More C header naming changes (#202 ) * More Ort prefix changes for consistency * Fix C# methods * More C# fixes	2018-12-18 11:39:46 -08:00
jignparm	0287019c1d	C# Gpu : Minor updates to exception message (#201 ) * Minor updates to exception message * update models folder to new location * update copy to preservenewest	2018-12-18 00:30:27 -08:00
edgchen1	71c56b6d7c	Fix array feature extractor out of bounds access issue (#194 ) * Fixed out of bounds access in ArrayFeatureExtractor. * some cleanup * Updated tensor_shape.h comments. * Updated macro name. * Added copy assignment, move assignment/ctor to TensorShape. * Removed i64 literal suffix. * Fixed test. * Fixed type of x_num_dims.	2018-12-18 00:30:07 -08:00
Changming Sun	8c5d105557	fix typo	2018-12-17 18:33:36 -08:00
KeDengMS	b0f27ba0a7	Allow using MKLML header/libs when use_mklml is specified (#178 ) Allow using MKLML header/libs when use_mklml is specified	2018-12-17 14:41:42 -08:00
Randy	d0544a8082	Rashuai/gathernd op (#170 ) * define gather_nd op * add test cases * add test file * refactor the code and doc * add test cases * fix win compile err * fix win compile err * adjust indent * make constructor explicit * add coment * remove templates * remove wrong def * migrate macros * fix an issue in shape inference	2018-12-17 13:47:20 -08:00

1 2 3 4 5 ...

261 commits