onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-05-28 22:56:32 +00:00

Author	SHA1	Message	Date
Ke Zhang	13d8558404	move environment.h/cc from framework to session project/folder. (#1241 )	2019-06-17 18:01:21 -07:00
Ke Zhang	d448f39832	refactor a little bit to not ask every execution provider to give an implementation of getkernalregistry. It's not needed for a runtime-based exp actually. (#1231 )	2019-06-17 13:50:07 -07:00
Ryan Hill	38963d81eb	Simplify some CustomOp code (#1206 ) test_inference.cc reformatted with clang-format	2019-06-11 17:00:04 -07:00
Scott McKay	87d65389e6	Add ability to change the logging severity of the default logger. (#1165 ) Add ability to set the session and run logger severity via SessionOptions and RunOptions Inherit severity from the next logger up if logger severity isn't specified in SessionOptions or RunOptions Expose ability to set default logger severity in python bindings.	2019-06-12 08:54:03 +10:00
Ryan Hill	3c3186c761	Convert more C APIs to return OrtStatus (#1194 ) * Change SessionOptions APIs to always return a status, for consistency and ease of use (a couple returned 0 or -1 for success/failure)	2019-06-10 18:36:04 -07:00
Changming Sun	d8ac0d64d0	Make C API capable of defining CUDA custom ops (#1178 ) Recreate the PR on behalf of Rui Xia, for #779	2019-06-06 13:45:32 -07:00
Ryan Hill	b68bb51dd0	Change SessionOptions APIs to always return a status (#1171 ) * Change SessionOptions APIs to always return a status, for consistency and ease of use (a couple returned 0 or -1 for success/failure)	2019-06-06 13:24:24 -07:00
G. Ramalingam	b23ab6a06e	Implementation of sparse tensor (#1121 ) * Initial implementation of sparse tensor * minor cleanup * minor cleanup (remove empty line) * simplify template usage in test-case * address linux build error * fix constructor order to address compiler warning * Address PR comments * handle allocation in optimizer execution frame * address compiler warning message and PR feedback comment * address gcc unused warning for protobuf code * address PR comment	2019-06-06 11:50:38 -07:00
Klein Hu	1a86421aff	Create a syslog sink for logging in !Win32 env (#1163 ) * Create a syslog sink for logging in !Win32 env * Move syslog level logic to syslog_sink.c	2019-06-06 16:35:06 +10:00
Ryan Hill	148141dd5f	Change Compute function to return a status code instead of an integer. (#1139 )	2019-06-04 08:34:32 -07:00
Changming Sun	c18de6817b	Rename MLValue to OrtValue (#1154 )	2019-06-03 17:29:55 -07:00
Du Li	05110a6558	Adding custom op ConvTransposeWithDynamicPads. (#638 ) * Adding custom op ConvTransposeWithDynamicPads. * Adding custom op ConvTransposeWithDynamicPads. * adding cuda kernels * fix a bug * fix build issue. * Integrate PR comments.	2019-05-31 11:48:43 -07:00
Ryan Hill	f9f6818e4c	Add comments and organize the C++ header into the main header plus a separate one of the inline methods. (#1130 )	2019-05-30 14:24:25 -07:00
Ke Zhang	ef66395060	remove unnecessary lock and code clean up (#1114 ) * refactoring the ep codes. * remove unnecessary lock. * fix the comment to claim KernelRegistryManager is not thread safe. * clarify that APIs to add custom op in inferencesession is not thread safe.	2019-05-29 13:20:38 -07:00
Konstantinos Karanasos	ee6217972b	Fix when rewrite rule gets registered to multiple op types; update constness of rule methods; enable dropout elimination (#1098 )	2019-05-24 13:47:55 -07:00
Ryan Hill	9129a652c5	Ryanunderhill/cxx api2 (#1091 ) More C++ API improvements and cleanup Add templates to tensor creation Add run method that allows preallocated outputs Simplify CreateTensor<T> to multiply by sizeof(T) Convert io_types code Optimize away vector copies in Session::Run	2019-05-24 11:15:51 -07:00
Ryan Hill	3a32b0eb99	Change function kernels to use CustomOp APIs (#1020 ) * Change function signature * Convert compute to use custom op style APIs * Remove dead CustomOp function * Use CustomOp API in TensorRT EP * Switch to new API in ngraph	2019-05-20 14:57:43 -07:00
Changming Sun	2663b9c443	Remove unnecessary casts from OrtValue to MLValue(#1051 )	2019-05-17 07:52:59 -07:00
Ke Zhang	a7039601c4	setting input/output for partitioned function subgraph (#1019 ) * setting input/output for partitioned function subgraph * fixed TensorRT related failure * fix failures for tvm and ngraph * update	2019-05-17 11:19:28 +08:00
Changming Sun	99556b111d	Make MemPatternPlanner on/off switchable in model weight loading (#989 )	2019-05-16 14:39:09 -07:00
Changming Sun	e4225f8234	Combine OrtValue and MLValue into one type (#1043 ) * Combine OrtValue and MLValue into one type	2019-05-16 10:22:49 -07:00
Changming Sun	5062be612e	update (#1041 )	2019-05-15 16:41:41 -07:00
Changming Sun	e42099480e	Clean up code (#1033 ) Some trivial changes for making gcc compiler happy. Won't impact runtime behavior.	2019-05-15 10:00:39 -07:00
Ryan Hill	3408494407	More C++ API improvements and conversions (#998 ) * More C++ API improvements and conversions * Mark more constructors as explicit * Fix CSharp function name changes * Change more test cases to use C++ API	2019-05-13 13:56:54 -07:00
Mika Fischer	c0acb8b6c3	Allow loading model from in-memory byte-array (#718 )	2019-05-11 05:58:50 +08:00
Ryan Hill	f73ce305e9	C++ wrapper for ABI (#958 )	2019-05-03 19:32:46 -07:00
Konstantinos Karanasos	feab3088fb	Conv(Add\|Mul\|BN)Fusion as rewrite rules (#863 ) * Converted ConvAddFusion, ConvMulFusion, and ConvBNFusion to rewrite rules * Extended graph_utils::RemoveNode * Introduced RewriteRuleEffect enum	2019-05-01 13:23:29 -07:00
Scott McKay	0ad940027c	Use ConstPointerContainer for Node::ImplicitInputDefs() for better consistency with InputDefs() and OutputDefs(). (#894 )	2019-05-01 14:22:28 +10:00
Konstantinos Karanasos	1b7d1f2645	Convert constant folding to a transformer (#866 )	2019-04-29 18:12:49 -07:00
Ke Zhang	f39a8d1f59	allow users to set graph inputs and outputs fully. (#905 ) * allow users to set graph inputs and outputs fully. * update * update the comments of the APIs * update * remove commented-out codes. * fix test failures. * fix comments. * adding more check to throw not support exception right now.	2019-04-29 15:58:39 +08:00
ybrnathan	b0a37477db	Fix memory corruption issue when CPU->CUDA memcpy is involved (#879 )	2019-04-22 20:21:14 -07:00
Yufeng Li	0bf12e9dbf	Add option to enable/disable memory pattern back (#872 ) Memory pattern doesn't work for parallel executor by design. Enabling Memory Pattern for parallel executor logs warning and make the perf bad. Add option to enable/disable memory pattern back.	2019-04-22 13:49:41 -07:00
nivas-x86	a4d7052aeb	Add nGraph Execution Provider (#832 ) * Add nGraph Execution Provider * feedback changes 1 * feedback2 * Feedback and upgrade nGraph * Feedback 4 * Fix CI * Disable new ops	2019-04-20 17:02:35 -07:00
Konstantinos Karanasos	ada90086f7	More efficient rule-based transformer (#815 ) Introduce a quick pre-filtering of rules based on the node op types they are targeting. The goal is to avoid evaluating all rules for all nodes. Instead, for each node, we will only be evaluating the rules associated with its op type.	2019-04-18 17:10:13 -07:00
shschaefer	ff253631b5	Enable use of session based threadpool. (#854 ) * Enable use of session based threadpool. * Fix build dir issue	2019-04-18 10:20:46 -07:00
Ke Zhang	951c428ee1	Simplify the validation in Run call (#850 ) * Simpplify Run() * remove the lock * remove a file added wrongly. * fix tests * fix c# test	2019-04-18 08:38:17 +08:00
Ke Zhang	41dc3130f5	no need putting initializers (for constant node) into graph inputs. (#665 ) * constant node should not be put into graph inputs any more. * simplify graph input/output set logic. * refactor comments. * remove adding initializers as graph inputs when creating graph from scratch.	2019-04-17 07:38:08 +08:00
Ashwini Khade	14d63b5f45	generate transformers bug fix (#838 ) * fix graph transformer generation * add more tests * cosmetic changes * more changes per review	2019-04-16 14:10:33 -07:00
Tracy Sharpe	f19d9a4907	Reduce code size of kernel registration (#833 ) Some changes that reduce the size of the release onnxruntime.dll by 170KB: Change the ONNX_OPERATOR_KERNEL macros to not create a unique virtual class per kernel create lambda, but instead use a generic class with the raw function address supplied at BuildCreateKernelInfo time. Changed the exceution providers to use a table driven approach to calling the BuildCreateKernelInfo functions instead of a massive function with construct/call/delete sequences. The CreateFunc in data_types.h didn't need to be a std::function, eliminating more lambda virtual classes. N.B. To accommodate MSVC 14.11 toolchain (used for CUDA builds), the operator+() syntax cannot be used to retrieve the raw function address. The older toolchain can't resolve between cdecl/vectorcall and gives up. An explicit cast is needed to help the compiler along.	2019-04-15 16:39:59 -07:00
Tracy Sharpe	c55e2de593	Status class optimizations (#824 ) optimize onnxruntime::common::Status to reduce code size	2019-04-11 21:57:01 -07:00
Ryan Hill	fda1d0dce9	Ryanunderhill/ocr custom op (#744 ) * Adding a custom op interface to the C API to remove shared library dependency. * Remove old custom op test * Rework how custom ops handle inputs/outputs to enable custom op output shape calculation in the compute method * Add a nicer C++ API for custom ops and switch the tests to use it.	2019-04-05 18:53:20 -07:00
utsabsingharoy	36ed91ee9f	CustomRegistry should use composition instead of inheritence CustomRegistry should use composition instead of inheritence	2019-04-05 14:14:10 -07:00
Konstantinos Karanasos	512cfdd9fe	Generalize node removal (#743 ) Generalize node removal method in graph_utils. This is a higher-level method that keeps the graph consistent so that no Resolve is needed after the removal of a node. The new method supports the removal of nodes with a single input (be it an incoming node or an initializer) and a single output (but allowing multiple output edges of that output). It also takes into account the case that one of the output edges is fed to a subgraph. Also updated the rewrite rules to use this new, less restrictive method, and improved the rules' conditions. Introduced a GraphEdge struct to simplify various methods in graph_utils.	2019-04-03 22:34:20 -07:00
Ahmad El Husseini	e643ce0e08	Fix inconsistent dimension data type in C-API (#726 ) * update dimension type * update dimension type for items added after 0.2.1 * fix gpu build	2019-03-29 00:23:25 -07:00
Ashwini Khade	77b981824a	fix graph transformers and refactor tests (#696 ) * fix graph transformers and refactor tests * fix merge master * Set default optimization level to Level1 * fix build warnings for Linux * try root cause tensorrt test failures * try root cause tensorrt test failure * Test level2 transformers with all CI builds * remove ConvActivation fusion transformer * change default level back to level1 * remove providers from apply api * more changes	2019-03-26 20:38:12 -07:00
Konstantinos Karanasos	a872ba7894	Convert Unsqueeze elimination to rewrite rule + improvements in graph utils and graph transformer utils (#670 ) * Convert unsqueeze elimination to rewrite rule * Simplify the way we register predefined transformers and rules in the inference session (all details are now moved to the graph transformer utils) * Some reorganization and renaming of methods in graph_utils * Updates in graph transformers test * Update in edge removal to not perform unnecessary check of node args that led to race conditions when updating the graph * Improve documentation for rewrite rules * Remove top-down rule-based transformer (given we currently have only one type of rule-based transformer)	2019-03-26 13:58:15 -07:00
Changming Sun	a26696fb0e	Enable LTO on Linux	2019-03-22 15:30:37 -07:00
Ryan Hill	cd52431b8f	Custom op interface to the C API to remove shared library dependency (#668 ) * Adding a custom op interface to the C API to remove shared library dependency. * Fixup const issues * Renaming to make things a little simpler * Add a comment	2019-03-21 15:46:50 -07:00
Pranav Sharma	5d452b3029	Use protobuf-lite to reduce onnxruntime.dll size. (#639 ) * Test protobuf-lite * Test protobuf-lite * Test protobuf-lite * Optimize protobuf usage for LITE_RUNTIME to reduce the binary size of onnxruntime.dll. More details can be found here https://developers.google.com/protocol-buffers/docs/proto. The reduction is significant. For commit id: 4873b452151bafe49da332aaeab639ef0318fc1ca28d728, the size reduced by ~700K; from 4873728 to 4172800. * Add LITE_RUNTIME flag in in.proto files * Fix merge conflict. * Address PR comments * Forgot to add 2 files + fix linux and gpu build errors. * Fix build errors + test failures * Fix cuda tests * Fix tensor rt build * Use full protobuf for trt * Address PR comments * Print tensor shape proto as text string for easier debugging	2019-03-21 14:06:38 -07:00
Scott McKay	a3499083da	Add iterator traits aliases to ConstPointerContainer::ConstIterator (#634 ) * Add iterator traits aliases. * Add a few more pieces to make more compliant with the input iterator requirements.	2019-03-21 15:45:58 +10:00

1 2 3

118 commits