onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-04 04:07:22 +00:00

Author	SHA1	Message	Date
Dmitri Smirnov	d1b1cdc5c4	Replace GSL with GSL-LITE submodule and fix up refs (#1920 ) Remove gsl subodule and replace with a local copy of gsl-lite Refactor for onnxruntime::make_unique gsl::span size and index are now size_t Remove lambda auto argument type detection. Remove constexpr from fail_fast in gsl due to Linux not being happy. Comment out std::stream support due to MacOS std lib broken. Move make_unique into include/core/common so it is accessible for server builds. Relax requirements for onnxruntime/test/providers/cpu/ml/write_scores_test.cc due to x86 build. Add ONNXRUNTIME_ROOT to Server Lib includes so gsl is recognized	2019-10-01 12:43:29 -07:00
Scott McKay	bd2d6af9ca	Filter out info from non-const initializers during shape inferencing (#1806 ) * Don't return shape for non-const initializer in InferenceContextImpl::getInputType Don't return initializer for non-const initializer in InferenceContextImpl::getInputData Update graph_utils to support these scenarios - fix GetConstantInitializer to make sure a name is for an outer scope value before checking a parent graph, as local name could shadow an outer scope initializer.	2019-09-26 13:44:33 +10:00
jywu-msft	686bd36210	Remove ml_status.h, add StatusCode to pybind exception mappings (#1889 ) * initial checkin. * add onnxruntime status code to ort pybind exception mapping. * address review feedback.	2019-09-24 11:13:14 -07:00
Pranav Sharma	1a3ded6a7b	Add C API for free dim override, fix missing API mention in InferenceTest.cs, fix confusing print statement in perf_test. (#1884 ) * Mention OrtCreateSessionFromArray in C API doc * Add C API for free dim override * Add C API for free dim override, fix missing API mention in InferenceTest.cs, fix confusing print statement in perf_test. * Remaining C#files * fix c# build * Run the tests in blame mode. This option is helpful in isolating a problematic test causing the test host to crash. * fix order	2019-09-23 17:58:20 -07:00
Ryan Hill	5781222456	Ryanunderhill/api interface (#1855 ) * Convert ABI to a versioned interface. * Convert ORT_THROW_ON_ERROR to inline function to fix link errors.	2019-09-20 13:39:11 -07:00
Adrian Tsai	a7beed798e	Implement L1 graph transformer for free dimension override (#1825 ) * Implement FreeDimensionOverrideTransformer * Add test * Fix compiler warnings * Update comment * LOGS_DEFAULT * Merge from master	2019-09-20 10:52:14 -07:00
Dmitri Smirnov	6a9ae65f41	Expose GetOverridableInitializers via Python and C/C++ API (#1878 ) Implement GetOverridableInitializers() Add unit test for initializer override. Expose in Python and C/C++ API	2019-09-19 15:43:28 -07:00
Pranav Sharma	a9ce941579	Refine threading control options and move inter op thread pool to session state. (#1841 ) Description: Refine threading control options and move inter op thread pool to session state. Added thread_utils.h/cc to centralize the decision around the thread pool size under various conditions. Motivation and Context Currently the thread pool size of the parallel executor is hardcoded to 32 for some reason. This PR makes the options to configure the thread pool sizes clearer.	2019-09-18 22:36:23 -07:00
KeDengMS	80bda77203	Fix symbolic shape inference for faster_rcnn, mask_rcnn, yolov3 (#1867 ) * Fix symbolic shape inference for faster_rcnn, mask_rcnn, yolov3 Force merge when --auto_merge, on symbolic dims which sympy cannot simplify Add symbolic inference for Resize opset 10 Add support for step != 1 in Slice Add support for computed dim in TopK Bug fixes in passing symbolic dims from subgraph Fix an outdate comment in Nuphar provider header	2019-09-18 14:18:32 -07:00
Bowen Bao	8712a523a4	Bump onnx to latest (#1756 ) * Bump onnx to latest Update onnx.in.proto with changes for SparseTensor. * add temp skip tests * remove passed tests from skip list * skip more tests for new ops in opset 11 * skip crashing tests * update handling of new attribute types sparse tensor and sparse tensors * advance onnx commit and remove skip cpu_flaky_tests * temporarily skip yolo3 model test due to resize opset10 shape inference regression * update proto for onnxruntime server * advance onnx commit further	2019-09-12 11:46:49 -07:00
Pranav Sharma	f8c3442880	Part 2 of renaming AllocatorInfo to MemoryInfo. (#1804 ) * Mention OrtCreateSessionFromArray in C API doc * Part 2 of renaming AllocatorInfo to MemoryInfo. * pr comments * fix comment	2019-09-12 08:19:29 -07:00
Dmitri Smirnov	fe8915863c	Implement C API entry points for creating and fetching non-standard types to OrtValue (#1714 ) C/C++ Opage APIs Add new virtual interfaces for NonTensorType Implement entry points. Add shared header for the data container. Add export symbols. Add serialization/deserialization. Implement model with Opaque types. Rework opqaue_api_test as a standalone executable.	2019-09-11 14:52:47 -07:00
Scott McKay	3b7f047a49	General performance testing tooling improvements (#1577 ) * Miscellaneous updates to help with perf testing	2019-09-11 19:46:59 +10:00
Pranav Sharma	f9d85d654a	Add GetDataTransfer() interface in the EP. (#1773 ) * Mention OrtCreateSessionFromArray in C API doc * Add GetDataTransfer() interface in the EP. * Check return status of RegisterDataTransfer * Address PR comments	2019-09-10 14:07:17 -07:00
Scott McKay	98dbdb1e0b	Rework the feed/fetch copy setup so that it can be calculated prior to subgraph execution (#1761 ) * Rework the feed/fetch copy setup so that it can be calculated upfront by the control flow nodes. Also simplifies how it all works. Update the control flow nodes to do the calculation prior to graph execution.	2019-09-10 15:46:00 +10:00
Scott McKay	2e242a4089	Clarify naming of the API involving the RunOptions terminate flag. (#1768 ) * Clarify naming of the RunOptions terminate flag. * Update C# code to use new names.	2019-09-10 08:32:33 +10:00
Ashwini Khade	b2a2326a45	add dequantize and quantize back to contrib ops (#1712 )	2019-09-06 08:55:42 -07:00
Pranav Sharma	52fe574fed	Rename OrtAllocatorInfo to OrtMemoryInfo to make it more obvious. (#1758 ) * Mention OrtCreateSessionFromArray in C API doc * Rename OrtAllocatorInfo to OrtMemoryInfo to avoid confusion	2019-09-05 14:20:37 -07:00
KeDengMS	c9240f4e93	Implementation of Nuphar execution provider (#881 ) * Implement Nuphar execution provider Nuphar execution provider is a TVM-based compilation provider. It has shown great speedups for RNN models using Scan. This PR is mainly for a preview of the shared codegen library for other TVM-based providers. * Fix submodules * Fix TVM submodule * Update Nuphar to latest and resolve confliction * Remove stale files caused by merge -X theirs * Revert heap buffer change to not introduce onnxruntime_framework into onnxruntime_perf_test * Fix bad merge * Merge from Nuphar * Fix warning treated as error, revert some unnecessary changes * Revert some more test changes * Some more test revert or comments to make review easier New tests could be added later * One more revert of unnecessary changes * More change revert. Test could be added back later.	2019-09-01 23:01:47 -07:00
Changming Sun	81ad48080b	Remove TaskThreadPool (#1713 )	2019-08-28 18:00:10 -07:00
Pranav Sharma	4035fe842e	Don't create the default allocator every single time. Rename API accordingly. Expose Session/Run log severity levels. (#1615 ) * Mention OrtCreateSessionFromArray in C API doc * Don't create the default allocator every single time. Rename API accordingly. * Don't create the default allocator every single time. Rename API accordingly. * updates... * updates... * PR comments * fix typo in license header * fix build	2019-08-23 10:33:20 -07:00
Changming Sun	224dde7ef1	Allow user disable multiple threading (#1647 )	2019-08-19 18:12:39 -07:00
Changming Sun	6b89c7ad04	Let mlas use session thread pool (#1609 ) 1.Let mlas use session thread pool 2.Remove onnxruntime_USE_MLAS cmake option 3. Remove the win32 thread pool code inside mlas mlas will: 1.use ort thread pool if it get passed in 2.use openmp if the threadpool parameter is nullptr 3.run single threaded if the threadpool parameter is nullptr and openmp is disabled.	2019-08-16 13:21:15 -07:00
Dmitri Smirnov	17c8fe44e3	Integrate featurizers (#1573 ) Added Sample Featurizer and Infrastructure Make featurizers and unit tests compile and run with GTest. Create definitions for the first featurizer kernel. Add new operator domain. Create datetime_transformer kernel and build. Move OPAQUE types definitions for featurizers kerneles out to a separate cc. Register them with the type system. Provide unit tests for new AutoML DateTimeTransformer kernel. Make necessary adjustments to the test infrastructure to make it run with new types.	2019-08-15 13:59:59 -07:00
shahasad	0c5d2c998b	Generate documentation from the registered operator kernels (#1395 ) - Added python script for generating markdown doc from the registered opkernels. - Made some conditional changes in the pybind to expose necessary python API - Added some missing type-constraints in the op kernel registrations	2019-08-14 18:12:24 -07:00
Pranav Sharma	8d12ce45cf	Use a friendly enum for graph optimization level. (#1586 ) * Mention OrtCreateSessionFromArray in C API doc * review changes * use enum for graph optimization level * Use explicit values for enums * updates... * Add friendly enum for graph optimization levels in C, C# and Python APIs. * Fix linux build * Fix build breakage due to master merge * PR comments	2019-08-14 17:12:08 -07:00
Ke Zhang	bd64ca3019	Kezhan/execute graph refactoring (#1553 ) * checking execution provider logic updated. * fix the logic of copy input and output. * update * update * update * update * update * update * fix ngraph failure. * fix comments	2019-08-14 01:07:05 -07:00
pulkittomar	a50a63aa9e	Serialize optimized onnx model (#1470 ) * Model serialization * Removed duplicate symbol * Minor update * Review comments * add tests * Model serialization * Removed duplicate symbol * Minor update * Merged PR 1106437: Model Serialization in onnxruntime * Review comments * Merged PR 1107226: Review comments Review comments * add tests * Fixed merge conflict * Correct python tests * InferenceSesssion Refeed Test * Replace use of widechar const literal-L * Fixed failing tests * Updated comment * Removed unnecessary session options * Spell check on comments * Do not serialize when level 3 optimization specified * Updated error logs * Changed log severity to WARN	2019-08-12 18:43:40 -07:00
Pranav Sharma	a6a4c4c079	Fix perf test executable. (#1598 ) * Mention OrtCreateSessionFromArray in C API doc * Fix perf test executable due to removal of certain C APIs * fix linux build * Avoid duplication * Fix mem leak	2019-08-12 09:49:29 -07:00
stevenlix	1c5b15c2b8	Remove memory copy between TensorRT and CUDA (#1561 ) * remove memory copy between CUDA and TRT * add info to RegisterExecutionProvider input * use new IDeviceAllocator for trt allocator * remove SetDefaultInputsMemoryType from TRT EP * remove onnx-tensorrt 5.0 * add submodule onnx-tensorrt branch 5.1 * remove redundancy * Update transformer_memcpy.cc * Update tensorrt_execution_provider.cc * switch to TensorRT 5.1.5.0 * update python binding * disable failed test case on TensorRT * Update activation_op_test.cc * upgrade to TensorRT container 19.06 * update according to feedback * add comments * remove tensorrt allocator and use cuda(gpu) allocator * update onnx-tensorrt submodule * change ci build cuda directory name	2019-08-08 19:31:39 -07:00
Scott McKay	6e430c0526	A few performance improvements coming out of ssd_mobilenet and ssd_resnet34 analysis (#1578 ) * A few performance improvements: - Make the iteration in NonZero more efficient by using a raw pointer and simplifying the increment logic - add another unit test to check the new logic works with 3 dimensional tensor - gains about 2% for ssd_mobilenet - Avoid floating point operations on each iteration on Concat - about 0.5% for ssd_mobilenet and ssd_resnet34 - Put common case first in ExecutionFrame::AllocateAsPerAllocationPlan to avoid unnecessary call to IsSparseTensor - about 0.05% for ssd_mobilenet - Minor tweak to put some ctors in the TensorShape header so they can be inlined more easily	2019-08-08 07:20:00 +10:00
Pranav Sharma	a443b013dd	Remove unneeded C APIs + some refactoring. (#1555 ) * Mention OrtCreateSessionFromArray in C API doc * c api changes after review (1) * updates... * fixes * Reorder include	2019-08-07 11:05:29 -07:00
Scott McKay	9fb8867a24	Don't create implicit input for outer scope value if there is a subgraph input with the same name. (#1186 ) * If there is an outer scope value that matches a subgraph input, don't create an implicit input from the outer scope value. Minor unrelated change for issue noticed while debugging: Use unordered_set for implicit inputs so we don't add them multiple times. * Add unit test based on onnx issue.	2019-08-02 07:23:41 +10:00
Pranav Sharma	44ab301586	More C API changes. (#1519 ) * Mention OrtCreateSessionFromArray in C API doc * Cleanup a few inconsistencies in the C API. * updates * More updates	2019-07-29 18:35:28 -07:00
Yufeng Li	d6a30485be	Rename Tensor.Size() to Tensor.SizeInBytes() (#1502 ) Rename Tensor.Size() to Tensor.SizeInBytes()	2019-07-26 14:15:53 -07:00
Changming Sun	be02214a17	Add a comment to onnxruntime_cxx_inline.h (#1466 )	2019-07-23 08:45:37 -07:00
Pranav Sharma	818c023535	Add/correct missing SAL annotations + avoid using unsigned types (except where counts are involved). (#1451 ) * Add/correct missing SAL annotations + other cosmetic changes. * Add Outptr * Don't use unsigned types	2019-07-22 23:25:53 -07:00
shahasad	768ced703c	Expose provider factory C API, especially for CUDA users (#1461 ) Exposed provider factory C API, for cpu and cuda providers, into the published packages.	2019-07-22 19:03:06 -07:00
Changming Sun	df3a157dd1	Add noexcept to cxx api (#1448 )	2019-07-20 08:33:04 -07:00
Ryan Hill	9e2fa69785	Ryanunderhill/c api string arg (#1436 ) * Add string attribute interface for C API. * Add string attribute interface for C++ API accordingly. * Update comment to say that string is also valid	2019-07-19 19:53:37 -07:00
Scott McKay	07a2466d9f	Use INFO instead of WARNING for an unused graph input. (#1235 ) * Use INFO instead of WARNING for an unused graph input. * Drop severity of unused initializer as well * Update to output a warning level message if removing an initializer that is never used, and an info level message if removing an initializer that optimization has made redundant.	2019-07-15 20:29:30 +10:00
Scott McKay	61b733ce6d	Update optimizers to be able to utilize a constant initializer from an ancestor graph (#1346 ) * Now that we check for a constant initializer in an ancestor graph we also need to be able to retrieve and replace that initializer. Add helpers to do so. Update optimizers to use the new helpers. Fix bug in UnsqueezeElimination where it wasn't checking if the initializer it was replacing was constant.	2019-07-15 12:41:01 +10:00
Ke Zhang	3bf0e364e2	Move CopyTensor out of IExecutionProvider interface. (#1268 ) * add ortdevice class * add data transfer manager for copying tensors. * update * add data trasnfer for gpu * fix constexpr build break. * update * remove unnecessary header files. * remove unnecessary header files. * add dependency * add dependency * add dependency * add dependency * fix linux build break. * update * fix build break * fix build break * fix build break * update * update * update c api. * update to not use OrtCreateAllocatorInfo * change to all eps . * fix linux build break * remove useless codes. * update * move datatransfermanager in session state * update * fix cuda build break. * fix comments * fix windows GPU build. * fix comments * fix build break * fix comments * fix test failure * update * fix comments * fix onnx runtime server. * update * fix test failure. * fix comments * fix comment	2019-07-11 14:49:20 -07:00
Tracy Sharpe	823fa3f39c	Integrate MLAS NCHWc support into ONNX Runtime (#1327 ) This change integrates the NCHWc support recently added to MLAS into ONNX Runtime. When using "-o 3" optimizations, then the runtime will do a NCHWc layout optimization pass to convert standard ONNX operators such as Conv/MaxPool to the com.microsoft.nchwc domain with weights and biases reordered for speed.	2019-07-09 20:41:19 -07:00
Changming Sun	27da857b51	Fix an SAL annotation in onnxruntime_c_api.h	2019-07-09 10:14:58 -07:00
Pranav Sharma	e9ce51ead4	Make GetTensorShapeFromTensorShapeProto return TensorShape and not it's internal representation. (#1353 )	2019-07-08 11:45:55 -07:00
Scott McKay	9d3b6b3a49	Disallow overriding initializers if IR version < 4 (#1324 ) Description: Disallow overriding an initializer via a graph input if the IR version is < 4. This enforces an implicit assumption that initializers should be treated as constant, and allows constant folding to be done on a model with an older IR version. Separate constant and overridable initializers so that it's clear which ones constant folding can utilize. Update Graph to not add all initializers to the graph inputs when the graph is manually created (i.e. not loaded from a GraphProto) and the IR version is >= 4. Motivation and Context In order to do constant folding we need to know which initializers can be treated as constant and which are overridable. All initializers were required to have a matching graph input prior to IR version 4, technically making all of them overridable. The intention however was for them to be treated as constants, and this change enforces that intent. The benefit of doing so is that constant folding will work for models with IR version < 4. The cost is that if someone is actually overriding an initializer they will need to update the IR version of their model to version 4 in order to keep doing so. The belief is that this is a very small subset of usage (e.g. models involving feeding in a truncated sequence) and the cost to update that small subset is warranted by the benefit of constant folding being able to be enabled on all older models without them needing an IR version update.	2019-07-03 18:43:38 +10:00
daquexian	c65489a47f	Initial PR for NNAPI execution provider (#1220 ) * init * Update DNNLibrary * Update DNNLibrary, set compiler flags, it compiles now * Add more missing flags, add test * Update DNNLibrary * Update Compile method, fix allocator and some other bugs * Update DNNLibrary * Implement CopyTensor * Not delete state explicitly since it is managed by unique_ptr * Add the missing files when SingleUnitTestProjct is ON * misc changes * Fix wrong name in provider factory * Add my own test * Update the code of add node into graph, and add the missing initializer into graph * Fix the bug that re-build the graph produces extra output * Update DNNLibrary * Transpose nchw (ONNX) -> nhwc (NNAPI) * Add license * Add GetSupportedNodes method (implement it later) * Rename onnxruntime_nnapi_test->onnxruntime_nnapi_squeezenet_test * Update squeezenet_test.cpp after rebase master * Remove squeezenet_test.cpp since it is almost same with the c++ sample * Update DNNLibrary for GetSupportedNodes * Update GetSupportedNodes * Revert "Remove squeezenet_test.cpp since it is almost same with the c++ sample" This reverts commit a97575fd9ff49e50ba1dc8d8154790d8cd86c48d. * Update DNNLibrary * Fix multiple outputs bug * Remove GetKernelRegistry * Revert "Revert "Remove squeezenet_test.cpp since it is almost same with the c++ sample"" This reverts commit 2a0670e9cbf10ea654111ce39e198a4be0ddd838. * Set default memory type of NNAPI EP * Add CPUOutput allocator * Update DNNLibrary for multiple outputs * Fix bug of nhwc->nchw * Remove GetExecutionHandle()	2019-07-02 06:03:29 -07:00
Scott McKay	4d765dc6d0	Return error message from status instead of swallowing it. (#1221 ) * Return error message from status instead of swallowing it. * Return OrtValue* from OpKernelContext::GetOrCreateOutputMLValue * Add unstaged change.	2019-06-22 06:26:42 +10:00
Changming Sun	766c6b6163	Add an API for retrieve ORT version (#1263 ) * Add an API for retrieve ORT version	2019-06-20 15:42:12 -07:00

1 2 3 4

170 commits