This change transitions from the ORT-only DML NuGet (hosted on the onnxruntime_public feed) to the new unified DirectML NuGet (Microsoft.AI.DirectML) on nuget.org. In addition, the Microsoft.AI.MachineLearning (WinML) and Microsoft.ML.OnnxRuntime.DirectML packages now take a dependency on the Microsoft.AI.DirectML package. This means the extra copy of the DML binaries can be removed from these packages, since they will be installed by the DML package.
* add case for cpu custom op on gpu
* format doc
* restrict GPU custom op on Linux GPU CI only
* separate .cu file into an independent project
* fix typo
* include cuda_add lib
* move lib def
* add file header
Co-authored-by: RandySheriffH <rashuai@microsoft.com>
* add profile caching to improve engine caching feature
* Add comments
* fix typo
* add decryption for engine caching
* Update tensorrt_execution_provider.cc
* Update tensorrt_execution_provider.cc
* Update tensorrt_execution_provider.cc
* Update tensorrt_execution_provider.cc
* Update tensorrt_execution_provider.cc
* update onnx-tensorrt submodule
* set opt profile to max value of the range
* add hash to engine/profile name
* Add calibration based INT8 quantization
* add an option to enable both FP16 and INT8
* Update tensorrt_execution_provider.cc
* add env variable to specify calibration file name
* clean up code
* Add comments and update TRT document
* enable tensorrt basic test and add EngineCachingTest
* clean up
* update environment variable in the test
* clean up
* Enabling Multi Device support for UEP
Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com>
* Minor fix added
*Added a simple fix to determine OpenVINO
version for Arm build as well
Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com>
This PR updates the ThreadPool API to support multi-loop parallel sections. As with the OpenMP "parallel" construct, this allows per-loop work to be amortized over a series of loops. For ORT, it also promotes locality between successive loops in the sense that iteration X of one loop will tend to run on the same worker thread as iteration X of preceding loops.
The change was developed while optimizing the implementation of a model that performed better with OpenMP. Profiling indicated that OpenMP was providing lower loop entry/exit costs and that, via OpenMP's static scheduling, it was leading to a lower L2 miss rate in the series of parallel loops used in GRU.
The main changes are:
- Addition of ThreadPool::ParallelSection and underlying support in the modified Eigen thread pool.
- In EigenNonBlockingThreadPool.h, refactoring the RunInParallel method to support two variants: one that takes an existing parallel section object created by the caller, and another (used by default) that creates its own parallel section.
- Simplify ThreadPool::LoopCounter (used by worker threads to claim loop iterations), basing it on an ID supplied by the underlying Eigen thread pool for affinity in a series of loops.
- Fix a possible perf issue where a loop with iterations scheduled in batches would have more threads than batches available.
- Use of parallel sections in the GRU operator.
- Additional test cases in threadpool_test.h.
- Additional comments at the top of threadpool.h and EigenNonBlockingThreadPool.h.
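The iteration-claiming idea behind ThreadPool::LoopCounter can be sketched as below. This is an illustrative, self-contained version (the struct and method names mirror the concept, not ORT's actual implementation): worker threads claim contiguous batches of loop iterations from a shared atomic counter.

```cpp
#include <algorithm>
#include <atomic>
#include <cstdint>
#include <utility>

// Hypothetical sketch, not the ORT code: workers call Claim() repeatedly,
// each call handing back a half-open range [start, end) of iterations.
struct LoopCounter {
  std::atomic<uint64_t> next{0};  // next unclaimed iteration
  uint64_t total;                 // total number of iterations in the loop
  uint64_t batch;                 // iterations claimed per call

  LoopCounter(uint64_t total_iters, uint64_t batch_size)
      : total(total_iters), batch(batch_size) {}

  // Returns the claimed range; start == end signals the loop is exhausted.
  std::pair<uint64_t, uint64_t> Claim() {
    uint64_t start = next.fetch_add(batch, std::memory_order_relaxed);
    if (start >= total) return {total, total};
    return {start, std::min(start + batch, total)};
  }
};
```

Claiming in batches keeps contention on the shared counter low while still letting threads that finish early pick up remaining work.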
* Implement Hetero in UEP
* Added security checks to take valid Hetero combinations
as device type
* Integrating Hetero features
* Get the statistics Report in Debug Mode
Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com>
* Passing right device type for vadm_backend
Added a simple fix to pick the right device type
when using vadm_backend with Hetero as well.
Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com>
* Fixed batching logic for 2020.4 and above
* Fixed flake8 PEP8 errors
Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com>
* Minor Fixes Added
*Added security checks for device_type passed
in for Hetero build during run time
*code cleanup
Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com>
* Minor changes Added
*Fixed batch_size bug in vadm_backend
*code cleanup
*Documentation updated for Hetero
Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com>
Co-authored-by: suryasidd <surya.siddharth.pemmaraju@intel.com>
* add case for cpu custom op on gpu
* format doc
* restrict GPU custom op on Linux GPU CI only
* separate .cu file into an independent project
* fix typo
Co-authored-by: RandySheriffH <rashuai@microsoft.com>
Description: This PR makes three changes to the ThreadPool class to clean up issues identified during performance analysis and optimization. (1) It uses the _mm_pause intrinsic in spin loops, helping avoid consuming pipeline resources while waiting. (2) It reorganizes the spin-then-steal loop for work distribution so it starts out spinning as intended, rather than starting by trying to steal. (3) It updates the ThreadPool class's API to use static methods consistently for public functions. The PR includes minor doc updates and corresponding changes to test cases.
Motivation and Context
The change helps ensure consistency in behavior between the OpenMP and Eigen-based implementations. Unlike the instance methods, the static methods abstract over the different ways in which threading can be implemented; they will map onto the OpenMP or Eigen-based implementations when threading is used. When threading is not used they will run work sequentially.
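The spin-loop pattern from change (1) can be illustrated with a small self-contained sketch. This is not ORT's code; the function names and spin count are assumptions chosen for illustration: spin with the pause intrinsic for a bounded number of iterations, then fall back to yielding.

```cpp
#include <atomic>
#include <thread>
#if defined(__x86_64__) || defined(_M_X64) || defined(__i386__)
#include <immintrin.h>
#endif

// Hint to the CPU that we are in a spin-wait loop, freeing pipeline
// resources for the sibling hyperthread. No-op on non-x86 targets.
inline void SpinPause() {
#if defined(__x86_64__) || defined(_M_X64) || defined(__i386__)
  _mm_pause();
#endif
}

// Illustrative spin-then-yield wait: spin briefly on the flag, then
// yield to the OS scheduler until the flag becomes true.
bool SpinWait(std::atomic<bool>& flag, int spin_iters = 4000) {
  for (int i = 0; i < spin_iters; ++i) {
    if (flag.load(std::memory_order_acquire)) return true;  // satisfied while spinning
    SpinPause();
  }
  while (!flag.load(std::memory_order_acquire)) {
    std::this_thread::yield();  // fall back once spinning has not paid off
  }
  return true;
}
```

The bounded spin keeps latency low when work arrives quickly, while the yield fallback avoids burning a core indefinitely.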
* Enabled multi-threading for OpenVINO EP
->Enabled support for concurrent_session_runs
*Run UEP using concurrent_session_runs > 1
*Enabled support for ORT_PARALLEL ExecutionMode
->Documentation Added for Enabling MultiThreading
Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com>
* Minor Fixes added
*Configure the value of nireq during Runtime
*Documentation typos rectified and details
added for Multi_Threaded Inference
Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com>
* Some checks added for this fix
*Added checks to reject an invalid nireq value
and fall back to the default value of 8
*Added new config options for enable_vpu_fast_compile
which were changed w.r.t OpenVINO_2021.1 Release
Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com>
* updating examples with current api calls
* Fixing capitalization in api calls, adding RKNPU update
* Correcting nuphar and rknpu ep api calls
* Include creating session in readme
* Cmake changes for 2021.1
* added new ov version 2020.1 for faster rcnn
* Added missing defs
* equal op modified
* changes to incorporate faster rcnn
* backend util.cc
* hddl_plugin_config.hpp is deprecated; use hddl_config.hpp instead
* changing myriad precision bool to i32
* gather is not enabled for gpu
* conv2D and pooltest auto_pad attribute should not be null
* negative indices are not valid for scatter op in myriad
* non max suppression op only supported in faster rcnn mode
* maxpool indices output is not supported
* Cleaned redundant code in backends
* Added ifdefs for HDDL config
* cast output dimensions check
the TopK operator's k input seems to be resolved only for MYRIAD, as it is
throwing issues for Mask R-CNN; needs verification
* we are limiting the subgraph size to 3 here
* taking care of review comments
* Fixed minor bugs
* Modified Slice op checks
* Added NonZero, Upsample
* Removed TopK if it's in the middle of a subgraph
* incorporated upsample conditions too
* Dockerfile changes for 2021.1 release
* dockerfile apt-key update
* Minor fixes
* ceil condition added again
* Fixed few gpu models
* Disabled LSTM and yolov3 in ModelTests
* python softmax cross entropy tests and negative log likelihood
* Update Build.md
Updated for openvino 2021.1
* Update OpenVINO-ExecutionProvider.md
update openvino execution provider for 2021.1
* Update README.md
updated new openvino version
* Update Dockerfile.openvino
added environment variable for DEBIAN Frontend
* Fixed myriad models
* Fixed gather condition
* Fixed mask rcnn model on myriad
* Modified Gather condition
* set default target of MCR dockerfile to MYRIAD_FP16
* Fixed tinyolov3 on CPU
* Update OpenVINO-ExecutionProvider.md
update openvino execution provider documentation
* Update Dockerfile.openvino
Removed environment variable
* Update OpenVINO-ExecutionProvider.md
update image manipulation networks supported
* Update onnx_backend_test_series_filters.jsonc
removed test_upsample_nearest from cpu test cases
* New InternalCI changes for 2021.1
* Full protobuf removed for OpenVINO
* Protobuf added
* Updated with apt installation for openvino
* Revert the testing changes
* Reverted testing changes
* File permissions are changed back to original
* Deleted openvino installation and cmake change
* Optimized Dockerfile
Removed unnecessary cmake installation, numpy
* Added missing ifdefs
* delete array fix
* backend_utils.cc output_shape
* Revert "set default target of MCR dockerfile to MYRIAD_FP16"
This reverts commit 928d3e2b71e2f589cf51dacd3a133951cf9ca18d.
Co-authored-by: suryasidd <surya.siddharth.pemmaraju@intel.com>
Co-authored-by: sfatimar <sahar.fatima@intel.com>
Co-authored-by: suryasidd <48925384+suryasidd@users.noreply.github.com>
Co-authored-by: S. Manohar Karlapalem <manohar.karlapalem@intel.com>
Co-authored-by: Aravind <aravindx.gunda@intel.com>
Co-authored-by: Aravind Gunda <38353114+gundaarx@users.noreply.github.com>
* Fix Windows AI version
* Update text to extend telemetry coverage
Includes all official binaries
* Update text about EP pluggability
* Update CUDA/cuDNN versions
* Add link to reduce operator kernel page
* Update roadmap
* Add preview for migraphx
* Move Rockchip under IoT/Edge
* Update text to include ORT for Mobile doc link
* Allow sharing of initializers between sessions.
* Allow sharing of initializers between sessions (2).
* Add test for C#
* Add test for C#; address PR comments
* Address PR comments
Moved AddInitializer logic to internal session options
Added tests for owned buffer
Clarified documentation
Fix bug where memory info and not device was getting compared
* Fix test
* Fix training build
* Add ver 5 end marker and ver 6 starter, add scenario and usage examples.
* Added config flags for VPU Fast Recompile
* clean-up ifdefs
* Add VPU Fast compile config option
Adds an option that enables fast compilation of models to the VPU
hardware-specific format.
* Add config option to choose specific device id for inference
Inference of all subgraphs will be scheduled only on this device
even if other devices of the same type are available.
* Add Python API to list available device IDs
* code cleanup
* Add second C/C++ API with settings string parameter
Adds an additional C/C++ API that allows passing multiple
key-value pairs for settings as a single string. Multiple
settings are delimited by '\n' while the key and value
within a setting are delimited by '|'.
* Append 'Ex' to the extended C/C++ API
* Use set_providers Py API to set config options.
Uses Session.set_providers Python API to set EP runtime config
options as key/val pairs
Deprecated older module function definitions for config settings.
Updates documentation.
* avoid globals for py config options where possible
Co-authored-by: intel <you@example.com>
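The settings-string format described above (multiple key-value pairs in one string, settings delimited by '\n' and key/value delimited by '|') can be parsed with a sketch like the following. The function name and the handling of malformed entries are illustrative assumptions, not the EP's actual code.

```cpp
#include <map>
#include <sstream>
#include <string>

// Illustrative parser for a settings string such as
// "device_type|CPU_FP32\nnum_of_threads|8".
// Malformed entries (no '|' or empty key) are skipped here; real code
// might instead report an error.
std::map<std::string, std::string> ParseSettings(const std::string& s) {
  std::map<std::string, std::string> out;
  std::istringstream lines(s);
  std::string setting;
  while (std::getline(lines, setting, '\n')) {
    auto sep = setting.find('|');
    if (sep == std::string::npos || sep == 0) continue;  // skip malformed entry
    out[setting.substr(0, sep)] = setting.substr(sep + 1);
  }
  return out;
}
```

Packing the settings into a single delimited string keeps the C API surface small: one parameter carries an arbitrary number of options.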
* Add minimal build option to build.py
Group some of the build settings so binary size reduction options are all together
Make some cmake variable naming more consistent
Replace usage of std::hash with MurmurHash3 for kernel hashing; std::hash is implementation-defined, so it can't be used.
Add initial documentation and an ONNX-to-ORT model conversion script
Misc cleanups of minimal build breaks.
* cancel night build on pyop
* setup ci pipeline for build of reduced ops
* add back c# test
* remove debugging print
* add testing model
* add more arg in pipeline script
* disable pipeline trigger temporarily
* fix yaml format
* fix yaml format
* fix pipeline error
* rid c# test
* add ops for test cases
* add Conv from domain com.microsoft.nchwc
* remove --reduce_ops
* fix typo
* remove --build_java
* add test case for excluded op
* update doc with --skip_test
* formatting code, renaming files and simplify yaml
* remove debug build from yaml
* remove surplus ops from included_ops.txt
* add MinSizeRel build to yaml
* rename test cases and models
* exclude ir test from minimum build
* restrict ir test to be only applied to reduced ops build
* Add support for sharing allocators
* Incremental update
* Address some PR comments, add unit tests, add documentation.
* Address PR comments, add tests and some documentation.
* Fix build and test issues
* Remove the RegisterAllocator API, restoring the OrtAllocator interface changes. Changed docs to reflect this.
Also fixed the orttraining segfault. The segfault occurred because, in the case of a training session,
the CPU execution provider is not available at the time the transformers are applied. Changed it to create
a new one.
* cancel night build on pyop
* add rewriter to rewrite cpu provider
* skip BuildKernelCreateInfo<void>
* refactor variable name and comment
* include ops from csv file
* process multiple eps
* add default function to cuda provider
* rename function and add license header
* fix import
* add doc
* fix typo
* deal with empty kernel entry in cuda
* rename the rewriter file
* add comment into provider file
* add comment and rename function
* log warnings
* refactor extracting logic
* add entry for script to run solo
* add better example
* avoid onnx importing
* fix flake8 alerts
* minor fixes to better comments and doc
* add entries for all domains
* add void entry into contrib providers
* format cuda_contrib_kernels.cc
* format cpu_contrib_kernels.cc
* add all providers
* add default entry to all providers
* include op_kernel header
* cancelling change in providers beyond cpu/cuda
* rename file and switch file format to domain;opset;op1,op2...
* update doc
* restore non-regular ending grammar in cuda_contrib_kernels.cc
* add ort_root as input argument of script
* enable test in ci
* update doc
* update doc
* revert change on linux gnu ci
* switch to set to host ops
* simplify trimming logic
* add domain map to track current model
* allow ort_root to take relative path
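The reduced-ops file format noted above ("domain;opset;op1,op2...") can be read per line with a sketch like this. The struct and function names are hypothetical, chosen only to illustrate the three-field layout.

```cpp
#include <sstream>
#include <string>
#include <vector>

// One line of the reduced-ops config: a domain, an opset version,
// and a comma-separated list of op names to keep.
struct OpsetOps {
  std::string domain;
  int opset = 0;
  std::vector<std::string> ops;
};

// Illustrative parser for a line such as "ai.onnx;12;Add,MatMul,Relu".
OpsetOps ParseOpsLine(const std::string& line) {
  OpsetOps result;
  std::istringstream fields(line);
  std::string opset_str, ops_str;
  std::getline(fields, result.domain, ';');  // first field: domain
  std::getline(fields, opset_str, ';');      // second field: opset version
  std::getline(fields, ops_str);             // remainder: op list
  result.opset = std::stoi(opset_str);
  std::istringstream ops(ops_str);
  std::string op;
  while (std::getline(ops, op, ','))
    result.ops.push_back(op);
  return result;
}
```

Keeping the opset in its own field lets the build exclude kernels registered for other opset versions of the same op.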