onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-05-14 20:48:00 +00:00

Author	SHA1	Message	Date
Changming Sun	5bd192c439	Update ContribOperators.md (#7246 )	2021-04-05 17:11:33 -07:00
Thiago Crepaldi	867804bea1	Add auto doc gen for ORTModule API during CI build (#7046 ) In addition to ORTModule auto documentation during packaging, this PR also update golden numbers to fix CI	2021-03-22 10:20:33 -07:00
Xavier Dupré	514444d820	Fix pipeline generating python documentation (#7027 ) Co-authored-by: xavier dupré <xavier.dupre@gmail.com>	2021-03-17 16:57:51 -07:00
Raduan Al-Shedivat	743a93faf3	Fix broken link in server usage and remove absolute path from dockerfiles readme (#6926 )	2021-03-09 11:54:21 -08:00
Edward Chen	b6c4a7ac54	Support required types when excluding typed registrations (#6871 )	2021-03-08 08:22:07 -08:00
Edward Chen	09a5d6a9dc	Update docs/ONNX_Runtime_for_Mobile_Platforms.md with info about op type reduction. (#6747 )	2021-02-23 10:25:23 -08:00
Nat Kershaw (MSFT)	c170061998	Removed BUILD.md from master as source now lives in gh-pages (#6709 )	2021-02-19 11:34:21 -08:00
Olivia Jain	ea3aee4d5f	Bumping up version to 1.7 (#6736 ) * bumping up version to 1.7 * Windows AI should align with ORT Version	2021-02-17 19:07:38 -08:00
Guoyu Wang	6810d98ea3	Update links to gh-pages for ORT minimal documents (#6721 ) * Fix broken link in ort minimal docs * Update link of build.md to gh-pages	2021-02-17 14:34:50 -08:00
Scott McKay	02c7873b0e	Update ORT model conversion script to support custom ops (#6701 ) * Add support for custom ops library to the ORT model conversion script Simplify model conversion now that we read ops from the ORT format model. Enable custom ops in the python bindings if custom ops are turned on in a minimal build. * Add test of model conversion involving custom ops.	2021-02-17 12:52:39 +10:00
Nat Kershaw (MSFT)	af9dfa7a4d	Remove docs that have been migrated to https://onnxruntime.ai/docs (#6225 )	2021-02-05 18:09:27 -08:00
Xavier Dupré	615acf156c	remove keras example from python documentation (#6574 )	2021-02-05 01:10:11 +01:00
Scott McKay	c84bb9df9f	Add ability to track per operator types in reduced build config. (#6428 ) * Add ability to generate configuration that includes required types for individual operators, to allow build size reduction based on that. - Add python bindings for ORT format models - Add script to update bindings and help info - Add parsing of ORT format models - Add ability to enable type reduction to config generation - Update build.py to only allow operator/type reduction via config - simpler to require config to be generated first - can't mix a type aware (ORT format model only) and non-type aware config as that may result in insufficient types being enabled - Add script to create reduced build config - Update CIs	2021-01-29 07:59:51 +10:00
Wenbing Li	69af0440b1	Add the custom op project information (#6334 )	2021-01-20 15:23:24 -08:00
Xavier Dupré	481a2cdf61	Add script to preprocess python documentation before publishing (#6129 ) * add script to preprocessing python documentation before publishing	2021-01-07 19:23:59 +01:00
Edward Chen	d761571afc	Deprecate Python global configuration functions [Part 2] (#6171 ) Update Python API to allow more flexibility for setting providers and provider options. The providers argument (InferenceSession/TrainingSession constructors, InferenceSession.set_providers()) now also accepts a tuple of (name, options dict). Fix get_available_providers() API (and the corresponding function in the C API) to return the providers in default priority order. Now it can be used as a starting point for the providers argument and maintain the default priority order. Convert some usages of the deprecated global configuration functions to use EP-specific options instead. Update some EP-specific option parsing to fail on unknown options. Other clean up.	2021-01-07 10:10:55 -08:00
sfatimar	7347996942	Openvino ep 2021.2 (#6196 ) * Enabling fasterrcnn variant and vehicle detector * changes for 2021_2 branch * yolov3_pytorch commit * fixed braces in basic_backend.cc * ci information added * faster rcnn variant and vehicle detector changes were made in 2021.1 and not in 2021.2 * some changes to support unit tests * disable some tests which are failing * fix myriad tests for vehicle detector * Did some cleanup cleaned up comments Disabled Add_Broadcast_0x1 and Add_Broadcast_1x0 tests on MYRIAD_FP16 backend due to a bug cleaned up capability_2021_2.cc file Removed extra conditions which were added for some validation in backend_utils Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * yolov3 pytorch workaround to ensure that the output names are matched * gemmoptest fixed on myriad * Fixed MYRIADX CPP Test Failures Expand,GatherND,Range,Round op's are only supported in model where op with float input data types are not supported and fixed Scatter and ScatterElements op's with negative axis are fixed Reshape op with 0 dim value are not supported and fixed Disabled InstanceNorm_2 test on MYRIADX Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> make changes to yolov3 pytorch * Fixed python unit tests Fixed failing python tests on vpu, GPU and CPU Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> Fixes POW op failures on GPU_FP16 Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Clean up capability_2021_2.cc Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Updated docx for MultiThreading option Added extra info on setting the num_of_threads option using the API and it's actual usage Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> fixed slice and removed extra prints * Disabled failing python tests Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Minor changes added in capabilty_2021_2 Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * made changes to slice to avoid failures * Disabling FP16 support for GPU_FP32 ->Inferencing an FP16 model on GPU_FP32 leads to accuracy mismatches. so, we would rather use GPU_FP16 to infer an FP16 model on GPU Device Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Updated docx for Inferencing a FP16 Model Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * fix for mask rcnn * Script for installing openvino from source * Updated with openvino 2021.2 online installation * code comment fixes fixed accuracy mismatch for div * Update OpenvinoEP-ExecutionProvider.md updated for 2021.2 branch * Update README.md updated dockerfile documentation * Update BUILD.md build.md update documentation * permissiong change of install_openvino.sh * made changes to align with microsoft onnxruntime changes * Updated with ov 2021.2.200 Co-authored-by: suryasidd <surya.siddharth.pemmaraju@intel.com> Co-authored-by: sfatimar <sahar.fatima@intel/com> Co-authored-by: MaajidKhan <n.maajidkhan@gmail.com> Co-authored-by: mohdansx <mohdx.ansari@intel.com>	2020-12-23 08:47:22 -08:00
Pranav Sharma	86493e6d0c	Update documentation for contributing a PR and add deprecation notices for PyOp and ORT server. (#6172 )	2020-12-18 02:00:42 -08:00
Jay Rodge	dec703b62d	Update TensorRT-ExecutionProvider.md (#6161 )	2020-12-17 17:10:40 -08:00
RandySheriffH	404982ded5	Enable varied input type for custom op (#6066 ) * allow custom op taking varied types * refactor test case * add test model * refactor test case * enable copy elision * update test case * fix issue in ToString function	2020-12-09 15:10:42 -08:00
Du Li	3e81711a13	Update version to 1.6.0 (#6041 ) * Update version to 1.6.0 * Add v 1.5.3 info * Updating WindowsAI and ONNX version Co-authored-by: Du Li <duli@OrtTrainingDev0.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2020-12-08 11:09:51 -08:00
Hariharan Seshadri	a046ef133a	Update api_summary.rst (#6038 )	2020-12-04 17:59:56 -08:00
Scott McKay	30c7fffbab	Expand the documentation on using compiling EPs with a minimal build (#5893 ) * Expand the documentation on using compiling EPs with a minimal build to call out a 'simple' option that is easier to use. Provide more background on what happens to help users choose the best option for them. Tweak conversion script to be noisier about attempted usage of 'all' optimization level. Co-authored-by: manashgoswami <magoswam@microsoft.com>	2020-12-02 09:12:36 +10:00
Changming Sun	5fdd9f0fd2	Fix Python Linux GPU package name (#5943 ) Fix Python Linux GPU package name. I accidentally added "noopenmp" to it.	2020-11-25 17:46:11 -08:00
sfatimar	8168c91978	Sahar/fix documentation shared lib (#5926 ) * Update OpenVINO-ExecutionProvider.Md update openvino-executionprovider.md for shared library * Update Build.md updated --build_shared_lib flag for building openvino shared provider lib * Update Dockerfile.openvino building for shared library with the new changes for openvino shared lib * Revert "Update Build.md" This reverts commit c9cf5fee76be7fdc10cadf07259f1d4ed5b45b93. * Revert "Update Dockerfile.openvino " This reverts commit e1624e4f93a4cfb425b6f21d7fb71b299a146740. * Update OpenVINO-ExecutionProvider.md fix documentation to the shared library Co-authored-by: sfatimar <sahar.fatima@intel/com>	2020-11-25 08:50:01 -08:00
Scott McKay	3970eb2e5d	Add documentation on enabling/using NNAPI in a minimal build (#5879 ) * Add initial documentation on using NNAPI with a minimal build * minor clarification * Add note on avoiding local full build * Address a couple of PR comments	2020-11-21 09:00:24 +10:00
stevenlix	1068f3eb87	Use flatbuffers for INT8 calibration table (de)serialization in TensorRT EP (#5873 ) * add int8 * support both native TRT cal table and ORT cal table * add more comments * Update env variable name and check platform availability for int8/fp16 * add backward compatibility on old env var ORT_TENSORRT_ENGINE_CACHE_PATH and switch to flatbuffers for ort cal table deserialization	2020-11-19 21:41:12 -08:00
stevenlix	dfea92925c	Add calibration based INT8 quantization to TensorRT EP (#5842 ) * add int8 * support both native TRT cal table and ORT cal table * add more comments * Update env variable name and check platform availability for int8/fp16	2020-11-19 17:10:49 -08:00
S. Manohar Karlapalem	ff58f621fa	Remove nGraph Execution Provider (#5858 ) * Remove nGraph Execution Provider Pursuant to nGraph deprecation notice: https://github.com/microsoft/onnxruntime/blob/master/docs/execution_providers/nGraph-ExecutionProvider.md#deprecation-notice Deprecation Notice \| \| \| \| --- \| --- \| \| Deprecation Begins \| June 1, 2020 \| \| Removal Date \| December 1, 2020 \| Starting with the OpenVINO™ toolkit 2020.2 release, all of the features previously available through nGraph have been merged into the OpenVINO™ toolkit. As a result, all the features previously available through ONNX RT Execution Provider for nGraph have been merged with ONNX RT Execution Provider for OpenVINO™ toolkit. Therefore, ONNX RT Execution Provider for nGraph will be deprecated starting June 1, 2020 and will be completely removed on December 1, 2020. Users are recommended to migrate to the ONNX RT Execution Provider for OpenVINO™ toolkit as the unified solution for all AI inferencing on Intel® hardware. * Remove nGraph Licence info from ThirdPartyNotices.txt * Use simple Test.Run() for tests without EP exclusions To be consistent with rest of test code. * Remove nGraph EP functions from Java code	2020-11-19 16:47:55 -08:00
Pranav Sharma	c2a993e745	Add documentation for OrtArenaCfg for CreateAndRegisterAllocator API. (#5831 ) * Add documentation for OrtArenaCfg for CreateAndRegisterAllocator API. * Address PR comments * More comments	2020-11-18 10:21:20 -08:00
Justin Stoecker	bd236ecc26	Switch to unified DirectML 1.4.0 redistributable (#5794 ) Transitions from the ORT-only DML NuGet (hosted on the onnxruntime_public feed) to the new unified DirectML NuGet (Microsoft.AI.DirectML) on nuget.org. In addition, the Microsoft.AI.MachineLearning (WinML) and Microsoft.ML.OnnxRuntime.DirectML packages now take a dependency on the Microsoft.AI.DirectML package. This means we can remove the extra copy of DML binaries in these packages since they will be installed by the DML package.	2020-11-17 13:42:23 -08:00
RandySheriffH	20ae1ea21f	Remerge custom gpu op (#5818 ) * add case for cpu custom op on gpu * format doc * restrict GPU custom op on Linux GPU CI only * separate cu file to a independent project * fix typo * include cuda_add lib * move lib def * add file header Co-authored-by: RandySheriffH <rashuai@microsoft.com>	2020-11-16 09:27:46 -08:00
alexzakv	44d3c31200	Winml_principles_change (#5727 ) * Contributing page change * Update WinML_principles.md * Update WinML_principles.md * Update WinML_principles.md * Updated * Update WinML_principles.md * Update WinML_principles.md * Update WinML_principles.md * Update WinML_principles.md	2020-11-12 10:39:24 -08:00
stevenlix	54de618c2e	Improve TensorRT engine caching (#5737 ) * add profile caching to improve engine caching feature * Add comments * fix typo * add decryption for engine caching * Update tensorrt_execution_provider.cc * Update tensorrt_execution_provider.cc * Update tensorrt_execution_provider.cc * Update tensorrt_execution_provider.cc * Update tensorrt_execution_provider.cc * update onnx-tensorrt submodule * set opt profile to max value of the range * add hash to engine/profile name * Add calibration based INT8 quantization * add an option to enable both FP16 and INT8 * Update tensorrt_execution_provider.cc * add env variable to specify calibration file name * clean up code * Add comments and update TRT document * enable tensorrt basic test and add EngineCachingTest * clean up * update envrionment variable in the test * clean up	2020-11-12 08:56:45 -08:00
Maajid khan	a84a058f9e	[OpenVINO-EP] Enabling Multi Device support (#5740 ) * Enabling Multi Device support for UEP Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Minor fix added *Added a simple fix to determine OpenVINO version for Arm build as well Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com>	2020-11-11 15:16:30 -08:00
Tim Harris	5e44d25c5a	Support multi-loop parallel sections, use multi-loop sections in GRU (#5602 ) This PR updates the ThreadPool API to support multi-loop parallel sections. As with the OpenMP "parallel" construct, this allows per-loop work to be amortized over a series of loops. For ORT, it also promotes locality between successive loops in the sense that iteration X of one loop will tend to run on the same worker thread as iteration X of preceding loops. The change was developed while optimizing the implementation of a model that performed better with OpenMP. Profiling indicated that OpenMP was providing lower loop entry/exit costs and that, via OpenMP's static scheduling, it was leading to a lower L2 miss rate in the series of parallel loops used in GRU. The main changes are: - Addition of ThreadPool::ParallelSection and underlying support in the modified Eigen thread pool. - In EigenNonBlockingThreadPool.h, refactoring the RunInParallel method to support two variants: one that takes an existing parallel section object created by the caller, and another (used by default) that creates its own parallel section. - Simplify ThreadPool::LoopCounter (used by worker threads to claim loop iterations), basing it an ID supplied by the underlying Eigen thread pool for affinity in a series of loops. - Fix a possible perf issue where a loop with iterations scheduled in batches would have more threads than batches available. - Use of parallel sections in the GRU operator. - Additional test cases in threadpool_test.h. - Additional comments at the top of threadpool.h and EigenNonBlockingThreadPool.h.	2020-11-10 12:24:57 +00:00
Johannes Bannhofer	6f6dd0b869	added missing flag ORT_TENSORRT_DUMP_SUBGRAPHS (#5724 ) [DOCUMENTATION] added descriptionof the function ORT_TENSORRT_DUMP_SUBGRAPHS to the documentation	2020-11-06 12:32:12 -08:00
Dmitri Smirnov	830f567be8	Add C API Guidelines document (#5686 ) Add C API Guidelines document Signed-off-by: Dmitri Smirnov <dmitrism@microsoft.com>	2020-11-04 18:50:31 -08:00
alexzakv	8bae883d3e	User/alexzak/win ml principles (#5453 ) * Contributing page change * Update WinML_principles.md * Update WinML_principles.md * Update WinML_principles.md * Updated * Update WinML_principles.md * Update WinML_principles.md * Update WinML_principles.md	2020-11-04 13:35:40 -08:00
Maajid khan	d98062da0c	[OpenVINO-EP] Hetero support (#5627 ) * Implement Hetero in UEP * Added security checks to take valid Hetero combinations as device type * Integrating Hetero features * Get the statistics Report in Debug Mode Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Passing right device type for vadm_baackend Added simple fix to pick the right device type when using vadm_backend with Hetero as well. Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Fixed batching logic for 2020.4 and above * Fixed flake8 PEP8 errors Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Minor Fixes Added Added security checks for device_type passed in for Hetero build during run time code cleanup Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Minor changes Added Fixed batch_size bug in vadm_backend code cleanup *Documentation updated for Hetero Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> Co-authored-by: suryasidd <surya.siddharth.pemmaraju@intel.com>	2020-10-30 22:35:08 -07:00
Changming Sun	d9293f38e6	Revert "Custom Op on GPU (#5620 )" This reverts commit `2c63196600`.	2020-10-30 21:23:51 -07:00
RandySheriffH	2c63196600	Custom Op on GPU (#5620 ) * add case for cpu custom op on gpu * format doc * restrict GPU custom op on Linux GPU CI only * separate cu file to a independent project * fix typo Co-authored-by: RandySheriffH <rashuai@microsoft.com>	2020-10-30 12:25:44 -07:00
Tim Harris	5e8952ef89	ThreadPool clean up : mm_pause in loops, correctly spin-then-wait, and adopt static methods consistently in the API (#5590 ) Description: This change makes three changes to the ThreadPool class to clean up issues identified during performance analysis and optimization. (1) It uses mm_pause intrinsics in spin loops, helping avoid consuming pipeline resources while waiting. (2) It re-organizes the spin-then-steal loop for work distribution to start out spinning as intended, rather than to start out trying to steal. (3) It updates the ThreadPool class's API to be consistent in the use of static methods for public functions. The PR includes minor doc updates and corresponding changes to test cases. Motivation and Context The change helps ensure consistency in behavior between the OpenMP and Eigen-based implementations. Unlike the instance methods, the static methods abstract over the different ways in which threading can be implemented; they will map onto the OpenMP or Eigen-based implementations when threading is used. When threading is not used they will run work sequentially.	2020-10-28 09:49:18 +00:00
Maajid khan	ddf83d1ace	Maajid/multi threading 2 (#5568 ) * Enabled multi-threading for OpenVino EP ->Enabled support for concurrent_session_runs Run UEP using concurrent_session_runs > 1 Enabled support for ORT_PARALLEL ExecutionMode ->Documentation Added for Enabling MultiThreading Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Minor Fixes added Configure the value of nireq during Runtime Documentation typos rectified and details added for Multi_Threaded Inference Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Some checks added for this fix Added checks to invalidate wrong nireq value and assigned it to default value of 8 Added new config options for enable_vpu_fast_compile which were changed w.r.t OpenVINO_2021.1 Release Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com>	2020-10-27 14:48:12 -07:00
Scott McKay	6d35be215f	Add `--skip_tests` to example command line as the included ops are being reduced. (#5554 )	2020-10-22 08:55:42 +10:00
Hariharan Seshadri	4291c57322	[C# and Python APIs] Expose knobs to enable/disable platform telemetry collection (#5481 )	2020-10-21 10:32:13 -07:00
Scott McKay	a3d2bc36be	Fix script name in doco (#5530 )	2020-10-20 06:42:53 +10:00
Thien Bui	6ad70d7371	[Doc] ONNX_Runtime_Server_Usage fix proto uri (#5345 ) The predict proto should be `../server/protobuf/prediction_service.proto` instead of `../onnxruntime/server/protobuf/prediction_service.proto`	2020-10-19 13:30:58 -07:00
Olivia Jain	1e4b259d28	Updating EP docs with Onnxruntime API calls (#5503 ) * updating examples with current api calls * Fixing capitalization in api calls, adding RKNPU update * Correcting nuphar and rknpu ep api calls * Include creating session in readme	2020-10-19 12:21:21 -07:00
sfatimar	6d2a30eae3	[OPENVINO-EP] 2021.1 Release (#5431 ) * Cmake changes for 2021.1 * added new ov version 2020.1 for faster rcnn * Added missing defs * equal op modified * changes to incoroporate faster rcnn * backend util.cc * hddl_plugin_config.hpp is depreceated . instead use hddl_config.hpp * changing myriad precision bool to i32 * gather is not enabled for gpu * conv2D and pooltest auto_pad attribute should not be null * negative indices are not valid for scatter op in myriad * non max suppression op only supported in faster rcnn mode * maxpool indices output is not supported * Cleaned redundant code in backends * Added ifdefs for HDDL config * cast output dimensions check topk operator k input it seems only resolved for myriad as it is throwing issues for ask rcnn . need to verify * we are limiting the subgraph size to 3 here * taking care of review comments * Fixed minor bugs * Modified Slice op checks * Added NonZero, Upsample * Removed TopK if it's in the middle of a subgraph * incorporated upsample conditions too * Dockerfile changes for 2021.1 release * dockerfile aptkey update * Minor fixes * ceil condition added again * Fixed few gpu models * Disabled LSTM and yolov3 in ModelTests * python softmax cross entropy tests and negative log likelihood * Update Build.md Updated for openvino 2021.1 * Update OpenVINO-ExecutionProvider.md update openvino execution provider for 2021.1 * Update READMe.md updated new openvino version * Update Dockerfile.openvino added environment variable for DEBIAN Frontend * Fixed myriad models * Fixed gather condition * Fixed mask rcnn model on myriad * Modified Gather condition * set default target of MCR dockerfile to MYRIAD_FP16 * Fixed tinyolov3 on CPU * Update OpenVINO-ExecutionProvider.md update openvino execution provider documentation * Update Dockerfile.openvino Removed environment variable * Update OpenVINO-ExecutionProvider.md update image manipulation networks supported * Update onnx_backend_test_series_filters.jsonc removed test_upsample_nearest from cpu test cases * New InternalCI changes for 2021.1 * Full protobuf removed for OpenVINO * Protobuf added * Updated with apt installation for openvino * Revert the testing changes * Reverted testing changes * File permessions are changed to original * Deleted openvino installation and cmake change * Optimized Dockerfile Removed unnecessary cmake installation, numpy * Added missing ifdefs * delete array fix * backend_utils.cc output_shape * Revert "set default target of MCR dockerfile to MYRIAD_FP16" This reverts commit 928d3e2b71e2f589cf51dacd3a133951cf9ca18d. Co-authored-by: suryasidd <surya.siddharth.pemmaraju@intel.com> Co-authored-by: sfatimar <sahar.fatima@intel/com> Co-authored-by: suryasidd <48925384+suryasidd@users.noreply.github.com> Co-authored-by: S. Manohar Karlapalem <manohar.karlapalem@intel.com> Co-authored-by: Aravind <aravindx.gunda@intel.com> Co-authored-by: Aravind Gunda <38353114+gundaarx@users.noreply.github.com>	2020-10-14 15:56:00 -07:00

1 2 3 4 5 ...

282 commits