onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-06-08 00:23:03 +00:00

Author	SHA1	Message	Date
Chi Lo	9f526f45ac	TensorRT Perf Tool (#4900 ) * Initialize tensorrt perf script * Add bert-squad dependencies * Modified code to make ort inference with CUDA/Tensorrt * Add get CUDA/TRT version * uncomment bert-squad * Add BERT-SQUAD inputs.json * Add FastRCNN * Make preprocess/validation in to common functions * Add MaskRCNN and SSD and consolidate the code * Add dependencies for MaskRCNN * following modifications are made: - create common fetch function to get inputs/outputs of model from ONNX model zoo. - create common validation function to compare inference outputs with reference outputs from ONNX model zoo. - move run/repeat time to argument list. (still working on other arguments, like fp16 or fp32, latency percentile). - generate table in csv file to show the latency comparison (TRT vs CUDA) side by side. * Add approache to analyze profling file and also update model related settings * Add models * Add most of models from ONNX model zoo * Add model input name and print all the model names at the end of run * Add system info * Add TRT fp16 support * Refine the code * Handle TRT fall back and modify the way to get input data * Refine code * Modify code * Add more precise approach to measure inference * Add io-binding * Add YoLoV4 * Refine the code * Refine the code * Add models * Add yolov4 notebook for jetson device * Update notebook * Update notebook * Add CVS models * Add missing model * Add support of float16 * Add new way to get trt version * Add "validate" and "benchmark" mode * Add randomly generated input * Refine perf script * Refine the code. * Add README * Refine the code * Update README.md * Refine code * Update README.md * Remove all the model related python and instead using model_list.json as models configuration. Refine the benchmark.py * Refine the code Co-authored-by: Chi Lo <lochi@microsoft.com>	2020-09-15 10:06:01 -07:00
Changming Sun	ef496d36ea	Build: Add missing EXCLUDE_FROM_ALL to ONNX submodule (#5161 ) Avoid building unnecessary things	2020-09-15 09:22:09 -07:00
Wenbing Li	de6e3fb61d	Reduce IOS shared library size by symbol file. (#5171 )	2020-09-14 23:59:41 -07:00
Ryan Hill	8fa427b264	Ryanunderhill/backout 5014 (#5167 ) * Revert 5014	2020-09-14 22:48:00 -07:00
Scott McKay	089789c135	Revert change to disable support for loading ORT format models in the packaging pipelines. (#5168 )	2020-09-15 15:11:06 +10:00
Sheil Kumar	c0d7c8bc44	Add docs indicating that the onnxruntime engine from other distributions can be compatible with the WinRT NuGet (#5009 ) * add docs for mix and matching * typos Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2020-09-14 21:15:51 -07:00
RandySheriffH	1dde215d96	promote cuda version on packaging pipelines (#5154 ) * promote cuda version on packaging pipelines * fix cudnn version in py packaging template Co-authored-by: Randy Shuai <rashuai@microsoft.com>	2020-09-14 21:09:09 -07:00
Yufeng Li	3068a835f1	Fix quantization of 1-D conv with bias (#5157 )	2020-09-14 18:07:14 -07:00
Andrei Shadrikov	82b25e1731	Fix datasize call in calibrate (#5110 ) * Moving datasize to the interface. * Reverting changes and addressing the comment	2020-09-14 18:06:23 -07:00
S. Manohar Karlapalem	f7edf0aa57	[OpenVINO-EP] Enable EP config options for VPU hardware (#5119 ) * Added config flags for VPU Fast Recompile * clean-up ifdefs * Add VPU Fast compile config option Adds an option that enables Fast compilation of models to VPU hardware specific format. * Add config option to choose specific device id for inference Inference of all subgraphs will be scheduled only on this device even if other devices of the same type are available. * Add Python API to list available device IDs * code cleanup * Add second C/C++ API with settings string parameter Adds an additional C/C++ API that allows passing multiple key-value pairs for settings as a single string. Multiple settings are delimited by '\n' while the key and value within a setting are delimited by '\|'. * Append 'Ex' to the extended C/C++ API * Use set_providers Py API to set config options. Uses Session.set_providers Python API to set EP runtime config options as key/val pairs Deprecated older module function definitions for config settings. Updates documentation. * avoid globals for py config options where possible Co-authored-by: intel <you@example.com>	2020-09-14 15:46:14 -07:00
Zhang Lei	d45e49dd2b	Add LeakyRelu and Sigmoid QLinear Quantization support (#5116 ) * Add LeakyRelu and Sigmoid QLinear Quantization support * Change due to reflect master changes.	2020-09-14 14:46:24 -07:00
Changming Sun	8946d212bf	Remove the dependency on CUDA SDk's version.txt (#5155 )	2020-09-14 14:25:28 -07:00
Yufeng Li	20b2f45b24	Support per-channel quantization of weight tensor (#5057 ) * Support per-channel quantization of weight tensor * rename util functions * fix bugs in calibrate * add support of reduce_range * refine opset check	2020-09-14 11:53:50 -07:00
Wenbing Li	2a456d16c0	Enable onnxruntime iOS shared library build. (#5148 )	2020-09-14 10:32:39 -07:00
ashbhandare	cc3212f9d5	Add fp16 pow kernel (#5016 ) * Add fp16 pow kernel * Fix test added for non-cuda runs	2020-09-14 10:01:39 -07:00
Moshe David	1d6a21fd08	[TensorRT] Add slightly faster hash computation for `vector<int>` (#5142 ) * w * w Co-authored-by: modav <modav@microsoft.com>	2020-09-14 09:01:59 -07:00
sfatimar	0c7e9fb52a	changes to ensure compilation issues in windows is fixed by disabling the level 3 warning 4267 (#5147 ) while a more permanent fix is found Co-authored-by: sfatimar <sahar.fatima@intel/com>	2020-09-14 08:59:41 -07:00
RandySheriffH	9392aa2f64	Promote Cuda version to 10.2 for windows pipelines (#5138 )	2020-09-13 20:32:06 -07:00
Guoyu Wang	6fcd99f6ed	Some minor updates for ORT mobile (#5146 ) * Minor update ios build instructions and other comments * Create shared string for nodearg name	2020-09-13 17:28:12 -07:00
Guoyu Wang	e5892dd6f3	Change the version check of ort format save/load (#5140 ) * change the version check of ort format save/load * Address PR comments, update the unit test ort models * Update some variable names to code convention * Move IsOrtModelVersionSupported inside of #if defined(ENABLE_ORT_FORMAT_LOAD)	2020-09-11 23:11:31 -07:00
Scott McKay	323a1ba8a4	Add option to exclude support for loading ORT format models in full build. (#5129 ) * Add ability to exclude support for loading ORT format models. Disable support for ORT format models in packages	2020-09-12 12:21:30 +10:00
Ye Wang	5302fe4079	A fix in load_pretrained_model() (#5137 ) * Fix in load_pretrained_model * Update onnx_exporter.py	2020-09-11 17:23:02 -07:00
Hariharan Seshadri	849bb8653f	Exclude a few python tests for the DML EP (#5135 )	2020-09-11 16:40:02 -07:00
Guoyu Wang	698eccf15e	Add iOS build instruction (#5125 ) * ios build instruction * fix logger issue in onnx_model_info * Revert "fix logger issue in onnx_model_info" This reverts commit 72f2b88256ccf29c75fefbcd1daf6b4dcf7e0c61. * Address comments and fix small issue in iOS build	2020-09-11 16:10:36 -07:00
Tianlei Wu	7511021e0e	Save Gpt2 test data (#5132 ) (1) Save gpt2 test data during test generation. (2) Use torch fp32 model as baseline when onnx model is fp16. (3) Refine logic to compose onnx model path	2020-09-11 14:31:49 -07:00
RandySheriffH	120e3cda74	fix path (#5131 ) Co-authored-by: Randy Shuai <rashuai@microsoft.com>	2020-09-11 12:18:07 -07:00
Rayan-Krishnan	92a8c650ad	[Debuggability] Add feature to ORTTrainer Frontend (#5124 ) * add option, feature to orttrainer and test * address comments * minor fixes * further address comments * minor changes Co-authored-by: Rayan Krishnan <t-rakr@OrtDevTest2v100.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2020-09-11 12:16:07 -07:00
Ye Wang	89509f256a	Not fuse SkipLayerNorm when add has initializer input (#5123 )	2020-09-11 11:46:31 -07:00
Ashwini Khade	cd56ab197c	csharp build documentation (#5121 )	2020-09-11 11:46:10 -07:00
dependabot[bot]	15d431f39b	Bump node-fetch from 2.6.0 to 2.6.1 in /nodejs Bumps [node-fetch](https://github.com/bitinn/node-fetch) from 2.6.0 to 2.6.1. - [Release notes](https://github.com/bitinn/node-fetch/releases) - [Changelog](https://github.com/node-fetch/node-fetch/blob/master/docs/CHANGELOG.md) - [Commits](https://github.com/bitinn/node-fetch/compare/v2.6.0...v2.6.1) Signed-off-by: dependabot[bot] <support@github.com>	2020-09-11 11:45:37 -07:00
Tianlei Wu	ccfbc56388	Handle dummy mask in Attention operators (#5108 ) * Handle dummy mask with shape (1, 1) or (batch_size, 1).	2020-09-11 09:31:03 -07:00
stevenlix	c794c88ae0	Solve name conflict in TensorRT engine caching (#5128 ) * fix hash conflict * Add verbose for engine deserialization and destroy old engine memory if new engine is generated * update parser * Update tensorrt_execution_provider.cc * use a better hash algorithm * Update tensorrt_execution_provider.cc	2020-09-11 09:12:56 -07:00
Guoyu Wang	51f3d3af72	Enable onnxruntime_perf_test for ORT minimal build (#5126 ) * Enable onnxruntime_perf_test for ort minimal build * Add error message Co-authored-by: gwang0000 <62914304+gwang0000@users.noreply.github.com>	2020-09-11 01:58:11 -07:00
Scott McKay	59ee8ffb17	Remove SparseTensor support from minimal build. (#5114 ) * Remove SparseTensor support from minimal build. Currently the only valid usage of a SparseTensor is as an attribute of a Constant node. That would have been lifted to a dense tensor initializer when loading the onnx model, so would not exist when saving the ORT format model. Due to that there can be no SparseTensors in an ORT format model. Co-authored-by: gwang <wanggy@outlook.com>	2020-09-11 17:56:54 +10:00
Ye Wang	879751f3b7	Support Tensorflow benchmarking and onnx export in transformers tool (#5068 ) * init checkin for tf export and tf benchmark * small fix on argparse * refactor * review comments * review comments	2020-09-11 00:47:37 -07:00
Changming Sun	c5efb0085d	Update Linux GPU build pipelines to CUDA 10.2 (#5120 ) * Update Linux GPU build pipelines to CUDA 10.2	2020-09-10 17:40:51 -07:00
Ashwini Khade	a8557b3f0f	skip tests when model opset > released opset (#5096 ) * skip tests when model opset > released opset * remove multiple model load * nit fixes * plus some comments	2020-09-10 17:25:32 -07:00
Hariharan Seshadri	782ccff207	Add dll probe path so that the right DirectML.dll is loaded while running C# tests (#5104 )	2020-09-10 16:19:21 -07:00
Wei-Sheng Chin	5618b9dddc	Use CMake built-in function to compare NCCL version (#5118 ) * Use CMake built-in function to compare version * Address comment	2020-09-10 15:59:47 -07:00
Tianlei Wu	c5d4ae0401	Add transformers tools to python package (#5090 ) * Add transformers to onnxruntime python package	2020-09-10 15:42:15 -07:00
Moshe David	61051396e8	[TensorRT] Align naming convention and remove redundant code (#5094 )	2020-09-10 15:03:34 -07:00
Scott McKay	fae5915d76	CMake fixes/tweaks for minimal builds and MinSizeRel builds (#5112 ) * Fix places where MinSizeRel wasn't having relevant flags added in the same way as Release and RelWithDebInfo Enable LTO for minimal build. Cleanups onnx_minimal.cmake to remove some things handled when LTO is enabled in CMakeLists.txt * Only enable LTO for MSVC in a minimal build	2020-09-11 06:50:28 +10:00
Changming Sun	a5530358c9	Fix a path problem in Dockerfile.manylinux2014_cuda10_2 (#5106 )	2020-09-10 10:30:13 -07:00
Changming Sun	47554a0422	Disable some tests (#5103 )	2020-09-10 08:15:18 -07:00
Ryan Hill	3207de276c	Remove IDeviceAllocator class as it doesn't extend IAllocator in any way. (#5067 )	2020-09-10 00:46:35 -07:00
Guoyu Wang	5b6643cefb	Move ort flatbuffers header to use enum class instead of enum (#5105 ) * change fbs to scoped enum * modify ort code to use new fbs header Co-authored-by: gwang0000 <62914304+gwang0000@users.noreply.github.com>	2020-09-10 17:17:49 +10:00
Guoyu Wang	433061531e	Enable onnx_test_runner for ort format (#5100 ) * Enable onnx_test_runner using ort format, for ort minimal build only Co-authored-by: gwang0000 <62914304+gwang0000@users.noreply.github.com>	2020-09-10 17:15:19 +10:00
Tiago Koji Castro Shibata	62848c4de5	Add store builds to nuget packaging (#5040 ) * Nuget store packaging * Move DNNL workaround to EP * Fix warning as error * Disable store tests * Skip store tests * msbuild target * Cross compile protoc in Store * Disable DML in store * Move store builds to CPU queue * Copy uap10 to final nuget * Fix pip8 error * Remove extra dml copies * Fix argparse * pep8 * Forward IsStoreBuild * Apply is_store_build to duplicate generate_nuspec * runtimes * Refactor uap10 * Store .NET * uap * PR feedback	2020-09-09 21:38:14 -07:00
Wei-Sheng Chin	9ba56dcfed	Support Send and Recv for old NCCL versions (#5097 ) If NCCL version < 2.7, MPI is used. Otherwise, we use NCCL Send and Recv.	2020-09-09 20:58:05 -07:00
Changming Sun	09a6ce6bc0	Add re2 to memory leak checker whitelist (#5101 ) * Add re2 to memory leak checker whitelist	2020-09-09 20:08:37 -07:00


3371 commits