Commit graph

3377 commits

Author SHA1 Message Date
Tiago Koji Castro Shibata
b523fa08bc Use onecore umbrella lib in onecore builds (#5182)
* delayload hack

* Skip tests

* Onecore uses onecore umbrella

* Uncomment tests

* cleanup

* Disable dev mode for WinML
2020-09-17 08:39:47 -07:00
Chun-Wei Chen
393ff2f434 Add GetStartTime() for profiler to get private profiling_start_time_ (#4994)
* add GetStartTime() for profiler

* add function in inference_session

* remove qualified name

* add the api in cxx_api.h

* rename starttime to StartTimeNs, expost profiling object

* rename GetProfilingStartTime

* move Ortapis to the right place

* move to the end

* add const for session

* const the right place

* use const auto instead of const auto* for session

* remove const for auto getstarttime

* remove const for auto getstarttime

add unit tests

* nit: update test name and add comments
2020-09-17 08:39:47 -07:00
edgchen1
5d3c962481 Install ssh in builder image, fix segfault in TrainingRunnerTest.Basic. (#5186) 2020-09-17 08:39:47 -07:00
Bowen Bao
53d8779dbc Improve error message for FE model export checking (#5156) 2020-09-17 08:39:47 -07:00
Changming Sun
a0a435abc6
Add sympy==1.1.1 to Linux docker image (#5177) 2020-09-15 16:08:49 -07:00
Tianlei Wu
0752fd7425
change version number from 1.4.0 to 1.5.0 (#5178) 2020-09-15 15:50:25 -07:00
Chi Lo
9f526f45ac
TensorRT Perf Tool (#4900)
* Initialize tensorrt perf script

* Add bert-squad dependencies

* Modified code to make ort inference with CUDA/Tensorrt

* Add get CUDA/TRT version

* uncomment bert-squad

* Add BERT-SQUAD inputs.json

* Add FastRCNN

* Make preprocess/validation in to common functions

* Add MaskRCNN and SSD and consolidate the code

* Add dependencies for MaskRCNN

* following modifications are made:
    - create common fetch function to get inputs/outputs of model from ONNX model zoo.
    - create common validation function to compare inference outputs with reference outputs from ONNX model zoo.
    - move run/repeat time to argument list. (still working on other arguments, like fp16 or fp32, latency percentile).
    - generate table in csv file to show the latency comparison (TRT vs CUDA) side by side.

* Add approache to analyze profling file and also update model related
settings

* Add models

* Add most of models from ONNX model zoo

* Add model input name and print all the model names at the end of run

* Add system info

* Add TRT fp16 support

* Refine the code

* Handle TRT fall back and modify the way to get input data

* Refine code

* Modify code

* Add more precise approach to measure inference

* Add io-binding

* Add YoLoV4

* Refine the code

* Refine the code

* Add models

* Add yolov4 notebook for jetson device

* Update notebook

* Update notebook

* Add CVS models

* Add missing model

* Add support of float16

* Add new way to get trt version

* Add "validate" and "benchmark" mode

* Add randomly generated input

* Refine perf script

* Refine the code.

* Add README

* Refine the code

* Update README.md

* Refine code

* Update README.md

* Remove all the model related python and instead using model_list.json as
models configuration.

Refine the benchmark.py

* Refine the code

Co-authored-by: Chi Lo <lochi@microsoft.com>
2020-09-15 10:06:01 -07:00
Changming Sun
ef496d36ea
Build: Add missing EXCLUDE_FROM_ALL to ONNX submodule (#5161)
Avoid building unnecessary things
2020-09-15 09:22:09 -07:00
Wenbing Li
de6e3fb61d
Reduce IOS shared library size by symbol file. (#5171) 2020-09-14 23:59:41 -07:00
Ryan Hill
8fa427b264
Ryanunderhill/backout 5014 (#5167)
* Revert 5014
2020-09-14 22:48:00 -07:00
Scott McKay
089789c135
Revert change to disable support for loading ORT format models in the packaging pipelines. (#5168) 2020-09-15 15:11:06 +10:00
Sheil Kumar
c0d7c8bc44
Add docs indicating that the onnxruntime engine from other distributions can be compatible with the WinRT NuGet (#5009)
* add docs for mix and matching

* typos

Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2020-09-14 21:15:51 -07:00
RandySheriffH
1dde215d96
promote cuda version on packacking pipelines (#5154)
* promote cuda version on packacking pipelines

* fix cudnn version in py packaing template

Co-authored-by: Randy Shuai <rashuai@microsoft.com>
2020-09-14 21:09:09 -07:00
Yufeng Li
3068a835f1
Fix quantization of 1-D conv with bias (#5157) 2020-09-14 18:07:14 -07:00
Andrei Shadrikov
82b25e1731
Fix datasize call in calibrate (#5110)
* Moving datasize to the interface.

* Reverting changes and adressing the comment
2020-09-14 18:06:23 -07:00
S. Manohar Karlapalem
f7edf0aa57
[OpenVINO-EP] Enable EP config options for VPU hardware (#5119)
* Added config flags for VPU Fast Recompile

* clean-up ifdefs

* Add VPU Fast compile config option

Adds an option that enables Fast compilation of models to VPU
hardware specific format.

* Add config option to choose specific device id for inference

Inference of all subgraphs will be scheduled only on this device
even if other devices of the same type are available.

* Add Python API to list available device IDs

* code cleanup

* Add second C/C++ API with settings string parameter

Adds an additional C/C++ API that allows passing multiple
key-value pairs for settings as a single string. Multiple
settings are delimited by '\n' while the key and value
within a setting are delimited by '|'.

* Append 'Ex' to the extended C/C++ API

* Use set_providers Py API to set config options.

Uses Session.set_providers Python API to set EP runtime config
options as key/val pairs
Deprecated older module function definitions for config settings.
Updates documentation.

* avoid globals for py config options where possible

Co-authored-by: intel <you@example.com>
2020-09-14 15:46:14 -07:00
Zhang Lei
d45e49dd2b
Add LeakyRelu and Sigmoid QLinear Quantization support (#5116)
* Add LeakyRelu and Sigmoid QLinear Quantization support

* Change due to reflect master changes.
2020-09-14 14:46:24 -07:00
Changming Sun
8946d212bf
Remove the dependency on CUDA SDk's version.txt (#5155) 2020-09-14 14:25:28 -07:00
Yufeng Li
20b2f45b24
Support per-channel quantization of weight tensor (#5057)
* Support per-channel quantization of weight tensor

* rename util functions

* fix bugs in calibrate

* add support of reduce_range

* refine opset check
2020-09-14 11:53:50 -07:00
Wenbing Li
2a456d16c0
Enable onnxruntime iOS shared library build. (#5148) 2020-09-14 10:32:39 -07:00
ashbhandare
cc3212f9d5
Add fp16 pow kernel (#5016)
* Add fp16 pow kernel

* Fix test added for non-cuda runs
2020-09-14 10:01:39 -07:00
Moshe David
1d6a21fd08
[TensorRT] Add slightly faster hash computation for vector<int> (#5142)
* w

* w

Co-authored-by: modav <modav@microsoft.com>
2020-09-14 09:01:59 -07:00
sfatimar
0c7e9fb52a
changes to ensure compilation issues in windows is fixed by disabling the level 3 warning 4267 (#5147)
while a more permanent fix is found

Co-authored-by: sfatimar <sahar.fatima@intel/com>
2020-09-14 08:59:41 -07:00
RandySheriffH
9392aa2f64
Promote Cuda version to 10.2 for windows pipelines (#5138) 2020-09-13 20:32:06 -07:00
Guoyu Wang
6fcd99f6ed
Some minor updates for ORT mobile (#5146)
* Minor update ios build instructions and other comments

* Create shared string for nodearg name
2020-09-13 17:28:12 -07:00
Guoyu Wang
e5892dd6f3
Change the version check of ort format save/load (#5140)
* change the version check of ort format save/load

* Address PR comments, update the unit test ort models

* Update some variable names to code convention

* Move IsOrtModelVersionSupported inside of #if defined(ENABLE_ORT_FORMAT_LOAD)
2020-09-11 23:11:31 -07:00
Scott McKay
323a1ba8a4
Add option to exclude support for loading ORT format models in full build. (#5129)
* Add ability to exclude support for loading ORT format models.
Disable support for ORT format models in packages
2020-09-12 12:21:30 +10:00
Ye Wang
5302fe4079
A fix in load_pretrained_model() (#5137)
* Fix in load_pretrained_model

* Update onnx_exporter.py
2020-09-11 17:23:02 -07:00
Hariharan Seshadri
849bb8653f
Exclude a few python tests for the DML EP (#5135) 2020-09-11 16:40:02 -07:00
Guoyu Wang
698eccf15e
Add iOS build instruction (#5125)
* ios build instruction

* fix logger issue in onnx_model_info

* Revert "fix logger issue in onnx_model_info"

This reverts commit 72f2b88256ccf29c75fefbcd1daf6b4dcf7e0c61.

* Address comments and fix small issue in iOS build
2020-09-11 16:10:36 -07:00
Tianlei Wu
7511021e0e
Save Gpt2 test data (#5132)
(1) Save gpt2 test data during test generation.
(2) Use torch fp32 model as baseline when onnx model is fp16.
(3) Refine logic to compose onnx model path
2020-09-11 14:31:49 -07:00
RandySheriffH
120e3cda74
fix path (#5131)
Co-authored-by: Randy Shuai <rashuai@microsoft.com>
2020-09-11 12:18:07 -07:00
Rayan-Krishnan
92a8c650ad
[Debuggability] Add feature to ORTTrainer Frontend (#5124)
* add option, feature to orttrainer and test

* address comments

* minor fixes

* further address comments

* minor changes

Co-authored-by: Rayan Krishnan <t-rakr@OrtDevTest2v100.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>
2020-09-11 12:16:07 -07:00
Ye Wang
89509f256a
Not fuse SkipLayerNorm when add has initializer input (#5123) 2020-09-11 11:46:31 -07:00
Ashwini Khade
cd56ab197c
csharp build documentation (#5121) 2020-09-11 11:46:10 -07:00
dependabot[bot]
15d431f39b Bump node-fetch from 2.6.0 to 2.6.1 in /nodejs
Bumps [node-fetch](https://github.com/bitinn/node-fetch) from 2.6.0 to 2.6.1.
- [Release notes](https://github.com/bitinn/node-fetch/releases)
- [Changelog](https://github.com/node-fetch/node-fetch/blob/master/docs/CHANGELOG.md)
- [Commits](https://github.com/bitinn/node-fetch/compare/v2.6.0...v2.6.1)

Signed-off-by: dependabot[bot] <support@github.com>
2020-09-11 11:45:37 -07:00
Tianlei Wu
ccfbc56388
Handle dummy mask in Attention operators (#5108)
* Handle dummy mask with shape (1, 1) or (batch_size, 1).
2020-09-11 09:31:03 -07:00
stevenlix
c794c88ae0
Solve name conflict in TensorRT engine caching (#5128)
* fix hash conflict

* Add verbose for engine deserialization and destroy old engine memory if new engine is generated

* update parser

* Update tensorrt_execution_provider.cc

* use a better hash algorithm

* Update tensorrt_execution_provider.cc
2020-09-11 09:12:56 -07:00
Guoyu Wang
51f3d3af72
Enable onnxruntime_perf_test for ORT minimal build (#5126)
* Enable onnxruntime_perf_test for ort minimal build

* Add error message

Co-authored-by: gwang0000 <62914304+gwang0000@users.noreply.github.com>
2020-09-11 01:58:11 -07:00
Scott McKay
59ee8ffb17
Remove SparseTensor support from minimal build. (#5114)
* Remove SparseTensor support from minimal build.

Currently the only valid usage of a SparseTensor is as an attribute of a Constant node. That would have been lifted to a dense tensor initializer when loading the onnx model, so would not exist when saving the ORT format model. Due to that there can be no SparseTensors in an ORT format model.

Co-authored-by: gwang <wanggy@outlook.com>
2020-09-11 17:56:54 +10:00
Ye Wang
879751f3b7
Support Tensorflow benchmarking and onnx export in transformers tool (#5068)
* init checkin for tf export and tf benchmark

* small fix on argparse

* refactor

* review comments

* review comments
2020-09-11 00:47:37 -07:00
Changming Sun
c5efb0085d
Update Linux GPU build pipelines to CUDA 10.2 (#5120)
* Update Linux GPU build pipelines to CUDA 10.2
2020-09-10 17:40:51 -07:00
Ashwini Khade
a8557b3f0f
skip tests when model opset > released opset (#5096)
* skip tests when model opset > released opset

* remove multiple model load

* nit fixes

* plus some comments
2020-09-10 17:25:32 -07:00
Hariharan Seshadri
782ccff207
Add dll probe path so that the right DirectML.dll is loaded while running C# tests (#5104) 2020-09-10 16:19:21 -07:00
Wei-Sheng Chin
5618b9dddc
Use CMake built-in function to compare NCCL version (#5118)
* Use CMake built-in function to compare version

* Address comment
2020-09-10 15:59:47 -07:00
Tianlei Wu
c5d4ae0401
Add transformers tools to python package (#5090)
* Add transformers to onnxruntime python package
2020-09-10 15:42:15 -07:00
Moshe David
61051396e8
[TensorRT] Align naming convention and remove redundant code (#5094) 2020-09-10 15:03:34 -07:00
Scott McKay
fae5915d76
CMake fixes/tweaks for minimal builds and MinSizeRel builds (#5112)
* Fix places where MinSizeRel wasn't having relevant flags added in the same way as Release and RelWithDebInfo
Enable LTO for minimal build. Cleanups onnx_minimal.cmake to remove some things handled when LTO is enabled in CMakeLists.txt

* Only enable LTO for MSVC in a minimal build
2020-09-11 06:50:28 +10:00
Changming Sun
a5530358c9
Fix a path problem in Dockerfile.manylinux2014_cuda10_2 (#5106) 2020-09-10 10:30:13 -07:00
Changming Sun
47554a0422
Disable some tests (#5103) 2020-09-10 08:15:18 -07:00