Commit graph

6540 commits

Author SHA1 Message Date
Yi Zhang
c1e37e4ebf
Android CI Pipeline: Fix post coverage bug (#10949) 2022-03-19 11:17:08 -07:00
Ella Charlaix
fe6ab719f3
Fix a typo in quantization tools (#10940) 2022-03-18 21:03:16 -07:00
soundarthiaga
eabb14788a
[perf_metric] added inferences per second metric (#10921) 2022-03-18 21:01:11 -07:00
Yi Zhang
3897b93606
optimize Android CI (#10938) 2022-03-19 11:00:21 +08:00
Kotaro Yamamoto
2dea7dc27f
Skip python arena shrinkage test on ppc (#10901) 2022-03-18 19:31:21 -07:00
soundarthiaga
de06d95096
[parallel_inference] added support for parallel inference with timed duration perf test (#10922) 2022-03-18 19:05:28 -07:00
Scott McKay
5cbacec854
Maintain aspect ratio by doing resize + crop in image_to_pb tool (#10887) 2022-03-19 07:08:45 +10:00
ytaous
f058c59407
Performance: add io_binding support for bert benchmark util (#10907)
* io_binding support

* cover all test cases

* per comments

Co-authored-by: Ethan Tao <ettao@microsoft.com@orttrainingdev7.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>
2022-03-18 10:33:30 -07:00
Changming Sun
c29d94383d
Update CODEOWNERS (#10932) 2022-03-18 09:37:58 -07:00
Ye Wang
ee05c591e5
Fix benchmark bugs and add Pytorch version control (#10928) 2022-03-18 09:24:19 -07:00
Guoyu Wang
6f844522c8
Follow up update for python API checking if vcruntime140_1.dll is available (#10927) (#10933) 2022-03-18 08:09:16 -07:00
Vincent Wang
8860fded02
Disable Some Einsum ORTModule Tests Due to Issue from PyTorch Exporter (#10906)
* disable some einsum tests due to pytorch issue

* disable tests on specific torch versions

* use skipif
2022-03-18 21:28:18 +08:00
Tiago Koji Castro Shibata
5ed2f4ad5f Remove Windows Store specific code 2022-03-17 23:38:14 -07:00
Changming Sun
a3eeb7b0f7 Update orttraining-linux-external-custom-ops.yml for Azure Pipelines 2022-03-17 23:36:53 -07:00
Ye Wang
78133434b5
Fix fp16 converter bugs[1/n] (#10882)
handle sequence type
2022-03-17 22:38:43 -07:00
zhangyaobit
5d4ff67c36
Support fusion options for benchmark.py (#10900)
* Support fusion options for benchmark.py

* Add fusion options for tf model export as well.

* Add command example and warning related to fusion options.
2022-03-17 20:57:43 -07:00
Guoyu Wang
b86d105153
[python API] Change raise import error when C:\Windows\System32\vcruntime140_1.dll is not found to warning (#10927)
* remove throw if C:\\Windows\\System32\\vcruntime140_1.dll cannot be found

* Add comments and update warning message

* adding back accidentally removed line

Co-authored-by: gwang0000 <62914304+gwang0000@users.noreply.github.com>
2022-03-17 18:56:43 -07:00
Sunghoon
740870a285
[js] Create npm packaging pipeline (#10886)
* create npm packaging pipeline

* fix indentations

* Update npm-packaging-pipeline.yml for Azure Pipelines

* Update npm-packaging-pipeline.yml for Azure Pipelines

* Update npm-packaging-pipeline.yml for Azure Pipelines

* react-native-ci as a template

* fix typos

* fix template paths

* add a depencendy

* change a stage name

* set different artifact name for each package

* fix typo

* Update npm-packaging-pipeline.yml for Azure Pipelines

Set a build Id for node npm package as a parameter

* Update npm-packaging-pipeline.yml for Azure Pipelines

Set a build Id for node npm package as a parameter

* Update npm-packaging-pipeline.yml for Azure Pipelines
2022-03-17 16:44:00 -07:00
Valery Chernov
766e6ac4fd
check TVM target for CPU (#10926)
Co-authored-by: Valery Chernov <valery.chernov@deelvin.com>
2022-03-17 15:51:24 -07:00
Yi-Hong Lyu
a46b00499a
Revert "Upsample support NHWC (#10554)" (#10917)
This reverts commit bd08f11a58.

Co-authored-by: Yufeng Li <liyufeng1987@gmail.com>
2022-03-17 13:16:29 -07:00
Sheil Kumar
810c18e809
fix complex multi-dim dft (#10896)
Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2022-03-17 12:45:51 -07:00
Edward Chen
07a71d5bf2
Fix handling of nodes inserted by NHWC transformer. (#10904) 2022-03-17 12:41:34 -07:00
cloudhan
e03b799b95
Make it clear in verbose log on why a kernel impl does not fit for a node (#10872) 2022-03-17 20:05:11 +08:00
Vincent Wang
6c0eff1ae4
optimize Transpose3DKernel (#10891) 2022-03-17 18:09:04 +08:00
PeixuanZuo
463fac67a3
[FIX] symbolic shape infer error with onnx-1.11.0 (#10674)
* [FIX] symbolic shape infer error with onnx-1.11.0

* [FIX] consider inputs name contains 'unk__'

* [TEST] enable gpt2 test

* [FIX] gpt2_megatron_opt.onnx graph
2022-03-17 13:47:02 +08:00
mindest
d7d7665023
restore random states after export_model (#10705)
* restore random states after export_model

* move get/set_random_states inside _export_model

* add comments for random state save/restore

* add unit test for random state check

* resolve comments

* fix error
2022-03-17 11:56:25 +08:00
Chi Lo
42d7112f03
Add multithreading test and put a lock on nvinfer1::createInferRuntime() for TRT EP (#10714)
* Add multithread unit test and put lock on library call

* update code

* remove debug code

* add comment

* add one session multi-threads inference

* Put lock for build engine all the time

* Update naming and comment

* remove unnecessary lock

* Revert "remove unnecessary lock"

This reverts commit 9c2317b1d2273dec0ebdeb52160bc757839e5edc.
2022-03-16 09:19:33 -07:00
Chi Lo
ce204d0744
Update to flatbuffers v2.0.0 (#10866) 2022-03-16 09:18:49 -07:00
Chun-Wei Chen
bac9c0eb50
skip optional related models from opset16 (#10840) (#10878) 2022-03-16 08:49:42 -07:00
PeixuanZuo
5763657715
[UPDATE] Add prefix in front of the file (#10884) 2022-03-16 21:05:18 +08:00
Valery Chernov
625a1f7673
[TVM EP] code refactor (#10655)
* rename info to options for TVM EP

* transfer options processing from TVMExecutionProvider to TVMEPOptions

* transfer TVMRunner to separated files

* implement TVMCompiler class

* replace CompileFunc by TVMCompiler object. update TVMRunner. now it does not depend on TvmExecutionProvider

* correct logging of TVM EP options

* RunnerImpl, GERunnerImpl and VMRunnerImpl were implemented

* add prepareComputeInfo method

* remove update_output_shapes flag

* embed all TVM EP dependences to tvm namespace. transfer model compilation from TVMRunner. connect TVMRunnerImpl to TVMRunner

* refactor compileModel method

* small cleaning

* separate TVM EP options data store and processing

* replace TvmTensorShape by InlinedVector with max_size 5

* correct indentation

* update TVM hash

Co-authored-by: Valery Chernov <valery.chernov@deelvin.com>
2022-03-16 13:55:04 +01:00
Edward Chen
f468ea40e5
Refactor Node::AddAttribute() (#10869) 2022-03-16 14:53:00 +10:00
PeixuanZuo
040c0645e2
[ADD] Add micro-benchmark for Cast (#10870)
* [ADD] Add micro-benchmark for Cast

* [UPDATE] related to bert model and fix the format
2022-03-16 10:48:26 +08:00
Sheil Kumar
860f28254e
Update DFT definition to more closely align with PyTorch by enabling axis attribute, and arbitrary tensor rank. (#10842)
* Add axis attribute

* fix breaks

* Enable axis-specified DFT

* remove static cast

Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2022-03-15 15:27:12 -07:00
Changming Sun
de6d1fcb41 Update C# runtest.sh 2022-03-15 14:18:34 -07:00
Chi Lo
aebbb90b79
Integrate C-API tests into Pipelines for release packages (#10794)
* add c-api test for package

* fix bug for running c-api test for package

* refine run application script

* remove redundant code

* include CUDA test

* Remove testing CUDA EP temporarily

* fix bug

* Code refactor

* try to fix YAML bug

* try to fix YAML bug

* try to fix YAML bug

* fix bug for multiple directories in Pipelines

* fix bug

* add comments and fix bug

* Update c-api-noopenmp-packaging-pipelines.yml

* Remove failOnStandardError flag in Pipelines
2022-03-15 10:18:38 -07:00
Nat Kershaw (MSFT)
2d961604b1
Refactor Python API docs to better explain IO binding scenarios (#10651) 2022-03-15 09:40:59 -07:00
Scott McKay
f385c73058
Fix a couple of issues with the python package tools (#10858)
* Tweaks to the model utils
  * Add handling for a dim_value of -1 when replacing the entire input shape. This occurs in models exported from PaddlePaddle
  * make pytorch helpers accessible in package
  * make QDQ helpers accessible in package
2022-03-15 15:52:12 +10:00
Hariharan Seshadri
0d8d44d035
Fix Reduced ops pipeline (#10861) 2022-03-14 20:42:14 -07:00
Rachel Guo
7e9dfe627a
Add additional NNAPI QDQ test cases for expected failure path (#10769)
* add more qdq softmax test case

* add rest of additional nnapi qdq test case

* comment out

* update

* update concat test case

* extract zp/scale point retrieval out

* fix

* fix helper function for scale/zp and address some pr comments

* use enum class expectedepnodeassignments

* fix default value for ExpectedEPNodeAssignment

* update

* refine some comments

* refine some minor comments

* refine pr comments

* update

Co-authored-by: rachguo <rachguo@rachguos-Mini.attlocal.net>
2022-03-14 19:22:22 -07:00
Edward Chen
e53422c6d0
Update convert_onnx_models_to_ort.py to support runtime optimizations. (#10765)
Add runtime optimization support to ONNX -> ORT format conversion script.
Replace `--optimization_level`, `--use_nnapi`, and `--use_coreml` with a new `--optimization_style` option.
2022-03-14 16:50:41 -07:00
Abhishek Jindal
03181caeae
Creating test case for printing ort tensor (#10850)
* creating a test for printing ort tensor

* modifying comment for error case

* Using Output Grabber to assert the print output

* modifying the print ort test

* removing comments

* removing sys import
2022-03-11 21:39:48 -08:00
Tiago Koji Castro Shibata
ce10d7d231 Workaround https://github.com/microsoft/STL/issues/434#issuecomment-921321254 2022-03-11 18:10:23 -08:00
Tiago Koji Castro Shibata
d1f059c856 Remove kernel32 defaultlib 2022-03-11 18:10:23 -08:00
Tiago Koji Castro Shibata
ddf1a312ab Remove cpuinfo from WCOS builds 2022-03-11 18:10:23 -08:00
Funtowicz Morgan
c4f73af234
Fix wrong percentile values returned during calibration (#10847)
* Use numpy.percentile to get the lookup value.

* Use 1.0 as float value rather than integer.

* Add missing cdf parameter for `np.percentile`.

* Use 100. instead of 1.0

* Remove print.

* Update from @yufenglee
2022-03-11 14:52:09 -08:00
Scott McKay
6ac3b8d46a
Add support for opset 16 to transpose optimizer. (#10841)
* Add support for opset 16 to transpose optimizer.

Only change required is for GridSample to be added to the layout sensitive ops. The existing handling for layout transpose works with that as the first input and first output are layout sensitive.

Update the optimize to be able to return an error message if it fails.
2022-03-12 08:50:55 +10:00
Changming Sun
2d2eebb844 Correct a comment
"WINVER=0x0602" means  Windows 8.  source: https://docs.microsoft.com/en-us/cpp/porting/modifying-winver-and-win32-winnt?view=msvc-170
2022-03-11 11:42:41 -08:00
Ryan Lai
2e7592ddf8
avoid using LocalFree on FormatMessageW buffer (#10796)
* remove local free

* Remove local free from onnxruntime

* don't allocate

* Change to use constexpr to satisfy  CPU build warning
2022-03-11 11:11:40 -08:00
Kotaro Yamamoto
64556888a1
add python binding for RunOptions config entry (#10694) 2022-03-11 08:49:22 -08:00