Commit graph

2509 commits

Author SHA1 Message Date
Andrews548
475ea382f9
Fix ACL EP convolution-activation fusion optimization (#3896)
Co-authored-by: Andrei-Alexandru <andrei-alexandru.avram@nxp.com>
2020-05-12 14:06:48 -07:00
Jeff Bloomfield
f170f31e1d
Extend workaround with input name matching in DML fused graph kernel (#3918)
2020-05-12 14:04:09 -07:00
Bowen Bao
0f82b42fed
Ensure pt model is set to cpu in ort_trainer (#3867)
* Ensure pt model is set to cpu in ort_trainer

* add note comment
2020-05-12 13:32:27 -07:00
Tianlei Wu
29496170d7
Add Benchmark Script for Bert Models (#3829)
Add benchmark script for Transformer models
* Set intra_op_num_threads=1 for cpu (version <= 1.2.0)
* Add percentiles for latency
* torch.set_num_threads (for intra op) to get fair comparison
* Allow export ONNX model with specified number of inputs
* Add fusion statistics
* Install transformers from source
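
The "percentiles for latency" item above can be illustrated with a minimal sketch. This is not the benchmark script from the commit; `measure_latencies` and `latency_percentiles` are hypothetical names, and the nearest-rank method is an assumption about how percentiles might be computed.

```python
import time


def measure_latencies(fn, runs=100):
    """Call fn() `runs` times and return each call's latency in milliseconds."""
    latencies = []
    for _ in range(runs):
        start = time.perf_counter()
        fn()
        latencies.append((time.perf_counter() - start) * 1000.0)
    return latencies


def latency_percentiles(latencies_ms, percentiles=(50, 75, 90, 95, 99)):
    """Return {percentile: latency} using the nearest-rank method.

    Percentiles are expected to be integers in 1..100.
    """
    ordered = sorted(latencies_ms)
    n = len(ordered)
    result = {}
    for p in percentiles:
        # Integer ceiling of p * n / 100 avoids floating-point rounding surprises.
        rank = max(1, (p * n + 99) // 100)
        result[p] = ordered[rank - 1]
    return result


if __name__ == "__main__":
    # Hypothetical workload standing in for one model inference call.
    samples = measure_latencies(lambda: sum(range(1000)), runs=50)
    print(latency_percentiles(samples))
```

Reporting several percentiles rather than only the mean makes tail latency visible, which is usually what matters when comparing runtimes.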
2020-05-12 12:26:22 -07:00
Scott McKay
2fed37c8eb
Fix bug in handling of an initializer that provides a graph output. (#3912)
* Outputs from model execution should always be returned in a newly allocated buffer or a pre-allocated buffer provided in fetches. When an initializer is providing a graph output (e.g. constant folding may result in this) we were returning an OrtValue that pointed to the initializer and not a separately allocated buffer with a copy.

This was wrong as:
  - value wasn't returned in a pre-allocated fetch so whilst the value returned was correct, it was returned in the wrong place
  - user could alter the data in the initializer via the returned value

* Add unit test with and without pre-allocated fetch.

* Add some extra info around why we're handling this special case.
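
The aliasing problem described in this commit message can be sketched in a few lines. This is an illustrative model only, not ONNX Runtime code: `TinySession`, `run_aliasing`, and `run_copying` are hypothetical names, and plain Python lists stand in for tensors.

```python
class TinySession:
    """Hypothetical stand-in for a session that owns initializer data."""

    def __init__(self, initializers):
        # name -> list of floats, standing in for initializer tensors
        self._initializers = initializers

    def run_aliasing(self, name):
        # Problematic pattern: hands back the initializer's own storage,
        # so the caller can mutate model state through the result.
        return self._initializers[name]

    def run_copying(self, name, fetch=None):
        # Pattern described in the commit message: copy into the caller's
        # pre-allocated fetch if one is given, otherwise into a new buffer.
        source = self._initializers[name]
        if fetch is not None:
            fetch[:] = source
            return fetch
        return list(source)


session = TinySession({"folded_const": [1.0, 2.0, 3.0]})

copied = session.run_copying("folded_const")
copied[0] = -1.0  # modifies only the copy, not the initializer

preallocated = [0.0, 0.0, 0.0]
fetched = session.run_copying("folded_const", fetch=preallocated)
```

With `run_copying`, the value lands in the buffer the caller supplied and the initializer stays untouched, which covers both problems listed in the message.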
2020-05-12 20:42:58 +10:00
George Wu
6f729b100f
use LOAD_WITH_ALTERED_SEARCH_PATH for LoadLibraryExA (#3908)
2020-05-11 19:53:34 -07:00
Thiago Crepaldi
70abb120b3
Remove ORTModel from frontend API (#3825)
* Resolve conflict

* Address review
2020-05-11 18:20:33 -07:00
Tracy Sharpe
b12d35b679
MLAS: tune softmax kernels for partial vectors (#3906)
2020-05-11 18:02:50 -07:00
M. Zeeshan Siddiqui
c46a9e8d65
Add numerical stability to SoftmaxGrad test inputs. (#3857)
* Increase the tolerance for SoftmaxGrad CPU-GPU compare tests.

* Increase the tolerance for SoftmaxGrad CPU-GPU compare tests.

* Add 1e-2 to Y for numerical stability.

* build break.

* comments.

* PR feedback.

* PR feedback.
2020-05-11 17:59:24 -07:00
Jeff Bloomfield
af7d453435
Merge DML Execution Provider updates (#3885)
* Merged PR 4616739: Update QLinear Ops fix 1D support layout

Update QLinear Ops fix 1D support layout

Related work items: #26011523

* Merged PR 4617257: Gather operator DML EP fails with scalar indices and 1D inputs

Fix gather with scalar value.

The ONNX conformance test case is in another PR:

    // 0D, axis 1, rank 0 indices tensor
    {
      "op_type": "Gather",
      "axis": 0,
      "data": [1,2,3],
      "indices": 0,
      "output": 1,
      "T": "float32"
    }
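
For reference, the expected result of the test case above can be reproduced with a small pure-Python sketch of Gather along axis 0. This is not the DML EP implementation; `gather_axis0` is a hypothetical helper, and nested lists stand in for tensors.

```python
def gather_axis0(data, indices):
    """Gather entries of `data` along axis 0.

    A scalar (rank-0) index returns the selected entry itself, so the
    output rank is rank(indices) + rank(data) - 1.
    """
    if isinstance(indices, int):
        return data[indices]
    # Recurse so that nested index lists keep their shape in the output.
    return [gather_axis0(data, i) for i in indices]


scalar_result = gather_axis0([1, 2, 3], 0)       # rank-0 indices, 1D data
vector_result = gather_axis0([1, 2, 3], [0, 2])  # rank-1 indices, 1D data
```

The scalar case is the one the fix targets: a rank-0 index into 1D data yields a rank-0 output rather than a one-element tensor.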

* Merged PR 4632178: Re-enable ORT onnx_test_runner test case (DirectML ConvTranspose validation needs to be loosened to comply with ONNX definition of output_padding)

Re-enable 1D convolution tests.

Related work items: #23499747

* Merged PR 4656672: Make DML EP use Direct queue

While a Compute queue has benefits, Direct is consistent with Winml.

Related work items: #26324112

* Update DML nuget version

* Merged PR 4662079: Update DmlDev branch again from github master

Include Sheil's changes to fix namespace and header file include paths. Without this, the ONNX conformance tests all fail with E_NOTIMPL.

* Increment DML nuget version

Co-authored-by: Nick Feeney <nickfe@microsoft.com>
Co-authored-by: Dwayne Robinson <dwayner@microsoft.com>
2020-05-11 17:57:01 -07:00
Scott McKay
0cdc1459c6
Add a few more things to the helper python scripts. (#3842)
* Add a few more things to the helper python scripts.
Add documentation with usage examples.

* Address PR comments
2020-05-12 09:59:40 +10:00
stevenlix
28f693a8a1
Update TensorRT dynamic shape profile when input shape changed during runtime (#3904)
* Update dynamic shape range when input shape changed during runtime

* Update tensorrt_execution_provider.cc

* Update tensorrt_execution_provider.cc
2020-05-11 16:11:41 -07:00
Ori Levari
6d2d927809
add test to api tests to ensure correct hresult is returned for corrupt model file (#3796)
Co-authored-by: Ori Levari <orlevari@microsoft.com>
2020-05-11 13:22:22 -07:00
Sheil Kumar
c7da194313
remove winrt (#3899)
Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2020-05-11 10:47:07 -07:00
Prabhat
ce3678ffaf
Added aarch64 build pipeline (#3841)
* Added aarch64 build pipeline

* Fix build error

* Remove auditwheel repair which doesn't work with cross compiling

* Statically link C++

* Added auditwheel repair back and fix stdlib.h

* Remove extra space
2020-05-11 22:56:16 +05:30
Scott McKay
d7e39569da
Use MlasComputeSoftmax in traditional ML ops (#3892)
* Use MlasSoftmax in ML ops

* Refine when mlas is used based on perf testing.
2020-05-11 16:28:59 +10:00
Hariharan Seshadri
06985a9922
Stop proceeding with constant folding if a CPU kernel is not found (#3836)
* Stop proceeding with constant folding if a CPU kernel is not found for a node

* Fix build

* PR feedback

* Fix typo

* Refine

* Remove unnecessary header inclusion

* Refine

* Fix build

* More changes

* More changes

* More changes

* Fix CentOS build
2020-05-10 19:08:37 -07:00
Ryan Lai
7fd2c8f9e8
Add signed GPU nuget package to publish ort-nightly nuget feed (#3834)
* Add signed nuget package to publish ort-nightly nuget feed

* Push managed nuget as well

* Indentation fix

* Indentation fix

* Update gpu.yml to also publish directml nuget

* Fix typo in naming of task
2020-05-10 16:24:45 -07:00
M. Zeeshan Siddiqui
5e1244eb4d
Update ONNX submodule to ONNX 1.7 release branch. (#3888)
* Update ONNX submodule to ONNX 1.7 release branch.

* Update ONNX submodule to ONNX 1.7 release branch.

* fix version.
2020-05-10 15:44:44 -07:00
ytaous
96030fdcbc
dashboard integration - output training perf metrics as json (#3809)
* dashboard integration - first phase

* change a field

* perf scripts

* addressing PR comments

* address comments and fix build

* minor

* make GetConfigFromData() const

* more update for comments

* addressing comments

* more on addressing comments

* minor

* fix build

* add condition check

* more on comments

* return status

* remove batch size

* on comments

* rename pkg path

* rename pkg path

* additional comments

Co-authored-by: Ethan Tao <ettao@microsoft.com>
2020-05-10 10:29:38 -07:00
M. Zeeshan Siddiqui
8291065e20
Re-enable Dropout(12) ONNX inferencing tests. (#3833)
* Re-enable disabled Dropout(12) inferencing tests.

* Re-enable disabled Dropout(12) inferencing tests.

* Disable Dropout(12) tests that return boolean value on NGraph EP.
2020-05-10 06:26:01 -07:00
M. Zeeshan Siddiqui
eb33d5eda9
Do not register Dropout(12) as training ONLY kernel. (#3859)
* Do not register Dropout(12) as training ONLY kernel.

* Move Dropout forward implementation in inference project.

* fix inference build test failures.

* remove fp16 test since its support is absent on CPU.

* build break.
2020-05-09 21:38:17 -07:00
Ryan Hill
408f62dd57
Load provider shared libraries relative to core runtime executable (#3884)
* Load provider DLL relative to core runtime executable
* Use LoadLibraryEx to fix dependent DLL loading
* Fix custom op DLL loading path issue.
2020-05-09 20:49:15 -07:00
Vincent Wang
3c24841569
Fold Shape Node During Constant Folding (#3748)
* Fold Shape node in constant folding.

* bugfix

* Fix test failure.

* Bugfix for C++ frontend.

* Bugfix for C++ frontend.

Co-authored-by: Vincent Wang <weicwang@microsoft.com>
2020-05-09 20:15:03 +08:00
Sheil Kumar
890c945d4c
winml_is_inbox is not accessible in function scope, move it out. (#3886)
Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2020-05-09 00:23:59 -07:00
George Wu
672e40bac2
fix dnnl ep shared lib python packaging. (#3875)
2020-05-08 17:11:29 -07:00
George Wu
93e39b81a7
fix build break on Linux. (#3874)
2020-05-08 17:11:03 -07:00
Sheil Kumar
cf6a1c1715
Fix Windows Inbox build failing on 1) building raw api tests and 2) referencing _winml namespace in onnxruntime.dll (#3872)
* add build inbox flag

* remove raw tests and wstring for utf filenames

* enable raw tests

* use ToWideString

* create new utf8 helper

* update string helper to utf8

Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2020-05-08 15:59:16 -07:00
Pranav Sharma
22a711457f
Fix C# log APIs. Also fixes github issue #3409. (#3840)
* Fix C# log APIs. Fixes github issue #3409.

* Fix build error due to accidental duplication of GraphOptimizationLevel

* Fix runoptions

* Fix broken test. Add --blame switch to dotnet test cmd line to print the failed test in case of crash.
2020-05-08 14:31:06 -07:00
Ye Wang
a9b43b05cb
Support more types in MurmurhashOp (#3827)
* support more types for MurmurhashOp

* resolve comments

* fix comments
2020-05-07 20:43:11 -07:00
Tracy Sharpe
19e3dc47d7
LSTM performance tweak + cleanup (#3868)
* avoid unneeded buffer zeroing; includes cleanup

* fix includes
2020-05-07 19:41:52 -07:00
Hariharan Seshadri
fe5a20f2d1
Fix initial value of graph optimization (#3856)
2020-05-07 18:41:29 -07:00
Tiago Koji Castro Shibata
eb7486d632
WinML Adapter DMLEP tests (#3752)
* Add DML EP tests

* wip

* Fix DmlCopyTensor

* Cleanup

* Remove leaks in AdapterDmlEpTest

* Fix rebase conflicts

* Move GPUTEST macro to tests instead of suite setup

* Address PR feedback
2020-05-07 16:25:41 -07:00
Sheil Kumar
e981496cf7
add build inbox flag (#3855)
Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2020-05-07 15:52:43 -07:00
ashbhandare
424a00bf04
Fix enabling gradient as output for easy mode. (#3866)
2020-05-07 15:07:14 -07:00
stevenlix
4ea10c9202
bump up ORT version and extend time limit for windows cpu packaging pipelines (#3852)
2020-05-07 14:22:20 -07:00
Tracy Sharpe
cb554fbc2d
MLAS: Add MlasComputeSoftmax/MlasComputeExp (#3846)
* add MlasComputeSoftmax

* fix onnxruntime_mlas_test DLLs

* remove unneeded header

* remove unneeded header

* call MlasComputeExp

* call MlasComputeSoftmax

* call MlasComputeSoftmax

* finish off

* fix static analysis warning
2020-05-07 14:02:01 -07:00
liuziyue
914aaaa1c8
Fuse Attention For One Input bert-base-dynamic Model (#3850)
match additional mask path for attention
2020-05-07 13:40:28 -07:00
Tiago Koji Castro Shibata
43b87de7d1
Support wide char paths in CreateModel (#3835)
* Support wide char paths in CreateModel

* Use UTF-8 in WinML adapter API
2020-05-07 13:32:19 -07:00
Ryan Hill
3bafc096bb
Fast exit workaround for dnnl test. (#3863)
2020-05-07 11:23:40 -07:00
Wei-Sheng Chin
0aeb383273
Support Pipeline in Training Runner (#3770)
2020-05-06 21:03:36 -07:00
Vincent Wang
c222ed6327
Optimize Where CUDA kernel for UniLMV2. (#3799)
Co-authored-by: Vincent Wang <weicwang@OrtDevTest2v100.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>
2020-05-07 10:23:54 +08:00
Yulong Wang
65bfece19d
[Node.js API] optimize prebuild (#3844)
* optimize prebuild

* fix base folder for prepack

* bump dev version
2020-05-06 15:48:13 -07:00
Faith Xu
9cca219b1a
Add FAQ page (#3324)
* Create FAQ.md

* Update README.md

* Update README.md

* Update FAQ.md

* Minor update

* Resync readme page from master

* Update structure and wordings

* Minor update

* Updates based on feedback

* Fix links

* Update to include common perf questions

* Update ONNX_Runtime_Perf_Tuning.md

* Update FAQ.md

* Update README.md

* Update FAQ.md

* Update docs/ONNX_Runtime_Perf_Tuning.md

Co-Authored-By: Nat Kershaw (MSFT) <nakersha@microsoft.com>

* Update docs/ONNX_Runtime_Perf_Tuning.md

Co-Authored-By: Nat Kershaw (MSFT) <nakersha@microsoft.com>

* Update docs/ONNX_Runtime_Perf_Tuning.md

Co-Authored-By: Nat Kershaw (MSFT) <nakersha@microsoft.com>

* Update docs/ONNX_Runtime_Perf_Tuning.md

Co-Authored-By: Nat Kershaw (MSFT) <nakersha@microsoft.com>

* Update ONNX_Runtime_Perf_Tuning.md

* Update FAQ.md

* Update README.md

* Update FAQ.md

Co-authored-by: Nat Kershaw (MSFT) <nakersha@microsoft.com>
2020-05-06 15:43:32 -07:00
Xueyun Zhu
0e59668c1b
add support for symbolic broadcast for Add/Sub/Mul (#3743)
* add support for symbolic broadcast

* fix comment

* address feedback
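
As a rough illustration of what broadcasting with symbolic dimensions involves, here is a sketch under stated assumptions. It is not the code from this commit: `broadcast_shapes` is a hypothetical helper, symbolic dimensions are modeled as strings, and the rule that two different symbols cannot be broadcast together is an assumption made for this sketch.

```python
def broadcast_shapes(a, b):
    """Broadcast two shapes whose dims are ints or symbolic names (str).

    Follows the usual right-aligned broadcasting rules. A symbolic dim is
    only compatible with 1 or with the same symbol (sketch assumption).
    """
    result = []
    for i in range(1, max(len(a), len(b)) + 1):
        # Missing leading dims behave like 1.
        da = a[-i] if i <= len(a) else 1
        db = b[-i] if i <= len(b) else 1
        if da == 1:
            result.append(db)
        elif db == 1 or da == db:
            result.append(da)
        else:
            raise ValueError(f"cannot broadcast {da!r} with {db!r}")
    return tuple(reversed(result))


# "batch" is a hypothetical symbolic dimension name.
out_shape = broadcast_shapes(("batch", 1, 4), (3, 4))
```

Here `out_shape` keeps the symbol in the leading position, so the output shape of an elementwise Add, Sub, or Mul can be expressed even when a dimension is not known ahead of time.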
2020-05-06 10:40:57 -07:00
Scott McKay
687edd702c
Add RelWithDebInfo target to the C# projects so that it correctly finds the native build. (#3839)
Make the cmake file slightly more consistent for the build c# flag.
2020-05-06 20:01:04 +10:00
Adam Pocock
d38b79c6e5
[Java] Adding missing methods on Session, SessionOptions and RunOptions (v2) (#3832)
* java - adding support for custom op libraries.

* Adding support for RunOptions and additional methods for SessionOptions and OrtSession.

As a result OrtEnvironment.LoggingLevel moved to be a top level enum
called OrtLoggingLevel.

* java - adding unit tests for RunOptions and SessionOptions.

* java - removing unused releaseNamesHandle method

* java - add test for custom op library.

* java - adding log verbosity methods, and tests for the same.

* java - fixes for custom op loading test on Windows.

* Cleanup after rebase on master.
2020-05-06 01:19:46 -07:00
Ryan Hill
d5ec353e58
Ryanunderhill/mkldnn dll (#3314)
First version of allowing providers to work as DLLs, only implemented for DNNL so far.

More improvements to come next!
2020-05-06 00:57:09 -07:00
M. Zeeshan Siddiqui
9b02b3df6f
Update ONNX submodule to ONNX 1.7 release candidate 3. (#3838)
2020-05-06 00:55:19 -07:00
Hariharan Seshadri
b45ce92542
Fix 3828 (#3837)
2020-05-06 00:05:15 -07:00