Commit graph

2541 commits

Author SHA1 Message Date
edgchen1
999554cc53 CGManifest - add training entries and generate entries for submodules. (#3933)
Add cgmanifest.json entries for training dependencies.
Add script to generate git submodule cgmanifest.json entries.
2020-05-15 13:34:18 -07:00
ytaous
bc441b7e5c
Add cpu/mem usage for perf metrics (#3947)
* add cpu/mem usage

* on comments

* on comments

* renaming

Co-authored-by: Ethan Tao <ettao@microsoft.com>
2020-05-15 12:29:40 -07:00
Tracy Sharpe
be003dbab7
Fix ACL build break (#3952) 2020-05-14 23:14:15 -07:00
Pranav Sharma
47ae9691fd
Fix ordering of APIs. (#3951) 2020-05-14 21:27:46 -07:00
Hariharan Seshadri
9ef376880b
Add test for If node with conditional branches only containing Constant nodes (#3949) 2020-05-14 19:21:40 -07:00
Ryan Lai
38467f8c9a
DirectML Nuget package has different time stamp than Native and Managed Nuget (#3950)
* Fix DirectML nuget creation in Nuget pipeline

* DirectML Nuget package has different timestamp

* remove accidentally changed file
2020-05-14 18:52:08 -07:00
Jeff Bloomfield
e6da5946d1
Update DML Nuget version and DML EP Doc (#3945)
Update DML Nuget version and DML EP Doc
2020-05-14 17:33:46 -07:00
Tianlei Wu
782c6c24b2
Rename bert to transformers (#3946)
* rename folder bert to transformers
* rename bert_model_optimization.py to optimizer.py
* update URL links in notebooks
2020-05-14 15:32:59 -07:00
Zhang Lei
3c4f3d01cd
Implement QLinearLeakyRelu (#3648)
* Implement QLinearRelu and its unit test.
* Add logic to compute table during constructor when all parameters is constant.
* Fix test case rounding result related with rounding mode.
2020-05-14 14:52:55 -07:00
Scott McKay
5e0928a777
Enable running PEP8 on python scripts using flake8 (#3928)
* Enable running PEP8 checks via flake8 as part of the build if flake8 is installed.
Update scripts in \tools and \onnxruntime\python. Excluding \onnxruntime\python\tools which needs a lot more work to be PEP8 compliant. Also excluding orttraining\tools for the same reason.
Install flake8 as part of the static_analysis build task in the Win-CPU CI so the checks are run in one CI build.
Update coding standards doc.
2020-05-15 07:15:06 +10:00
Yulong Wang
39814319b2
[Node.js binding] fix linux build (#3927) 2020-05-14 12:37:32 -07:00
Yufeng Li
50f798dad6
support non-zero zero point for matmulinteger u8s8 (#3883)
* support non-zero zero point for matmulinteger u8s8
2020-05-14 12:02:28 -07:00
Changming Sun
9c989c8dd6
Update build doc for cross-compiling (#3672) 2020-05-14 11:43:06 -07:00
manashgoswami
cab21223b3
Updated TPN for OpenMPI and cleanup (#3932)
* Update README.md

* Update ReleaseManagement.md

* Updated Third Party Notice for training feature

Added Open MPI license
2020-05-14 11:42:44 -07:00
gwang-msft
cba8bdc790
Make some compile change for Android NNAPI provider using DNNLibrary (#3935)
* Change compile settings for NNAPI with DNNLib

* update build.py

* update build readme
2020-05-14 10:53:37 -07:00
Prasanth Pulavarthi
84c108a85e
link to folder instead of READMEs inside folder (#3938)
otherwise hard to find the source code
2020-05-14 07:35:23 -07:00
Ryan Lai
48f69cfbb8
Fix DirectML nuget creation in Nuget pipeline (#3929) 2020-05-13 19:34:38 -07:00
Chi Lo
f380460a9e
Update the build steps to support ORT on Jetson (#3869)
* Update BUILD doc for ARM64 build for TensorRT support on Jetson device

* minor revision

* JetPack 4.4 is in developer preview stage, so we suggest to use JetPack
4.3
2020-05-13 19:30:27 -07:00
ytaous
93eb9bcfde
Add yaml/perf scripts for new perf test pipeline (#3909)
* yaml/perf scripts for new pipeline

* yaml/perf scripts for new pipeline

* remove unused imports

* testing some comments change

* testing some comments change

* testing jdbc

* testing jdbc

* testing jdbc

* exclude pwd from jdbc properties

* exclude pwd from jdbc properties

* namedtuple

* on comments

Co-authored-by: Ethan Tao <ettao@microsoft.com>
2020-05-13 14:15:17 -07:00
Changming Sun
e86214e5c0
Fix the tensorflow performance test (#3847) 2020-05-13 11:52:59 -07:00
Yufeng Li
7c774e967a
support quantization of optimized model with ir<4 (#3853) 2020-05-13 11:16:37 -07:00
Prabhat
25257a661d
Added onnxruntime aarch64 wheel to pypi publishing pipeline (#3903)
* Added onnxruntime aarch64 wheel to pypi publishing pipeline

* Support nightly build flag

* Add support for nightly build
2020-05-13 23:20:29 +05:30
Jeff Bloomfield
1c1685ad2b
Fix error handling in LearningModelSession.cpp (#3920) 2020-05-13 10:13:16 -07:00
Tiago Koji Castro Shibata
385073e1cd
Fix DmlCopyTensor test (#3923)
* Fix heap corruption

* Cleanup
2020-05-13 09:14:55 -07:00
Zhang Lei
eab61e87ce
Fix quantization tool bugs when model nodes have no name. (#3854)
Fix bugs when model nodes have no name.
2020-05-12 20:38:26 -07:00
liqunfu
9b5daa2039
patch torch onnx opset 10 (#3910)
patch pytorch to export onnx nll_loss opset version 10. add mnist test to covert onnx opset version 10.
2020-05-12 18:11:25 -07:00
Ori Levari
7b858d60b0
Various changes for automated downlevel test pipeline (#3901)
Co-authored-by: Ori Levari <orlevari@microsoft.com>
2020-05-12 17:22:47 -07:00
Hariharan Seshadri
3065219cc1
Changes related to the release binaries requiring Visual C++ 2019 runtime (#3871) 2020-05-12 17:07:06 -07:00
Xiang Zhang
bccbdd03f1
User/xianz/enable batch tests (#3914)
* enable batch tests in winml_image_test

* copy batchGroundTruth folder

* skip GPU tests when GPU is unavailable
2020-05-12 15:46:46 -07:00
Dmitri Smirnov
18dc0ec39f
Rework jar by creating os-arch folders (#3849)
Detect os and arch and move the artifacts to a new folder.
 Remove unnecesary jars so we cam focus on those we publish.
 Add signing
 Make signature simlper.
 Fix indent.
 Halt on 32-bit arch.
 Credits: @Craigacp
2020-05-12 15:31:20 -07:00
Hariharan Seshadri
c00945ae81
Build ORT by default for Mac OS X versions 10.12+ (#3626) 2020-05-12 14:43:32 -07:00
Scott McKay
99415f09fe
Fix bug where linear_output_ is not cleared when linear_before_reset is true and no bias input is provided. Requires a batch size of 3 or more to trigger if initial_h is not provided. (#3893) 2020-05-13 07:18:32 +10:00
Andrews548
475ea382f9
Fix ACL EP convolution-activation fusion optimization (#3896)
Co-authored-by: Andrei-Alexandru <andrei-alexandru.avram@nxp.com>
2020-05-12 14:06:48 -07:00
Jeff Bloomfield
f170f31e1d
Extend workaround with input name matching in DML fused graph kernel (#3918) 2020-05-12 14:04:09 -07:00
Bowen Bao
0f82b42fed
Ensure pt model is set to cpu in ort_trainer (#3867)
* Ensure pt model is set to cpu in ort_trainer

* add note comment
2020-05-12 13:32:27 -07:00
Tianlei Wu
29496170d7
Add Benchmark Script for Bert Models (#3829)
Add benchmark script for Transformer models
* Set intra_op_num_threads=1 for cpu (version <= 1.2.0)
* Add percentiles for latency
* torch.set_num_threads (for intra op) to get fair comparison
* Allow export ONNX model with specified number of inputs
* Add fusion statistics
* Install transformers from source
2020-05-12 12:26:22 -07:00
Scott McKay
2fed37c8eb
Fix bug in handling of an initializer that provides a graph output. (#3912)
* Outputs from model execution should always be returned in a newly allocated buffer or an pre-allocated buffer provided in fetches. When an initializer is providing a graph output (e.g. constant folding may result in this) we were returning an OrtValue that pointed to the initializer and not a separately allocated buffer with a copy.

This was wrong as:
  - value wasn't returned in a pre-allocated fetch so whilst the value returned was correct, it was returned in the wrong place
  - user could alter the data in the initializer via the returned value

* Add unit test with and without pre-allocated fetch.

* Add some extra info around why we're handling this special case.
2020-05-12 20:42:58 +10:00
George Wu
6f729b100f
use LOAD_WITH_ALTERED_SEARCH_PATH for LoadLibraryExA (#3908) 2020-05-11 19:53:34 -07:00
Thiago Crepaldi
70abb120b3
Remove ORTModel from frontend API (#3825)
* Resolve conflict

* Address review
2020-05-11 18:20:33 -07:00
Tracy Sharpe
b12d35b679
MLAS: tune softmax kernels for partial vectors (#3906) 2020-05-11 18:02:50 -07:00
M. Zeeshan Siddiqui
c46a9e8d65
Add numerical stability to SoftmaxGrad test inputs. (#3857)
* Increase the tolerance for SoftmaxGrad CPU-GPU compare tests.

* Increase the tolerance for SoftmaxGrad CPU-GPU compare tests.

* Add 1e-2 to Y for numerical stability.

* build break.

* comments.

* PR feedback.

* PR feedback.
2020-05-11 17:59:24 -07:00
Jeff Bloomfield
af7d453435
Merge DML Execution Provider updates (#3885)
* Merged PR 4616739: Update QLinear Ops fix 1D support layout

Update QLinear Ops fix 1D support layout

Related work items: #26011523

* Merged PR 4617257: Gather operator DML EP fails with scalar indices and 1D inputs

Fix gather with scalar value.

The ONNX conformance test case is in another PR:

    // 0D, axis 1, rank 0 indices tensor
    {
      "op_type": "Gather",
      "axis": 0,
      "data": [1,2,3],
      "indices": 0,
      "output": 1,
      "T": "float32"
    }

* Merged PR 4632178: Re-enable ORT onnx_test_runner test case (DirectML ConvTranspose validation needs to be loosened to comply with ONNX definition of output_padding)

Re-enable 1D convolution tests.

Related work items: #23499747

* Merged PR 4656672: Make DML EP use Direct queue

While a Compute queue has benefits, Direct is consistent with Winml.

Related work items: #26324112

* Update DML nuget version

* Merged PR 4662079: Update DmlDev branch again from github master

Include Sheil's changes to fix namespace and header file include paths. Without this, the ONNX conformance tests all fail with E_NOTIMPL.

* Increment DML nuget version

Co-authored-by: Nick Feeney <nickfe@microsoft.com>
Co-authored-by: Dwayne Robinson <dwayner@microsoft.com>
2020-05-11 17:57:01 -07:00
Scott McKay
0cdc1459c6
Add a few more things to the helper python scripts. (#3842)
* Add a few more things to the helper python scripts.
Add documentation with usage examples.

* Address PR comments
2020-05-12 09:59:40 +10:00
stevenlix
28f693a8a1
Update TensorRT dynamic shape profile when input shape changed during runtime (#3904)
* Update dynamic shape range when input shape changed during runtime

* Update tensorrt_execution_provider.cc

* Update tensorrt_execution_provider.cc
2020-05-11 16:11:41 -07:00
Ori Levari
6d2d927809
add test to api tests to ensure correct hresult is returned for corrupt model file (#3796)
Co-authored-by: Ori Levari <orlevari@microsoft.com>
2020-05-11 13:22:22 -07:00
Sheil Kumar
c7da194313
remove winrt (#3899)
Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2020-05-11 10:47:07 -07:00
Prabhat
ce3678ffaf
Added aarch64 build pipeline (#3841)
* Added aarch64 build pipeline

* Fix build error

* Remove auditwheel repair which doesn't work with cross compiling

* Statically link C++

* Added auditwheel repair back and fix stdlib.h

* Remove extra space
2020-05-11 22:56:16 +05:30
Scott McKay
d7e39569da
Use MlasComputeSoftmax in traditional ML ops (#3892)
* Use MlasSoftmax in ML ops

* Refine when mlas is used based on perf testing.
2020-05-11 16:28:59 +10:00
Hariharan Seshadri
06985a9922
Stop proceeding with constant folding if a CPU kernel is not found found (#3836)
* Stop proceeding with constant folding if a CPU kernel is not found for a node

* Fix build

* PR feedback

* Fix typo

* Refine

* Remove unnecessary header inclusion

* Refine

* Fix build

* More changes

* More changes

* More changes

* Fix CentOS build
2020-05-10 19:08:37 -07:00
Ryan Lai
7fd2c8f9e8
Add signed GPU nuget package to publish ort-nightly nuget feed (#3834)
* Add signed nuget package to publish ort-nightly nuget feed

* Push managed nuget as well

* Indentation fix

* Indentation fix

* Update gpu.yml to also publish directml nuget

* Fix typo in naming of task
2020-05-10 16:24:45 -07:00