Commit graph

2557 commits

Author SHA1 Message Date
edgchen1
024b92a970
Use path relative to script location to refer to symbolic_opset10.py from install_deps.sh. (#3975)
Update install_deps.sh to use relative path from script directory to symbolic_opset10.py. This allows install_deps.sh to be called from different working directories.
2020-05-18 13:36:06 -07:00
Adam Pocock
9d2d1eb6f6
[java] Adds a CUDA test (#3956)
* [java] - adding a cuda enabled test.

* Adding --build_java to the windows gpu ci pipeline.

* Removing a stray line from the unit tests that always enabled CUDA for Java.
2020-05-18 12:05:51 -07:00
Hariharan Seshadri
1a183784a8
Fix C# layer in the way it handles sequences (#3965)
* Fix C# layer in the way it handles sequence of tensors

* Revert comment
2020-05-18 11:10:13 -07:00
edgchen1
e259a13f8e
Initial training Python packaging pipeline (#3767)
Add a pipeline to produce training-enabled ORT wheels.
2020-05-18 09:41:00 -07:00
edgchen1
e55f24364a
Disable LTO on Windows training CPU build (#3960)
Disable LTO on Windows training CPU build. Add a parameter to the win-ci-2019.yml build template for enabling LTO with a default value of true.
2020-05-18 09:24:10 -07:00
M. Zeeshan Siddiqui
44731e88bb
Add comments for zero valued normalization factor in SoftmaxCrossEntropyLossGrad CUDA kernel. (#3972) 2020-05-18 09:08:09 -07:00
Scott McKay
fd8ea4e466
Improve handling of symbolic dimensions in the onnxruntime_test.py script. (#3959)
If a symbolic dimension is found allow the user to provide a value, or default to 1.

`python .\onnxruntime_test.py --symbolic_dims batch=1,seqlen=4 onnxruntime\test\testdata\transform\fusion\fast_gelu_use_graph_input.onnx`
2020-05-18 16:51:09 +10:00
Tianlei Wu
523d70f667
Improve Transformer Benchmark for FP16 (#3970)
Disable ORT in offline optimization script (ORT could generate some fused ops (like FusedGemm) which cannot be converted to fp16).
Remove some models from benchmark until we have optimizations for them.
2020-05-17 21:50:45 -07:00
Wei-Sheng Chin
0d11649bb3
Address comments from #3823 and polish code (#3964)
* Address comments from #3823 and polish code

* One line
2020-05-17 14:08:33 -07:00
Prabhat
4ff73d00b0
Fix python pkg permission issue (#3957)
* Fix python pkg permission issue

* Run chown with sudo

* Add workspace clean to arm pipeline

* Run docker as current user
2020-05-17 14:06:55 +05:30
Tianlei Wu
07e9a4c164
Update benchmark to reflect those used in our latest results (#3967)
Update optimizer for GPT2 models exported from PyTorch 1.5.
Update benchmark to use GPT2 models without Past State inputs/outputs
Update bert_perf_test to allow setting omp_num_threads etc to test only one setting
2020-05-16 20:17:40 -07:00
Tianlei Wu
56700bec83
Add example of python code to readme of transformers tools (#3966)
* Use shorter name for tools
* Use optimizer_cli
* Add comments about -i parameter
2020-05-16 20:13:24 -07:00
Tianlei Wu
769c11f217
Update doc for transformers tools (#3963)
* update readme for onnxruntime-tools package
* update license section in benchmark
2020-05-16 11:13:34 -07:00
M. Zeeshan Siddiqui
a296b16719
Prevent divide by zero in CUDA implementation of SoftmaxCrossEntropyLossGrad. (#3962) 2020-05-16 00:33:25 -07:00
KeDengMS
132ce3a561
Fixes for quantizing a BERT from HuggingFace (#3939)
* Fixes for quantizing a BERT from HuggingFace

* Address CR and some other minor fixes
2020-05-15 23:41:29 -07:00
Wei-Sheng Chin
33208c9f6b
Modify Pipeline Facilities to Fix PipeDream Deadlock (#3823)
* Prepare utils for adding Wait's and Record's

* Have a running PipeDream

* Add comments

* Polish comments

* Clean code

* Fix test

* Polish names

* Polish names

* Remove debug headers

* Fix a shape inference bug (not related to pipeline code)

* Fix a warning

* Address some comments

* Address comments

* Only touch consumers of outputs when re-wire edges
2020-05-15 18:27:19 -07:00
edgchen1
999554cc53 CGManifest - add training entries and generate entries for submodules. (#3933)
Add cgmanifest.json entries for training dependencies.
Add script to generate git submodule cgmanifest.json entries.
2020-05-15 13:34:18 -07:00
ytaous
bc441b7e5c
Add cpu/mem usage for perf metrics (#3947)
* add cpu/mem usage

* on comments

* on comments

* renaming

Co-authored-by: Ethan Tao <ettao@microsoft.com>
2020-05-15 12:29:40 -07:00
Tracy Sharpe
be003dbab7
Fix ACL build break (#3952) 2020-05-14 23:14:15 -07:00
Pranav Sharma
47ae9691fd
Fix ordering of APIs. (#3951) 2020-05-14 21:27:46 -07:00
Hariharan Seshadri
9ef376880b
Add test for If node with conditional branches only containing Constant nodes (#3949) 2020-05-14 19:21:40 -07:00
Ryan Lai
38467f8c9a
DirectML Nuget package has different time stamp than Native and Managed Nuget (#3950)
* Fix DirectML nuget creation in Nuget pipeline

* DirectML Nuget package has different timestamp

* remove accidentally changed file
2020-05-14 18:52:08 -07:00
Jeff Bloomfield
e6da5946d1
Update DML Nuget version and DML EP Doc (#3945)
Update DML Nuget version and DML EP Doc
2020-05-14 17:33:46 -07:00
Tianlei Wu
782c6c24b2
Rename bert to transformers (#3946)
* rename folder bert to transformers
* rename bert_model_optimization.py to optimizer.py
* update URL links in notebooks
2020-05-14 15:32:59 -07:00
Zhang Lei
3c4f3d01cd
Implement QLinearLeakyRelu (#3648)
* Implement QLinearRelu and its unit test.
* Add logic to compute table during constructor when all parameters is constant.
* Fix test case rounding result related with rounding mode.
2020-05-14 14:52:55 -07:00
Scott McKay
5e0928a777
Enable running PEP8 on python scripts using flake8 (#3928)
* Enable running PEP8 checks via flake8 as part of the build if flake8 is installed.
Update scripts in \tools and \onnxruntime\python. Excluding \onnxruntime\python\tools which needs a lot more work to be PEP8 compliant. Also excluding orttraining\tools for the same reason.
Install flake8 as part of the static_analysis build task in the Win-CPU CI so the checks are run in one CI build.
Update coding standards doc.
2020-05-15 07:15:06 +10:00
Yulong Wang
39814319b2
[Node.js binding] fix linux build (#3927) 2020-05-14 12:37:32 -07:00
Yufeng Li
50f798dad6
support non-zero zero point for matmulinteger u8s8 (#3883)
* support non-zero zero point for matmulinteger u8s8
2020-05-14 12:02:28 -07:00
Changming Sun
9c989c8dd6
Update build doc for cross-compiling (#3672) 2020-05-14 11:43:06 -07:00
manashgoswami
cab21223b3
Updated TPN for OpenMPI and cleanup (#3932)
* Update README.md

* Update ReleaseManagement.md

* Updated Third Party Notice for training feature

Added Open MPI license
2020-05-14 11:42:44 -07:00
gwang-msft
cba8bdc790
Make some compile change for Android NNAPI provider using DNNLibrary (#3935)
* Change compile settings for NNAPI with DNNLib

* update build.py

* update build readme
2020-05-14 10:53:37 -07:00
Prasanth Pulavarthi
84c108a85e
link to folder instead of READMEs inside folder (#3938)
otherwise hard to find the source code
2020-05-14 07:35:23 -07:00
Ryan Lai
48f69cfbb8
Fix DirectML nuget creation in Nuget pipeline (#3929) 2020-05-13 19:34:38 -07:00
Chi Lo
f380460a9e
Update the build steps to support ORT on Jetson (#3869)
* Update BUILD doc for ARM64 build for TensorRT support on Jetson device

* minor revision

* JetPack 4.4 is in developer preview stage, so we suggest to use JetPack
4.3
2020-05-13 19:30:27 -07:00
ytaous
93eb9bcfde
Add yaml/perf scripts for new perf test pipeline (#3909)
* yaml/perf scripts for new pipeline

* yaml/perf scripts for new pipeline

* remove unused imports

* testing some comments change

* testing some comments change

* testing jdbc

* testing jdbc

* testing jdbc

* exclude pwd from jdbc properties

* exclude pwd from jdbc properties

* namedtuple

* on comments

Co-authored-by: Ethan Tao <ettao@microsoft.com>
2020-05-13 14:15:17 -07:00
Changming Sun
e86214e5c0
Fix the tensorflow performance test (#3847) 2020-05-13 11:52:59 -07:00
Yufeng Li
7c774e967a
support quantization of optimized model with ir<4 (#3853) 2020-05-13 11:16:37 -07:00
Prabhat
25257a661d
Added onnxruntime aarch64 wheel to pypi publishing pipeline (#3903)
* Added onnxruntime aarch64 wheel to pypi publishing pipeline

* Support nightly build flag

* Add support for nightly build
2020-05-13 23:20:29 +05:30
Jeff Bloomfield
1c1685ad2b
Fix error handling in LearningModelSession.cpp (#3920) 2020-05-13 10:13:16 -07:00
Tiago Koji Castro Shibata
385073e1cd
Fix DmlCopyTensor test (#3923)
* Fix heap corruption

* Cleanup
2020-05-13 09:14:55 -07:00
Zhang Lei
eab61e87ce
Fix quantization tool bugs when model nodes have no name. (#3854)
Fix bugs when model nodes have no name.
2020-05-12 20:38:26 -07:00
liqunfu
9b5daa2039
patch torch onnx opset 10 (#3910)
patch pytorch to export onnx nll_loss opset version 10. add mnist test to covert onnx opset version 10.
2020-05-12 18:11:25 -07:00
Ori Levari
7b858d60b0
Various changes for automated downlevel test pipeline (#3901)
Co-authored-by: Ori Levari <orlevari@microsoft.com>
2020-05-12 17:22:47 -07:00
Hariharan Seshadri
3065219cc1
Changes related to the release binaries requiring Visual C++ 2019 runtime (#3871) 2020-05-12 17:07:06 -07:00
Xiang Zhang
bccbdd03f1
User/xianz/enable batch tests (#3914)
* enable batch tests in winml_image_test

* copy batchGroundTruth folder

* skip GPU tests when GPU is unavailable
2020-05-12 15:46:46 -07:00
Dmitri Smirnov
18dc0ec39f
Rework jar by creating os-arch folders (#3849)
Detect os and arch and move the artifacts to a new folder.
 Remove unnecesary jars so we cam focus on those we publish.
 Add signing
 Make signature simlper.
 Fix indent.
 Halt on 32-bit arch.
 Credits: @Craigacp
2020-05-12 15:31:20 -07:00
Hariharan Seshadri
c00945ae81
Build ORT by default for Mac OS X versions 10.12+ (#3626) 2020-05-12 14:43:32 -07:00
Scott McKay
99415f09fe
Fix bug where linear_output_ is not cleared when linear_before_reset is true and no bias input is provided. Requires a batch size of 3 or more to trigger if initial_h is not provided. (#3893) 2020-05-13 07:18:32 +10:00
Andrews548
475ea382f9
Fix ACL EP convolution-activation fusion optimization (#3896)
Co-authored-by: Andrei-Alexandru <andrei-alexandru.avram@nxp.com>
2020-05-12 14:06:48 -07:00
Jeff Bloomfield
f170f31e1d
Extend workaround with input name matching in DML fused graph kernel (#3918) 2020-05-12 14:04:09 -07:00