Commit graph

2929 commits

Author SHA1 Message Date
Sheil Kumar
efa393e596
WinML should dynamically link against onnxruntime.dll and only system32 for inbox builds (#4615)
* Dynamically link onnxruntime.dll

* fixes

* add preceeding backslash to onnxruntime.dll for inbox builds

* remove /d

* loadlibrary -> loadlibraryex

* use loadlibrary system32 option

Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2020-07-27 09:56:49 -07:00
Sheil Kumar
222fd08f20
DirectML.dll is loaded via LoadLibraryW but should use LoadLibraryExA (#4616)
* create dml device via loadlibraryexa

* add build_INBOX flag to adapter

Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2020-07-25 21:29:46 -07:00
Alisha Sonawalla
1e67fff93c
Add GetStringTensorElement, GetStringTensorElementLength and FillStringTensorElement API (#4374)
Add new string tensor APIs and unit tests
2020-07-24 21:35:46 -07:00
Sheil Kumar
c361a59cff
disable gpu timeouts in winml (#4604)
Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2020-07-24 13:44:44 -07:00
Tiago Koji Castro Shibata
48d969f4bf
Constexpr CreateFeatureValueFromInspectable (#4460) 2020-07-24 13:08:14 -07:00
Hariharan Seshadri
9510f26744
[Python] Support more APIs for the SessionOptions class (#4596) 2020-07-24 12:56:54 -07:00
ytaous
9888c9e944
SplitTraining op to support split as input (#4597)
* SplitTraining op to support split as input

* on comments and minor refactor

Co-authored-by: Ethan Tao <ettao@microsoft.com>
2020-07-24 12:49:19 -07:00
Sherlock
aa328c2c20
Update GratherGard to accumulate in fp32 (#4601) 2020-07-24 10:54:31 -07:00
Yufeng Li
9c75c29403
refine opset version getter (#4602) 2020-07-24 10:34:56 -07:00
ashbhandare
5189530b7b
Create the ConcatTraining op (#4595)
* Working changes for ConcatTraining op

* Refactor to move changes to orttraining

* Fix segfault

* Support -ve axis for shape inferencing

* fix build

Co-authored-by: Ethan Tao <ettao@microsoft.com>
2020-07-24 10:03:58 -07:00
ytaous
d5b98a13c2
Move ReduceSumTraining op under orttraining (#4588)
* move training op under orttraining

* code clean up

* fix build

* on comments

Co-authored-by: Ethan Tao <ettao@microsoft.com>
2020-07-23 21:36:52 -07:00
Ryan Lai
f35ce4677f
add double and uint8_t datatypes (#4603)
Co-authored-by: Ryan Lai <ryalai96@gamil.com>
2020-07-23 21:32:29 -07:00
Tianlei Wu
ace41b8064
Force return_tuple=True to handle transformers breaking change of output format. (#4599) 2020-07-23 11:35:41 -07:00
Tianlei Wu
ea87c0d028
Update Transformer Optimizer documents (#4591)
(1) Add bert-base-cased and gpt2 benchmark results on V100
(2) Update list of supported models.
(3) Add comments to gpt2_helper.
(4) Use IO Binding in test parity by default.
2020-07-23 08:38:39 -07:00
gwang-msft
03ebe33850
[Android NNAPI EP] Add support for LRN/Grouped Conv ops, fix issues where NNAPI will fall back to CPU (#4582)
* add LRN/Grouped Conv Support, minor changes

* better pool ops sdk version requirement

* reduce string comparision for gemm/matmul ops

* fix nnapi fall back to cpu for softmax

* addressed review comments, correct a small error in the code
2020-07-23 00:05:39 -07:00
RRRachelllll555
c5df918744
improve calibration tool (#4561)
* improve calibration tool

* modify calibration interface name

* modify calibration interface name

* refine calibrate and calibrate_user

* refine and add type info

* refine and add type info

* add e2e user example file

* remove unnecessary files

* remote test images no longer needed

* update readme document

Co-authored-by: t-yguo <t-yguo@microsoft.com>
2020-07-22 21:31:49 -07:00
Dmitri Smirnov
1b253d18ef
Updated tags for MCR images (#4574)
* Updated pushed CPU and CUDA tags.

* Add tensorRT, fix typo.

* Add OpenVINO tags. Remove 2020.2 installation instructions for VAD-M.

* Revert instruction changes for V-ADM and update 2020.2 to 2020.3
2020-07-22 19:39:42 -07:00
Xueyun Zhu
e2acb165e9
Add exception check in training_runner when worker runs into error, and misc check on nccl and mpi calls (#4380)
* error check

* fix build warning treated as error
2020-07-22 14:32:19 -07:00
gwang-msft
c2ec3b734b
[Android NNAPI EP] Remove dependency on external JD/DNNLibrary (#4576)
* remove dependency of external jd-dnnlibrary

* remove extra variables not used any more

* update /cgmanifest.json
2020-07-22 14:08:12 -07:00
Pranav Sharma
f0edd074fb
Optimize CreateEnv by not creating the logging manager instance if env instance has already been created. (#4583)
* Optimize CreateEnv by not creating the logging manager instance if env instance has already been created.

* Move creation of logging mgr inside if block
2020-07-22 12:29:55 -07:00
Sheil Kumar
fa6d035090
Create WindowsAI zip files automatically as part of the pipeline (#4584)
* copy rename nupkg to zip as part of build task

* update both symbols and regular package

Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2020-07-22 10:53:47 -07:00
George Wu
6b53a74867
replace invalid sample (#4567) 2020-07-21 23:51:17 -07:00
S. Manohar Karlapalem
e00e2e84d6
[OpenVINO] Update MCR dockerfile with Ubuntu 18.04 package names (#4581)
* Update Ubuntu version codename for install components

* Specify OpenVINO dockerfile's name in documentation

Also install NCS udev rules.
2020-07-21 23:48:24 -07:00
Andrews548
f20afc4991
Update ACL/ArmNN EP (#4571)
* Add BN to ArmNN EP

* Add Concat to ArmNN EP

* ACL logging improvements

* ArmNN logging improvements

* Fallback to CPU for 9x9 convolution in ACL EP

* Fallback to CPU for 9x9 convolution in ArmNN EP

* Enable python support for ACL and ArmNN EPs when compiled with BSP toolchain

* Removed the matmul operator

* Fix conv infer shape function

* Fix provider_names list for armnn

Co-authored-by: Andrei-Alexandru <andrei-alexandru.avram@nxp.com>
2020-07-21 22:25:58 -07:00
Yufeng Li
822b23ff2f
Add support of EmbeddingLayerNorm (#4562) 2020-07-21 21:43:02 -07:00
Hariharan Seshadri
bf78e4d18b
Handle cases which produce an empty output in the MatMul op family (#4573) 2020-07-21 17:57:47 -07:00
Ryan Lai
0b4659c3fe
Populates TensorProto with tensor from protobuf file (#4535)
* Expose load tensor proto from protobuf file function

* Add comment

* Remove use of fstream and use parsefromzerocopystream

* Close file descriptor after finish parsing it

* Close input stream too

* Set Close on delete only, no need to close file descriptor

* Revert "Set Close on delete only, no need to close file descriptor"

This reverts commit 5ba6e3c31b.

* Revert "Close input stream too"

This reverts commit 4564776733.

* Revert "Close file descriptor after finish parsing it"

This reverts commit 846e550c4f.

* Revert "Remove use of fstream and use parsefromzerocopystream"

This reverts commit 25a3117183.
2020-07-21 17:50:15 -07:00
Changming Sun
603f2d1138
Exclude a few OpenVino flaky tests (#4572) 2020-07-21 12:14:34 -07:00
Chi Lo
affdeb53c2
Add Python API for specifying device options. (#4205)
* Add python API for specifying CUDA device id

* Modification for providing session based python api for specifying
device id

* When include header file pybind11/stl.h, conversion between c++
containers and Python list, vector and dict data structure are
automatically enabled.

https://pybind11.readthedocs.io/en/stable/advanced/cast/stl.html#

Therefore, refactor the code for better leverage this advantage.

* Make struct CudaDeviceOptions as default cuda device options

* Implement sess.set_providers(list_of_providers, list_of_provider_option_dicts)

But still stay consistent with existing sess.set_providers(list_of_provider)

* Add cuda provider option default setting

* Add support for setting cuda cuda_mem_limit and arena_extend_strategy.
Also resolved the merge conflict on session.py

* Use python ctypes to call cuda library to help python unittest

* Refine the code with reviewer's suggestions

* Add the capability of getting execution provider's configuration

- Once we introduced the capability to set execution provider's
configuration, it makes sense to add capability of getting ep's configuration.

* Modify the code with reviewer's suggestions.

* Using stoull() and stoul() depends on 32/64-bits architecture.

* Rewrite the testcases for testing setting CUDA device id

Note: We need to make sure every ORT process be run on one CUDA device
at a time.

* Make sure old session object is destroyed by python gc before new
session object is being created

* Move testcases to original onnxruntime_test_python.py

* Fix bugs to pass CI build

* Make it pass CI build (cont.)

* Make it pass CI build (cont.)
2020-07-21 07:28:13 -07:00
M. Zeeshan Siddiqui
e11629d9e4
Revert "Deprecate TrainableDropout (#4501)" (#4564)
This reverts commit 6eb5549cb9.

# Conflicts:
#	orttraining/orttraining/core/graph/training_op_defs.cc
2020-07-21 06:18:42 -07:00
ashbhandare
ab4be8355f
Add the BroadcastGradientArgs op (#4511)
* Adding CPU implementation of BroadcastGradientArgs op

* Modify to take shape as input instead of tensor

* Cleanup

* Correct schema

* Corrected kernel, added tests, addressed review comments.

* Added exception,test for invalid broadcast,addresed review comments.

* Fix mac build error.
2020-07-20 23:59:10 -07:00
ytaous
0008e92b4e
Internal ReduceSum op that accepts axes as input (#4522)
* Initial change, to add ReduceSumTraining cpu op

* cpu support

* cuda support + more UTs

* on comments + UT

* no op support for {} axes with new attr - noop_with_empty_axes

* on comments

* fix build

* on comments

Co-authored-by: aishwarya bhandare <aibhanda@microsoft.com>
Co-authored-by: Ethan Tao <ettao@microsoft.com>
2020-07-20 21:05:00 -07:00
Yufeng Li
e92e0860c8
BERT quantization notebook (#4543)
* BERT quantization notebook

* update notebooks

* more benchmark

* add version info
2020-07-20 18:23:37 -07:00
gwang-msft
ca0dd8246c
NNAPI EP, add quantization support (#4530)
* nnapi quantization work, 1. add SymmPerChannelQuantParams to operand types

* add ways for operand_type to change dimension

* remove per-channel quantization

* Squashed commit of the following:

commit 4857c3a732298c2f96efb61800b7621251d61c1b
Author: gwang <wanggy@outlook.com>
Date:   Tue Jul 14 23:02:28 2020 -0700

    remove per-channel quantization

commit 775e4b2960f531496b8d11eef27d64e5b85c3c66
Author: gwang <wanggy@outlook.com>
Date:   Mon Jul 13 11:54:02 2020 -0700

    add ways for operand_type to change dimension

commit e56a494de67c66f8122d908270fbc2bb17e38423
Author: gwang <wanggy@outlook.com>
Date:   Wed Jul 8 15:18:55 2020 -0700

    nnapi quantization work, 1. add SymmPerChannelQuantParams to operand types

* add support for QuantizeLinear

* add dequantizelinear support

* minor style update

* minor bug fix

* add quantization support for qlinearmatmul, minor issue fix

* add quantized input support, minor bug fix

* fix issues in the qlinearmatmul

* add verify scale and zeropoint for qlinearmatmul

* add test for [de]qunatizelinear ops

* add qlinearconv support

* fixed small issue causing test failure

* fix test exception

* fix for centos test failure

* fix centos test failure

* fix issue causing win-tensorRT ci failure

* addressed comments
2020-07-20 16:43:31 -07:00
Tracy Sharpe
7f9d9557b1
Remove template<T> from RoiAlignBase (#4558) 2020-07-20 14:28:46 -07:00
dependabot[bot]
bbdabc2c48 Bump lodash from 4.17.15 to 4.17.19 in /nodejs
Bumps [lodash](https://github.com/lodash/lodash) from 4.17.15 to 4.17.19.
- [Release notes](https://github.com/lodash/lodash/releases)
- [Commits](https://github.com/lodash/lodash/compare/4.17.15...4.17.19)

Signed-off-by: dependabot[bot] <support@github.com>
2020-07-20 14:24:21 -07:00
Changming Sun
c2c4e6760b
Fix code sign validation errors in nuget and nodejs pipeline (#4527) 2020-07-20 14:18:47 -07:00
RandySheriffH
1c5733ea9d
Silence binscope warnings (#4542)
* slience binscope warnings

* revert change on rpath

Co-authored-by: Randy <Randy@randysmac.attlocal.net>
2020-07-20 14:10:41 -07:00
Changming Sun
9318b3a47f
Fix a typo in C++ sample application (#4556)
Resolve #3406
2020-07-20 09:31:44 -07:00
M. Zeeshan Siddiqui
9d80235607
Reshape inputs for SoftmaxCrossEntropyLoss instead of transposing them. (#4551) 2020-07-20 06:33:40 -07:00
Changming Sun
bc1d197ddf
Re-enable dnnl in CI build (#4544)
* Revert "Temporarily remove dnnl from Linux CI build to unblock the whole team (#4266)"

Previously it fails because it used too much memory.
Now we only run dnnl EP with opset12 models in unit tests, to reduce peak memory usage.
2020-07-19 23:20:03 -07:00
Yulong Wang
0e91e45049
[node.js binding] use official ORT C++ API (#4552) 2020-07-19 22:46:41 -07:00
Tracy Sharpe
08235e1662
add Output() overloads (#4546) 2020-07-19 15:21:12 -07:00
M. Zeeshan Siddiqui
6c950a1df0
Exclude weight related types/shapes from bert loss. (#4548) 2020-07-18 08:27:02 -07:00
Tianlei Wu
b42fe49f56
Add a script to convert gpt2 to onnx (#4541)
* add a script convert_to_onnx.py and refactor benchmark
2020-07-17 23:14:42 -07:00
Yulong Wang
0229a6a929
[C++ API] add SessionOptions::SetLogSeverityLevel() (#4545) 2020-07-17 21:14:41 -07:00
M. Zeeshan Siddiqui
6eb5549cb9
Deprecate TrainableDropout (#4501)
* Deprecate TrainableDropout.

* Add Dropout(12) back into Megatron transformer.

* Remove TrainableDropout from front-end test models.

* Update baseline for front-end tests after converting test models to opset-12.

* Update baseline for front-end tests after converting test models to opset-12.
2020-07-17 13:43:25 -07:00
Yulong Wang
fdc5c308c4
introduce macro ORT_API_MANUAL_INIT in C++ API (#4536)
* introduce macro ORT_API_MANUAL_INIT in C++ API

* resolve comments
2020-07-17 13:23:30 -07:00
Wei-Sheng Chin
21d2728974
Revise pipeline schedule to consider communication ops (#4524)
* Revise pipeline schedule to consider communication ops

* Add test

* Fix warning

* inline some short functions

* Fix warnings

* Rename a class

* Add comment for test

* op renamed to task

* Fix NVTX wrapper's bug
2020-07-17 10:04:56 -07:00
Xueyun Zhu
183098e344
adding concat logic when initial path is empty (#4525)
* concat

* add path_utils

* address feedback

* use string in test

* convert wstring to sting in windows

* address feedback

* address feedback

* fix comment
2020-07-16 23:46:12 -07:00