Commit graph

2894 commits

Author SHA1 Message Date
dependabot[bot]
bbdabc2c48 Bump lodash from 4.17.15 to 4.17.19 in /nodejs
Bumps [lodash](https://github.com/lodash/lodash) from 4.17.15 to 4.17.19.
- [Release notes](https://github.com/lodash/lodash/releases)
- [Commits](https://github.com/lodash/lodash/compare/4.17.15...4.17.19)

Signed-off-by: dependabot[bot] <support@github.com>
2020-07-20 14:24:21 -07:00
Changming Sun
c2c4e6760b
Fix code sign validation errors in nuget and nodejs pipeline (#4527) 2020-07-20 14:18:47 -07:00
RandySheriffH
1c5733ea9d
Silence binscope warnings (#4542)
* slience binscope warnings

* revert change on rpath

Co-authored-by: Randy <Randy@randysmac.attlocal.net>
2020-07-20 14:10:41 -07:00
Changming Sun
9318b3a47f
Fix a typo in C++ sample application (#4556)
Resolve #3406
2020-07-20 09:31:44 -07:00
M. Zeeshan Siddiqui
9d80235607
Reshape inputs for SoftmaxCrossEntropyLoss instead of transposing them. (#4551) 2020-07-20 06:33:40 -07:00
Changming Sun
bc1d197ddf
Re-enable dnnl in CI build (#4544)
* Revert "Temporarily remove dnnl from Linux CI build to unblock the whole team (#4266)"

Previously it fails because it used too much memory.
Now we only run dnnl EP with opset12 models in unit tests, to reduce peak memory usage.
2020-07-19 23:20:03 -07:00
Yulong Wang
0e91e45049
[node.js binding] use official ORT C++ API (#4552) 2020-07-19 22:46:41 -07:00
Tracy Sharpe
08235e1662
add Output() overloads (#4546) 2020-07-19 15:21:12 -07:00
M. Zeeshan Siddiqui
6c950a1df0
Exclude weight related types/shapes from bert loss. (#4548) 2020-07-18 08:27:02 -07:00
Tianlei Wu
b42fe49f56
Add a script to convert gpt2 to onnx (#4541)
* add a script convert_to_onnx.py and refactor benchmark
2020-07-17 23:14:42 -07:00
Yulong Wang
0229a6a929
[C++ API] add SessionOptions::SetLogSeverityLevel() (#4545) 2020-07-17 21:14:41 -07:00
M. Zeeshan Siddiqui
6eb5549cb9
Deprecate TrainableDropout (#4501)
* Deprecate TrainableDropout.

* Add Dropout(12) back into Megatron transformer.

* Remove TrainableDropout from front-end test models.

* Update baseline for front-end tests after converting test models to opset-12.

* Update baseline for front-end tests after converting test models to opset-12.
2020-07-17 13:43:25 -07:00
Yulong Wang
fdc5c308c4
introduce macro ORT_API_MANUAL_INIT in C++ API (#4536)
* introduce macro ORT_API_MANUAL_INIT in C++ API

* resolve comments
2020-07-17 13:23:30 -07:00
Wei-Sheng Chin
21d2728974
Revise pipeline schedule to consider communication ops (#4524)
* Revise pipeline schedule to consider communication ops

* Add test

* Fix warning

* inline some short functions

* Fix warnings

* Rename a class

* Add comment for test

* op renamed to task

* Fix NVTX wrapper's bug
2020-07-17 10:04:56 -07:00
Xueyun Zhu
183098e344
adding concat logic when initial path is empty (#4525)
* concat

* add path_utils

* address feedback

* use string in test

* convert wstring to sting in windows

* address feedback

* address feedback

* fix comment
2020-07-16 23:46:12 -07:00
Hariharan Seshadri
d1f45f9361
Handle nodes with dtype attribute causing diffs in inferred and actual types of NodeArgs due to InsertCastTransformer (#4523) 2020-07-16 21:59:14 -07:00
Tracy Sharpe
8b86c5cdb5
Merge int32/uint32 and int64/uint64 MatMul kernels (#4531) 2020-07-16 21:25:29 -07:00
Sheil Kumar
02aea5d2d4
rename telemetry provider back to Microsoft.Windows.AI.MachineLearning (#4533)
Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2020-07-16 19:51:06 -07:00
Yulong Wang
5086e55a35
Fix condition of running tests in win CI (#4459) 2020-07-16 16:33:30 -07:00
Tiago Koji Castro Shibata
2189c77e5b
static_typename (#4520)
* Use static_typename

* Disable RTTI outside of Release

* Fix unused var

* Add test types

* PR feedback
2020-07-16 16:31:02 -07:00
M. Zeeshan Siddiqui
b43ce2d7ad
Replace loss function in BERT_LOSS with SoftmaxCrossEntropyLoss. (#4509)
* Replace loss function in BERT_LOSS with SoftmaxCrossEntropyLoss.

* Update BERT loss function with correct logit shapes for softmax cross entropy loss.

* fix test and PR comments.
2020-07-16 15:28:24 -07:00
RandySheriffH
76b31d6ce2
fix xcode alerts (#4470)
* fix xcode alerts

* fix comment

* fix comments

* update text

* fix comments

* fix comments

* remove checks on context

Co-authored-by: Randy <Randy@randysmac.attlocal.net>
Co-authored-by: Randy <Randy@randysmac.local>
Co-authored-by: Tracy Sharpe <tracysh@microsoft.com>
2020-07-16 10:20:34 -07:00
Changming Sun
8ada440961
Move model tests to onnxruntime_test_all (#4521)
1. Move model tests to onnxruntime_test_all
2. Publish TestResults of Windows CI build.
2020-07-15 16:46:18 -07:00
Xueyun Zhu
5f188f4cf4
ci fix (#4519) 2020-07-15 12:05:24 -07:00
stevenlix
0ebe2fab51
Refactor TensorRT EP code to better handle dynamic shape subgraphs (#4504)
* build engine in runtime for dynamic shape subgraphs

* Update TensorRT-ExecutionProvider.md

* Update TensorRT-ExecutionProvider.md

* fix build issue

* Add more instructions on how to use engine caching

* add precision to trt node name

* Update tensorrt_execution_provider.cc

* Update tensorrt_execution_provider.cc
2020-07-15 02:35:42 -07:00
gwang-msft
cf92497c16
Nnapi, add auto_pad support for Conv/GlobalAveragePool/AveragePool/GlobalMaxPool/MaxPool operators (#4499)
* Split ComputePadAndOutputShape into ComputePad and ComputeOutputShape

* update NNAPI conv ouput shape compute to use shared ComputeOutputShapec

* move use ptr to use reference for ComputePadAndOutputShape

* nnapi conv support auto_pad

* add logging operator support bt target devices

* update InferOutputShape/ComputePadAndOutputShape/ComputePad to use force_symmetric_auto_padding as param instead of template

* make log op support for target devices optional

* add auto_pad support to pool operators

* ignore GetTargetDevices if using all devices

* fix some typo in padding calculation

* fix a bug of compute padding difference between conv and pool ops

* addressed CR comments, removed NNAPI device logging and move nnapi ep autopad handling into a shared function

* change helper functions to static
2020-07-15 00:21:42 -07:00
edgchen1
34f73fa1aa
Add sudo --preserve-env option to allow environment to go through to docker commands. (#4512) 2020-07-14 18:12:31 -07:00
liqunfu
f721f5f1cd
Liqun/multiple choice (#4480)
* multiple choice runner

* add docker cleanup task to frontent pipeline
2020-07-14 17:57:58 -07:00
Xueyun Zhu
7d96960ec8
support pipeline partition with shared initializer (#4321)
* support bert partition with shared initializer

* address feedback

* address feedback

* address feedback

* add more test

* remove bert-tiny model

* address feedback

* address function comment

* move CreateNodeArg to graph_utils

* rename function name

* rename function name

* fix windows build

* fix windows type conversion warning

* add function comment
2020-07-14 17:21:40 -07:00
edgchen1
1ebe598286
Conditionally compile without std::is_trivially_copyable to satisfy old GCC versions. (#4510) 2020-07-14 16:47:40 -07:00
Sheil Kumar
ee5ca27ae2
Split Microsoft.AI.MachineLearning.nupkg in a NuGet package and symbol NuGet package (#4503)
* add threadpool interface

* generate snupkgs

* include_pdb check

* fix snupkg generation

* Add task to merge snupkgs

* folder exists

* check dir

* revert thread pool stuff

Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2020-07-14 14:52:39 -07:00
Tianlei Wu
25885cf7d0
Add option --torchscript in benchmark_gpt2.py (#4500)
* support TorchScript
* change onnx filename format
* change output name prediction_scores to logits
2020-07-14 11:53:23 -07:00
Tim Harris
a95ae164f7
Create N-1 threads in intra-op pool, given main thread now active (#4493)
Create N-1 threads in a thread pool when configured with intra-op parallelism of N. This ensures we have N active threads, given that the main thread also runs work. To avoid ambiguity on the value returned, rename ThreadPool::NumThreads method to ThreadPool::DegreeOfParallelism, and make corresponding updates in MLAS and operators.
2020-07-14 09:48:50 +01:00
liqunfu
0bff55512e
updated expected values for frontend test to pass frontend e2e pipeline. raise tolerance to reduce future risk of failure (#4497)
* updated expected values for frontend test, raise tol
2020-07-13 19:25:54 -07:00
Dmitri Smirnov
e0eddf502c
Bump version to 1.4.0 (#4496) 2020-07-13 17:09:18 -07:00
Yufeng Li
3d4ac85124
Add quantization benchmark for transformer based model (#4482)
* add support of quantization benchmark
2020-07-13 15:46:23 -07:00
gwang-msft
a3c358fd29
Split the shared ComputePadAndOutputShape into 2 separated functions ComputePad and ComputeOutputShape (#4487)
* Split ComputePadAndOutputShape into ComputePad and ComputeOutputShape

* update NNAPI conv ouput shape compute to use shared ComputeOutputShapec

* move use ptr to use reference for ComputePadAndOutputShape
2020-07-13 15:07:34 -07:00
Tiago Koji Castro Shibata
3441c687b7
Revert "Remove docstrigs if __ONNX_NO_DOC_STRINGS" (#4495)
This reverts commit bb4d331fa7bf1fe8d68b1527dda56e4739c80800.
2020-07-13 14:55:37 -07:00
gwang-msft
5f8f443ac4
Android CI build, test copy, emulator boot improvement (#4481)
* Enable onnxruntime_test_all for NNAPI EP

* switch to use ninja for ANdroid CI

* make android elumator boot faster in android ci

* simplify adb push

* more style change

* more tweaking on android ci

* build.py style update
2020-07-13 14:18:34 -07:00
Dmitri Smirnov
35ee00d888
Pin typing version. (#4490) 2020-07-13 11:48:30 -07:00
Bowen Bao
07455cff28
Support double type for Greater CPU (#4373)
* Add double for Greater

* add double type for Greater

* udpate test according to dtype
2020-07-13 11:25:14 -07:00
Tiago Koji Castro Shibata
f18dee84c2
Remove docstrigs if __ONNX_NO_DOC_STRINGS (#4494) 2020-07-13 11:08:46 -07:00
edgchen1
c71c49aaa0
Make TArray safer to use and update method name for consistency. (#4483)
- make size_ and data_ data members private
- rename GetCapacity() to Capacity() to be consistent (e.g., with Size())
- add static_assert for trivially copyable T because it is copied with memcpy
2020-07-13 09:59:56 -07:00
Sheil Kumar
00706e1502
dont add deps for uwp apps (#4485)
Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2020-07-10 22:11:32 -07:00
Derek Murray
3e48ffd21c
Move AutoPadType to common.h (#4474)
Extracting some common code related to "AutoPadType" from the cpu execution provider into "common.h".

Motivation and Context
* Sharing code with authors of other execution providers that need the same functionality.
* I didn't modify the code in shared_library or dnnl EP to avoid changing their dependency structure, so there is still a redundant copy of the AutoPadType code in there.
2020-07-10 16:40:32 -07:00
Tianlei Wu
e96a829e84
Handle multiple embed nodes in transformer optimizer (#4471)
Handle model with multiple embed nodes:
* update embed layer norm fusion in onnxruntime
* Fix temp model path in optimizer
* Add unit test for model with multiple embed nodes.
* Add unit test for gpt2 fusion with past state and mask
* Add unit test for change input to int32
2020-07-10 15:28:27 -07:00
Ashwini Khade
6a9a9a35be
fix crashes caused by test runner (#4475)
* Fix crashes in test runner

* plus some fixes

* changes per review
2020-07-10 14:04:22 -07:00
Hariharan Seshadri
26ebcfab88
Fix Nuget GPU pipeline (#4462) 2020-07-10 14:02:28 -07:00
gwang-msft
9b4c54bcef
Enable onnxruntime_test_all for NNAPI EP (#4476) 2020-07-10 13:34:44 -07:00
edgchen1
6c7da5e9d3
Optimize CUDA Sum op kernel and refactor CUDA elementwise variadic input op kernels (#4418)
For the special case where all variadic inputs of a kernel are the same shape (i.e. no broadcasting is required) and there are few enough of them, we perform the entire computation in a single kernel. The general implementation (which was previously used for this special case) handles broadcasting by repeatedly invoking a binary kernel on successive inputs.
2020-07-10 10:20:23 -07:00