Commit graph

3484 commits

Author SHA1 Message Date
Ashwini Khade
ce49cfa67c
add support for configurable build dir when building nuget packages (#5352)
* add support for configurable build dir when building nuget packages

* rename vars
2020-10-02 09:31:35 -07:00
Changming Sun
f265834c2c
Exclude GPT2_LM_HEAD from OpenVino's model test list (#5356)
GPT2_LM_HEAD is a new ONNX model zoo model that OpenVino doesn't support.

Error message:1: [ONNXRuntimeError] : 6 : RUNTIME_EXCEPTION : Non-zero status code returned while running OpenVINO-EP-subgraph_1162 node. Name:'OpenVINOExecutionProvider_OpenVINO-EP-subgraph_1162_1' Status Message: _Map_base::at
2020-10-01 21:49:45 -07:00
Sunghoon
1612934f72
Allow protobuf format of input data for performance test (#5323)
* Allow protobuf format of input data like onnxruntime_perf_tool

* Add OnnxML.cs to fix build failure
2020-10-01 21:40:29 -07:00
Yufeng Li
e8b9aa1f29 fix quantization of EmbeddingLayerNorm (#5321) 2020-10-01 20:08:43 -07:00
KeDengMS
7495dc167a
Symbolic shape inference: fix a bug in auto_merge when broadcasting (#5349)
The bug happens when merging following shapes:

input0: [1, 1, 'Min(1024, input1_dynamic_axes_3)', 'Min(1024, input1_dynamic_axes_3)']
input1: ['input1_dynamic_axes_1*input1_dynamic_axes_2', 12, 'input1_dynamic_axes_3', 'input1_dynamic_axes_3']
input2: []

The fix is to avoid broadcasting merge on input2
2020-10-01 15:24:00 -07:00
Ye Wang
caed6c264c
Add tf2pytorch wrapper in transformers tool (#5316)
* init checkin

* format

* refactor

* review comments
2020-10-01 13:58:58 -07:00
edgchen1
d62873a331
Docker image release build updates (#5326)
- Update docker image release build to use build commit.
- Use valid default in component governance detection step.
- Use smaller docker build context.
2020-10-01 12:25:31 -07:00
liqunfu
fe50213491
Liqun/bert pretrain2 (#5327)
* bert single node multi GPU pretrain w/o checkpoint

Co-authored-by: liqun <liqun@OrtTrainingDev4.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>
2020-10-01 11:01:26 -07:00
Brian Martin
1cad3e322e
typo in contributing.md (#5340)
there's a missing space between two words.
2020-10-01 10:23:08 -07:00
Guoyu Wang
2098d621a6
Make some string optional for save to/load from flatbuffers (#5331)
* Update how to save and load string using flatbuffers and ort_format_only_test

* Add some comments

* Address PR comments
2020-10-01 09:24:37 -07:00
Hariharan Seshadri
383b1e207c
Fix bug in the Resize operator kernels (#5303) 2020-09-30 15:33:33 -07:00
Ashwini Khade
3f00b8db8f
move all experimental ops to version 1 of ms domain (#5287)
* move all experimental ops to version 1 of ms domain

* deprecate TransposeMatMul in favor of FusedMatMul

* update documentation
2020-09-30 14:50:18 -07:00
edgchen1
2c32309e2c
Update dockerfiles/README.md onnxruntime-training image tags. (#5333) 2020-09-30 14:35:38 -07:00
Sherlock
37445d1198
Update Bert Perf Script (#5339)
Co-authored-by: Sherlock Huang <bahuang@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>
2020-09-30 14:30:20 -07:00
Changming Sun
8d4740b39c
Add some log for the GetFileLength function (#5330) 2020-09-30 10:39:42 -07:00
Faith Xu
cb57c100e6
Doc updates for 1.5 (#5302)
* Fix Windows AI version

* Update text to extend telemetry coverage 

Includes all official binaries

* Update text about EP pluggability

* Update CUDA/cuDNN versions

* Add link to reduce operator kernel page

* Update roadmap

* Add preview for migraphx

* Move Rockchip under IoT/Edge

* Update text to include ORT for Mobile doc link
2020-09-30 09:53:33 -07:00
Tim Harris
69dbaaa015
Add additional test cases to check for leaks in thread pool creation / destruction (#5311)
Add additional test cases such as ThreadPoolTest.TestPoolCreation_10Iter to create and destroy thread pools to watch for any memory leaks. Running under Valgrind, these tests should show all of the data allocated being deallocated again.

Two recent issues #5176 and #5292 indicated memory leaks. The test cases help identify whether or not any of the data structures used in the thread pool are being leaked. Currently, on WSL, the only data not being de-allocated in these tests are a small number of nsync waiter objects. This behavior is as expected (the waiter objects should be held on a free list in the nsync library).
2020-09-30 11:26:02 +01:00
Ye Wang
1a12f510fc
Support T5 benchmarking in transformers tool (#5133)
* init checkin

* review comments

* modify according to transformers release
2020-09-29 22:58:28 -07:00
Sherlock
9ec1ed42a8
Enable BiasDropoutFusion for CUDA EP only (#5324)
Co-authored-by: Sherlock Huang <bahuang@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>
2020-09-29 14:00:15 -07:00
Wenbing Li
ed102e9d88
Add iOS test pipeline and a sample app. (#5298)
* Add iOS test pipeline and a sample app.

* clean up the unused code.

* clean up.

* revert the unknown change

* disable the shared library for iOS.

* add open source notice text.

* ignore the skipped test.

* extract the common ortenv setup
2020-09-29 13:53:11 -07:00
Tracy Sharpe
f07059ccc0
Add weight prepacking to LSTM kernel (#5305) 2020-09-29 13:33:38 -07:00
Sherlock
11c194ce29
Minor fix for ComputeBroadcastBackwardAxesDynamic; Fix for GradientGraphBuilder logging (#5313)
Co-authored-by: Sherlock Huang <bahuang@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>
2020-09-29 09:49:05 -07:00
liqunfu
24d8b1bf42
to skip an unstable test to unblock release (#5314)
Co-authored-by: liqun <liqun@OrtTrainingDev4.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>
2020-09-28 22:30:11 -07:00
Hariharan Seshadri
cb83097632
Cosmetic change in non tensor tests (#5317) 2020-09-28 21:23:30 -07:00
Scott McKay
1ff3b2d5b8
Add ability to generate multiple test dirs so that different input mixes can be tested. (#5310) 2020-09-29 12:55:15 +10:00
Vincent Wang
eae2473dc1
Scale Op for ReduceMeanGrad. (#5191)
* Scale Op for ReduceMeanGrad

* fix Windows build error

* resove PR comments.

Co-authored-by: Vincent Wang <weicwang@microsoft.com>
2020-09-29 09:30:49 +08:00
Vincent Wang
506060dc37
Remove Useless Cast from Contiguous Cast Nodes (#5204)
* remove useless cast

* move the optimization to cast transformer

* bugfix

* resolve comments

* fix comment

Co-authored-by: Vincent Wang <weicwang@microsoft.com>
2020-09-29 09:18:52 +08:00
Changming Sun
d45d68fdd4
Fix a memory leak in our testing code (#5312) 2020-09-28 16:00:57 -07:00
Scott McKay
3693f91218
Update doc to be explicit about backwards compatibility. (#5309) 2020-09-29 07:34:49 +10:00
ytaous
b18a8bc74f
Transpose kernel fix for illegal memory access error (#5294)
* transpose fix

* minor update per comments

Co-authored-by: Ethan Tao <ettao@microsoft.com>
2020-09-28 13:59:50 -07:00
Changming Sun
1a04b8f8b7
Add valgrind support to our cmake files (#5296) 2020-09-28 09:31:08 -07:00
Guoyu Wang
fec890a09a
fix build break (#5306) 2020-09-28 00:10:48 -07:00
RRRachelllll555
507f5bf5f6
Update test calibrate script (#5185)
* update test_calibrate according to latest calibrate.py

* fix datasize bug in e2e example

Co-authored-by: t-yguo <t-yguo@microsoft.com>
2020-09-27 21:59:56 -07:00
Tang, Cheng
d9ecc0cebf
add bert loss legacy back (#5224) 2020-09-27 13:41:16 -07:00
George Wu
16d35266ab
add install targets for ep shared libs (#5286) 2020-09-25 07:10:43 -07:00
Guoyu Wang
3a3f26f38e
Move ort flatbuffers helper functions and value info r/w functions into separated lib (#5276)
* Move fbs include from header to cc

* add initial cmake for flatbuffers

* Move most flatbuffers util to ort_flatbuffers

* move code around

* fix

* move test/perf runner to use flatbuffer directly instead of model

* minor update

* Fix build break

* Clean up includes and foward decl

* Fix traning CI build breaks

* Addressed PR comment, replaced some include with forward decls

* Remove ORT_MUST_USE_RESULT temporarily
2020-09-25 05:36:29 -07:00
Changming Sun
17f1178c2e
Downgrade GCC (#5269)
Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>
2020-09-24 21:14:54 -07:00
Sherlock
b03fb82ab7
Transformer layer-wise Recompute (#4526)
* Build Recomputation Graph

* Make topological sort to run FW nodes first

* Pattern match start and end of transformer layer

* Topological sort with Priority

* Add logger to Gradient Graph Builder

* Use Logger

* Introduce Execution Order
2020-09-24 19:56:32 -07:00
Faith Xu
b6e71200eb
Add additional tutorial links (#5272) 2020-09-24 17:27:58 -07:00
Dmitri Smirnov
89742411ec
Insert telemetry template into GPU build, add telemry build switches. (#5278) 2020-09-24 17:13:09 -07:00
Thiago Crepaldi
ebeeff22dd
Update PyTorch TransformerModel sample (#5275) 2020-09-24 16:28:07 -07:00
Ryan Lai
71b52ad5de
Fix inbox telemetry (#5265)
* ifdef to check if redist or not

* Fix redist telemetry

Co-authored-by: Ryan Lai <ryalai96@gamil.com>
2020-09-24 14:58:07 -07:00
Scott McKay
b49ff6151e
Workaround issue with VS2017 compiler. (#5279)
The definitions for some Eigen classes don't get pulled in leading to errors. Split out the broadcast function creation logic from the functions using std::enable_if to workaround that.
2020-09-25 06:50:14 +10:00
KeDengMS
5a71819be6
Symbolic shape inference: fix a case for concat (#5277)
* Symbolic shape inference: fix a case when concat requires merge multiple dims

* Fix a bug triggered in newer version of sympy
Fix a bug in output data type guessing
2020-09-24 08:16:47 -07:00
Josh Bradley
4ed31ca214
Combine custom logger global threadpools (#4857)
* add custom logger and global threadpools to C and C++ API

* code cleanup and formatting

* reformat code

* tidy up some more code formatting

* remove comment

* fix API break from merging from master

* renamed API function to CreateEnvWithCustomLoggerAndGlobalThreadPools

* rename log variable and apply clang-format
2020-09-24 00:50:26 -07:00
Dwayne Robinson
6ad39819c2
Update DirectML Nuget to 1.3.0 (#5274)
Update to 1.3.0
2020-09-23 22:53:02 -07:00
Dwayne Robinson
a4cb00b91e
Merge pull request #5273 from microsoft/user/dwayner/CmakeLinkerOptFlags
Linker opt flags - fix conflicting CMake linker flags which contradict those needed by the Windows inbox universal CRT
2020-09-23 20:08:21 -07:00
edgchen1
6d5b93b805
Synchronize training dependency versions between Docker image and Python wheel. (#5261)
Synchronize training dependency versions between Docker image and wheel, update docs, refactor build scripts.
2020-09-23 19:03:42 -07:00
Justin Stoecker
56862f4022 Add way to disable additional linker opt flags 2020-09-23 12:56:40 -07:00
Ashwini Khade
16220f3848
Add FusedMatMul contrib op (#5213)
* bug fix transformer

* fuse cpu kernel for transposescalematmul and matmul

* fuse transpose_scale_matmul cpu kernel with matmul

* fix test

* Add FusedMatMul Contrib Op

* fix test

* fix typo

* plus more updates per review
2020-09-23 12:17:50 -07:00