Commit graph

4335 commits

Author SHA1 Message Date
Thiago Crepaldi
f71d93ea2b
Enable PyTorch Lightning basic test on CI (#6809) 2021-02-27 09:35:42 -08:00
Sergii Dymchenko
059ed1c241
Copy forward signature from PyTorch model. (#6777) 2021-02-26 13:02:13 -08:00
baijumeswani
c1b0cf6d0b
Add pipeline to clear the cache for huggingface transormers models (#6813) 2021-02-26 10:39:22 -08:00
satyajandhyala
355057cf9c
Added RequiredGrad attribute to YieldOp (#6657)
* Added required_grad attribute to YieldOp

* Chagened YieldOp attribute to hold the indices of the required gradient outputs from the count, and removed the code reordering the outputs.

* Changed backward_output_grad_names to a map from backward output gradient name to the corresponding output index.
2021-02-26 10:38:38 -08:00
Sherlock
8a450d523f
Check gradient correctness in the UTs (#6803)
* Check gradient correctness in the UTs

Co-authored-by: Sherlock Huang <bahuang@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>
2021-02-25 13:31:07 -08:00
baijumeswani
fa8a9015bd
Mount hf model cache and use cache for loading hf models (#6810) 2021-02-25 13:30:14 -08:00
Sergii Dymchenko
99ffffbe6a
Remove backward workaround from test. (#6811) 2021-02-25 13:23:46 -08:00
ashbhandare
b05403d877
Clear iobinding outputs (#6774) 2021-02-25 11:50:43 -08:00
Sherlock
8e200e13fe
Rewrite ORTModule background task coordination (#6700)
* Introduce OrtTasks to replace EventPool

* return run_id to frontend

* pass run_id to backward

* OrtTasks support multiple bg_events

* make message_queue a member of orttask

* Replace MessageQueue with std::promise

* Move status_promise into Task

* Move terminate flag into Task

* Reenable previously disabled UTs

* Add unit tests

* Replace condition variables with std::promise

* Move to CreateBackgroundTask in the main thread

* return status and output in forward_future

* use throw for terminating background thread

* cleanup tasks at destructor

* reenable test_mixed_nnmodule_ortmodules_training

* add mutex for ORTTasks functions

* add mutex for bg_threads

* delay tests before start

* add ut for multi-task common backbone

Co-authored-by: Sherlock Huang <bahuang@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>
2021-02-24 18:00:25 -08:00
baijumeswani
7ce4075bbd
Support nested sequence and mapping types in ORTModule (#6791) 2021-02-24 15:45:56 -08:00
Thiago Crepaldi
aa5cd37ac8
Refactor device handling and basic support for PyTorch Lightning (#6758) 2021-02-24 14:12:55 -08:00
baijumeswani
65ba51d93e
Re-enable test and increase timeout (#6785) 2021-02-23 18:51:06 -08:00
Thiago Crepaldi
563218dcda
Update torchtext usage for pytorch transformer sample (#6767)
* Update torchtext usage for pytorch transformer sample
* Temporarily disable tests to unblock repo (failures are being worked on already)
* Update loss numbers for ORTTrainer UTs
2021-02-23 14:06:35 -08:00
Sergii Dymchenko
58f3aca95d
Support keyword arguments for ORTModule (#6539)
* Support keyword arguments for ORTModule.

* Add backward workaround to the test.

* Specify test name directly without -k.

* Handle unused inputs removed by ONNX exporter.
2021-02-19 13:40:44 -08:00
M. Zeeshan Siddiqui
1a2f1bd23a
Enable external CUDA allocator in ORTModule. (#6745)
* Enable external CUDA allocator in ORTModule.

* Fix assert after unification of allocators.

* Update no grad memory test.

* update comments.

* fix provider options array when not sharing allocator.
2021-02-18 20:01:13 -08:00
ytaous
39d182f7fc
ORTModule - FastGeluFusion/fp16 fix and minor LayerNormFusion cleanup (#6734)
* fastgelu fix

* assert cast outputs

Co-authored-by: Ethan Tao <ettao@OrtTrainingDev4.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>
2021-02-18 13:17:16 -08:00
Thiago Crepaldi
fb3f1f5cc1
Enable custom ops on ORTModule (#6740) 2021-02-18 09:08:10 -08:00
Sherlock
b7b5612159
Merge pull request #6742 from microsoft/mzs/sync-from-master
Sync from master
2021-02-18 00:12:38 -08:00
M. Zeeshan Siddiqui
40dda452cf Merge branch 'master' of https://github.com/microsoft/onnxruntime into mzs/sync-from-master 2021-02-18 03:03:01 +00:00
M. Zeeshan Siddiqui
e44ac6524f
Plug n Allocate with external CUDA allocator via PyBind. (#6679) 2021-02-17 18:59:38 -08:00
liqunfu
dd8ef4409a
Liqun/migrate perf test (#6733)
move ort training perf tests to azure devops
2021-02-17 17:48:47 -08:00
liqunfu
2c5e603bad
Liqun/nuphar nuget (#6656)
create nuphar nuget with correct name
2021-02-17 16:13:07 -08:00
Thiago Crepaldi
21f9e32c60
Merge pull request #6714 from microsoft/thiagofc/merge-from-master
Merge master into thiagofc/ortmodule-api
2021-02-17 16:08:53 -08:00
Ramakrishnan Sivakumar
a5bef6886b
Threading support for Hybrid core architecture (#6728) 2021-02-17 15:35:07 -08:00
Guoyu Wang
6810d98ea3
Update links to gh-pages for ORT minimal documents (#6721)
* Fix broken link in ort minimal docs

* Update link of build.md to gh-pages
2021-02-17 14:34:50 -08:00
Justin Stoecker
af4e5c0c6e
Minor WinML model test skip name change 2021-02-17 14:27:58 -08:00
Maajid khan
b41e9b5d4c
[OpenVINO-EP] Fixes OpenVINO-EP build on windows (#6726)
* Fixes OpenVINO-EP windows build

Openvino EP build is broken on windows. The issue
is wchar_t is UTF-16 on windows while on other platforms
such as Linux and MacOS, wchar_t is UTF-32.

so wide Unicode string has to be converted to an UTF8 string
for sure on windows.

This commit fixes this issue.
2021-02-17 13:49:03 -08:00
Thiago Crepaldi
9d4b730e46 Fix merge leftover 2021-02-17 11:58:06 -08:00
M. Zeeshan Siddiqui
9853ef84f8 Reduce binary size, limit asynchronous/backgroud thread stuff to training only. 2021-02-17 11:51:09 -08:00
M. Zeeshan Siddiqui
5b7e7aaa45 Move event_pool and message_queue to core. 2021-02-17 11:50:56 -08:00
M. Zeeshan Siddiqui
eecce31a8b Fix build, cleanup. 2021-02-17 11:50:41 -08:00
Thiago Crepaldi
3184c47ad1 Merge branch 'master' into thiagofc/merge-from-master 2021-02-17 11:49:52 -08:00
Yulong Wang
9a9202a218
[Node.js binding] update dependency typedoc (#6720) 2021-02-17 10:22:05 -08:00
Changming Sun
0be5475de6
Update packaging pipelines(#6664) 2021-02-17 09:53:36 -08:00
Changming Sun
46c06f6ac7
Change Windows GPU CI pipeline to CUDA11 (#6616) 2021-02-17 09:44:44 -08:00
Changming Sun
eefeacd828
Skip running gpt2 model in C# x86 (#6722) 2021-02-17 09:37:16 -08:00
Derek Murray
b8d5fa812c
Fix typo in README.md (#6713)
Fixes #6710.
2021-02-17 09:29:30 -08:00
Wei-Sheng Chin
9e67b88c83
Use local rank as GPU ID (#6719) 2021-02-17 22:42:54 +08:00
RandySheriffH
9043df8b66
Deprecate OMP from nuget pipeline (release:1.7) (#6671)
* deprecate OMP from nuget

* remove omp build

* remove

* add openmp build

* add variants

* rename package

* move GPU to no omp pipeline

* reset path

* switch to abs path

* reset path

* add cpu package

* remove obsolete name

* set package name

Co-authored-by: Randy Shuai <rashuai@microsoft.com>
2021-02-17 00:03:44 -08:00
Suffian Khan
105883f4b8
remove longformer_global_impl.cu from hipify (#6716) 2021-02-16 22:26:18 -08:00
Hariharan Seshadri
aa2622efb2
Support multiple dynamic inputs in custom ops (#6666) 2021-02-16 20:54:30 -08:00
baijumeswani
01dfa8e125
Support non tuple return values from torch.nn.module (#6660)
* Support dictionary, namedtuples and huffingface ModelOutput type for model return values
2021-02-16 20:48:32 -08:00
Scott McKay
02c7873b0e
Update ORT model conversion script to support custom ops (#6701)
* Add support for custom ops library to the ORT model conversion script
Simplify model conversion now that we read ops from the ORT format model.
Enable custom ops in the python bindings if custom ops are turned on in a minimal build.
* Add test of model conversion involving custom ops.
2021-02-17 12:52:39 +10:00
Thiago Crepaldi
7f33671ade
Handle multiple devices scenarios (#6672)
* Handle multiple devices scenarios
2021-02-16 18:22:30 -08:00
Thiago Crepaldi
7ee5baa60d
Remove monkey patch for PyTorch Nightly + ORTTrainer (#6659) 2021-02-16 17:24:50 -08:00
Tianlei Wu
9b446d5f7e
Longformer Attention CUDA kernel memory Improvements (#6646)
* Integrate memory improvements from NVidia
* compute max_global_num before buffer allocation
* update conversion script to support transformers 4.0
* update benchmark script for creating dummy inputs for different batch_size

* Use a wrapper of cuda event to avoid memory leak
2021-02-16 14:54:48 -08:00
Edward Chen
b09a370218
Address warning in data_types_internal.h (#6704)
Address "unreachable code" warning in data_types_internal.h.
2021-02-16 12:41:48 -08:00
RandySheriffH
c36ee4bd40
Rename Python packaging pipelines (#6682)
* rename pipelines

* resync and rename

* resync master

* rename package id

* remove OrtPackageId which is for nuget

Co-authored-by: Randy Shuai <rashuai@microsoft.com>
2021-02-16 11:43:03 -08:00
RandySheriffH
497eef8d3d
remove omp (#6675)
Co-authored-by: Randy Shuai <rashuai@microsoft.com>
2021-02-16 11:42:32 -08:00
Changming Sun
d48a4c0a54
Add CG step to nuget GPU pipeline (#6678) 2021-02-16 09:46:20 -08:00