Commit graph

4398 commits

Author SHA1 Message Date
Vincent Wang
91c6a330c0
Add UseCount for External Outputs (#6894)
* add usecount for external outputs

* ut
2021-03-09 17:06:27 +08:00
baijumeswani
f1ade14e44
Assert that the data is on the same device as ORTModule (#6942) 2021-03-08 17:03:28 -08:00
Vincent Wang
56c5620fd2
Disable Materializing Grads (#6822)
* disable materialize grads

* gradient builder bugfix

* fix ut

* fix ut

* resolve comments and bugfix

* add more assert

* disable forward compare for now
2021-03-08 16:56:06 +08:00
Thiago Crepaldi
dfc7c18e31
Introducing TrainingAgent interface to performance training using YieldOp (#6898) 2021-03-05 17:03:46 -08:00
baijumeswani
79f832c682
Separate requirements.txt file for ORTModule pipelines (#6879)
* Move all ORTModule dependency installations to ortmodule subfolder
2021-03-05 14:12:11 -08:00
ytaous
ac4d615553
Enable priority-based execution order as default to support inputs with symbolic/dynamic shape (#6892)
* priority-based exec order

* disable 1 failing test

* fix UT

* more comments

Co-authored-by: Ethan Tao <ettao@OrtTrainingDev4.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>
2021-03-04 22:36:25 -08:00
Sherlock
b429edcd45
Merge pull request #6890 from microsoft/bmeswani/merge_master_onto_ortmodule
Merge master onto ortmodule dev branch
2021-03-03 23:42:50 -08:00
Baiju Meswani
aa93f2e236 move SetOutputMLValue from op_kernel.h to op_kernel_context.h 2021-03-03 20:39:34 -08:00
Baiju Meswani
d5667554e6 Merge branch 'master' of github.com:microsoft/onnxruntime into bmeswani/merge_master_onto_ortmodule 2021-03-03 20:37:29 -08:00
Sherlock
749e6a08a6
Add more asserts for ORTModule forward's correctness (#6887)
* Add more asserts on forward outputs

* Found one more failing case

Co-authored-by: Sherlock Huang <bahuang@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>
2021-03-03 19:57:42 -08:00
baijumeswani
ed1883a97c
Workaround for HTTP Error 403: Forbidden for MNIST dataset (#6885) 2021-03-03 18:59:48 -08:00
Guoyu Wang
fedb68429c
[NNAPI EP] Add per-tensor u8s8 support for Qlinear[Conv/MatMul] (#6818)
* NNAPI Add per-tensor u8s8 support

* Update some comments

* Address CR comments

* Address CR comments
2021-03-03 15:44:49 -08:00
Guoyu Wang
3c5d811e77
[CoreML EP] Add [Average/Max]Pool support (#6870) 2021-03-03 14:32:39 -08:00
Hariharan Seshadri
9a9e741a8c
Support optional inputs/outputs in custom op development (#6727) 2021-03-03 05:59:23 -08:00
jingyanwangms
f22f04a109
Add comment (#6860)
Co-authored-by: Jingyan Wang <jingywa@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>
2021-03-02 18:54:25 -08:00
Faith Xu
6285ee2398
Reroute quantization tool readme to /docs page (#6854) 2021-03-02 13:49:42 -08:00
Ye Wang
9073f7a5c3
support opset13 in embednorm (#6866) 2021-03-02 12:33:40 -08:00
Ryan Hill
0d0eb2c85c
Change OpKernel class to be shared with shared providers (#6837)
In the previous shared providers there aren't many OpKernel classes, and the existing Provider_OpKernel wrapper was fine. With the opposibility of making Cuda a shared provider, having this need to be changed per OpKernel adds a lot of complexity.

It was fairly straightforward to make OpKernel work with shared providers with minimal changes.

In this change, the ONNX_OPERATOR_* macros can also be shared with the shared providers.
2021-03-02 00:53:48 -08:00
Hariharan Seshadri
38796ad451
Refine force CPU fallback logic in the CUDA EP (#6849) 2021-03-01 19:59:07 -08:00
Vincent Wang
4238ce341a
Add External Outputs Flag for YieldOp (#6789)
* add external outputs flag for YieldOp

* use kPreExisting

* add ut for mem_pattern

* fix ut after merge from master
2021-03-02 11:38:18 +08:00
Edward Chen
66df167a73
Add support for op kernel type control required types, require int64 for some ops (#6832)
Adds support for required types to the op kernel type control infrastructure. Required types are always enabled.
Added int64 as a required type for certain ops.
2021-03-01 19:04:29 -08:00
Guoyu Wang
36a44d55ed
Only report Android Baseline binary size for master branch (#6844)
* Only report binary size from master

* update script

* Correct the typo
2021-03-01 15:57:18 -08:00
Guoyu Wang
5cf6606964
[CoreML EP] Add Concat support (#6834)
* [CoreML EP] Add concat support

* Update comments
2021-03-01 13:35:44 -08:00
Sherlock
12edf22f11
Merge pull request #6838 from microsoft/mzs/ortmodule-api-sync-from-master-210226
Sync from master
2021-02-27 12:32:36 -08:00
Tianlei Wu
2d6e10ba00
Update Attention and QAttention to support pruned model (#6819)
* update Attention operator spec to support pruned model
* update Attention and QAttention cpu & cuda kernel
* Fix invalid embed layer norm fusion test models.
2021-02-27 09:50:16 -08:00
Thiago Crepaldi
f71d93ea2b
Enable PyTorch Lightning basic test on CI (#6809) 2021-02-27 09:35:42 -08:00
M. Zeeshan Siddiqui
ca48310d6d Merge branch 'master' of https://github.com/microsoft/onnxruntime into mzs/ortmodule-api-sync-from-master-210226 2021-02-27 04:25:23 +00:00
M. Zeeshan Siddiqui
cb8d8464bc
Do not create compute stream when external CUDA allocator is used. (#6833) 2021-02-26 20:13:02 -08:00
Ye Wang
b4b87ac7a0
update (#6827) 2021-02-26 13:58:41 -08:00
Sergii Dymchenko
059ed1c241
Copy forward signature from PyTorch model. (#6777) 2021-02-26 13:02:13 -08:00
baijumeswani
c1b0cf6d0b
Add pipeline to clear the cache for huggingface transormers models (#6813) 2021-02-26 10:39:22 -08:00
satyajandhyala
355057cf9c
Added RequiredGrad attribute to YieldOp (#6657)
* Added required_grad attribute to YieldOp

* Chagened YieldOp attribute to hold the indices of the required gradient outputs from the count, and removed the code reordering the outputs.

* Changed backward_output_grad_names to a map from backward output gradient name to the corresponding output index.
2021-02-26 10:38:38 -08:00
Pranav Prakash
d5175795d2
Improvements to quantizer: Removed unused qType field, add reshape op (#6179)
* Handle case where bias_name is already quantized

If bias is shared between multiple nodes and we've already quantized it, just return the quantized name from the map

* Remove qType attribute from QuantizedValue and QuantizedInitializer

These are unused (and were incorrectly set in the case of int8 quantization)

* Add Reshape op to quantizer

* Add test for Reshape quant
2021-02-26 10:21:37 -08:00
Surya Siddharth Pemmaraju
3426108739
Fixed issue in python cmake to update wheel package (#6384)
* Fixed issue in python cmake to update wheel package

* Fixes python cmake issue for OV EP

Added post build step for libonnxruntime_providers_openvino
that copies the updated libonnxruntime_providers_openvino.so file
to /onnxruntime/capi directory every time this target is rebuilt.

Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com>

* Removed post_build step from onnxruntime_python.cmake

Now that we have added the post build step to copy
onnxruntime_providers_openvino.so and providers_shared.so
to /onnxruntime/capi directory in onnxruntime_providers.cmake file.
so removing the duplication of the same from here.

Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com>

* Fixed python cmake issue for OpenVINO-EP

->Fixed issue for both Linux and windows

Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com>

Co-authored-by: MaajidKhan <n.maajidkhan@gmail.com>
2021-02-26 06:34:43 -08:00
Sherlock
8a450d523f
Check gradient correctness in the UTs (#6803)
* Check gradient correctness in the UTs

Co-authored-by: Sherlock Huang <bahuang@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>
2021-02-25 13:31:07 -08:00
baijumeswani
fa8a9015bd
Mount hf model cache and use cache for loading hf models (#6810) 2021-02-25 13:30:14 -08:00
Sergii Dymchenko
99ffffbe6a
Remove backward workaround from test. (#6811) 2021-02-25 13:23:46 -08:00
ashbhandare
b05403d877
Clear iobinding outputs (#6774) 2021-02-25 11:50:43 -08:00
Chi Lo
9b3171e95c
Make keepdims to its default value when adding ReduceMin/ReduceMax for quantization calibration (#6788)
* Make keepdims to its default value when adding ReduceMin/ReduceMax

* Fix bug for adding ReduceMin/ReduceMax with keepdims=1
2021-02-25 09:47:59 -08:00
Olivia Jain
db05d53b94
Setup perf in docker and add features (#6582)
* setup scripts to run in docker 
* percent threshold for accuracy 
* branch testing
2021-02-25 09:31:03 -08:00
stevenlix
d5f292ab73
fix issues caused by quantize/calibrate changes (#6802) 2021-02-25 05:41:21 -08:00
Maajid khan
7465673e33
[OpenVINO-EP] Find package changes (#6801)
* Find package changes to cmake

* Removing unwanted code from cmake

Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com>

Co-authored-by: suryasidd <surya.siddharth.pemmaraju@intel.com>
2021-02-25 05:12:57 -08:00
Suffian Khan
8a148e44fb
make ci pipeline also run batch and convergence test (#6798) 2021-02-24 20:18:03 -08:00
Hariharan Seshadri
ab1713f5cc
Fix regression in constant folding optimizer (#6795) 2021-02-24 19:10:14 -08:00
Sherlock
8e200e13fe
Rewrite ORTModule background task coordination (#6700)
* Introduce OrtTasks to replace EventPool

* return run_id to frontend

* pass run_id to backward

* OrtTasks support multiple bg_events

* make message_queue a member of orttask

* Replace MessageQueue with std::promise

* Move status_promise into Task

* Move terminate flag into Task

* Reenable previously disabled UTs

* Add unit tests

* Replace condition variables with std::promise

* Move to CreateBackgroundTask in the main thread

* return status and output in forward_future

* use throw for terminating background thread

* cleanup tasks at destructor

* reenable test_mixed_nnmodule_ortmodules_training

* add mutex for ORTTasks functions

* add mutex for bg_threads

* delay tests before start

* add ut for multi-task common backbone

Co-authored-by: Sherlock Huang <bahuang@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>
2021-02-24 18:00:25 -08:00
baijumeswani
7ce4075bbd
Support nested sequence and mapping types in ORTModule (#6791) 2021-02-24 15:45:56 -08:00
Weixing Zhang
40fa40f3ce
Enable more unit tests for ROCM EP (#6776)
* enable more ops and unit tests for ROCM EP
2021-02-24 15:20:50 -08:00
Thiago Crepaldi
aa5cd37ac8
Refactor device handling and basic support for PyTorch Lightning (#6758) 2021-02-24 14:12:55 -08:00
Tianlei Wu
f4acdb2ecd
Update transformers benchmark for transformers 4.3.* and ORT 1.7 (#6796)
* update benchmark for transformers 4.* and ORT 1.7

* Fix gpt2 onnx conversion for transformers 4.3.*. Add a check of transformer version >= 3.1.

* remove code related to openmp

* update pretrain model list: keep representitive models only
2021-02-24 12:52:35 -08:00
fthielke
71a70ecf6e
Allow 3D ConvTranspose in CUDA execution provider (#6794)
Co-authored-by: Felix Thielke <felix.thielke@mevis.fraunhofer.de>
2021-02-24 11:53:31 -08:00