onnxruntime/orttraining
ashbhandare d4983f83ff
Shape independent gradient builder for ops requiring broadcast (#4586)
* Adding CPU implementation of BroadcastGradientArgs op

Modify to take shape as input instead of tensor

Cleanup

Correct schema

Corrected kernel, added tests, addressed review comments.

Initial change, to add ReduceSumTraining cpu op

cpu support

Initial changes to gradient builder

Non-empty reduction case passing.

Added exception,test for invalid broadcast,addresed review comments.

Initial change, to add ReduceSumTraining cpu op

cpu support

cuda support + more UTs

on comments + UT

no op support for {} axes with new attr - noop_with_empty_axes

Add noop attribute to ReduceSumTraining use

Add testing for no-shape graph, modify AddSub grad builder, logging.:

MulGrad support

Div support

Expand support

Gemm support

MatMul grad change

Transpose Grad change

BiasGeluGrad change.

Fixes after squash

* Remove logging, add specific exception for shape inference error

* fix build

* Review comments

* Review comments

* Fix windows build

Co-authored-by: Ethan Tao <ettao@microsoft.com>
2020-07-28 13:01:07 -07:00
..
orttraining Shape independent gradient builder for ops requiring broadcast (#4586) 2020-07-28 13:01:07 -07:00
pytorch_frontend_examples Remove ORTModel from frontend API (#3825) 2020-05-11 18:20:33 -07:00
tools Update convergence baseline for ci_test. (#4465) 2020-07-09 15:29:36 +08:00