onnxruntime/orttraining
Sherlock 4bc17ca04e
CUDA ConvGrad Kernel (#7227)
* ConvGrad CUDA impl

* Set up the test case for Deberta Conv1D

* Add fp16 test

Co-authored-by: Sherlock Huang <bahuang@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>
2021-04-06 22:09:06 -07:00
..
orttraining CUDA ConvGrad Kernel (#7227) 2021-04-06 22:09:06 -07:00
pytorch_frontend_examples Sync ORTModule branch with master and fix tests (#6526) 2021-02-02 08:59:56 -08:00
tools Add BERT-L perf regression test on MI100 and re-enable batch size test (#7240) 2021-04-05 15:51:52 -07:00