onnxruntime/orttraining
sabreshao e6a3308db7
Optimize cuComputeGradInput performance. (#7479)
Move the checking of gamma to host and specialize both case through template.
2021-04-28 17:08:31 -07:00
..
orttraining Optimize cuComputeGradInput performance. (#7479) 2021-04-28 17:08:31 -07:00
pytorch_frontend_examples Sync ORTModule branch with master and fix tests (#6526) 2021-02-02 08:59:56 -08:00
tools Add BERT-L perf regression test on MI100 and re-enable batch size test (#7240) 2021-04-05 15:51:52 -07:00