onnxruntime/orttraining
ashbhandare bfbcc89db1
Add MLFloat16 support for SoftmaxCrossEntropyLoss for CUDA EP (#7679)
* Forward op changes

* Add tests, improve kernel

* add opset 13 registration, remove unnecessary changes

* Add fp16 grad for SCELoss, review comments
2021-05-14 09:00:27 -07:00
orttraining Add MLFloat16 support for SoftmaxCrossEntropyLoss for CUDA EP (#7679) 2021-05-14 09:00:27 -07:00
pytorch_frontend_examples Sync ORTModule branch with master and fix tests (#6526) 2021-02-02 08:59:56 -08:00
tools Add BERT-L perf regression test on MI100 and re-enable batch size test (#7240) 2021-04-05 15:51:52 -07:00