onnxruntime/onnxruntime
pengwa 6d1eb9509e
Refine gradient accumulation (on device training) (#12363)
* a

(cherry picked from commit 43909cdd6e3daf30a82d584292286806d1172a0b)

* optimize inplace accumulator a bit

* fix inputs

* revert logging

* minor fix

* tune perf and resolve comments

* typo

* fix

* fix tests

* move threshold to constexpr.
2022-07-30 10:24:01 +08:00
..
contrib_ops Eliminate memory allocations per recent profiling (#12225) 2022-07-25 14:14:38 -07:00
core Refine gradient accumulation (on device training) (#12363) 2022-07-30 10:24:01 +08:00
gsl
python Cosmetic fix to AttentionFusion (#12329) 2022-07-27 12:43:50 -07:00
test Refine gradient accumulation (on device training) (#12363) 2022-07-30 10:24:01 +08:00
tool/etw
wasm EP factory creation cleanup and enhancements. (#11798) 2022-06-16 07:01:41 +10:00
__init__.py Bump ort version number (#11948) 2022-07-22 12:55:53 -07:00
ReformatSource.ps1
ReformatSourcePython.bat
VSCodeCoverage.runsettings