mirror of
https://github.com/saymrwulf/onnxruntime.git
synced 2026-05-18 21:21:17 +00:00
* ortmodule v0.2 * use pt module for eval * get user outputs in yield op * pass output grads to yield output without copy * Disable mem_pattern for ORTModule * Avoid allocating output buffer for Yield op * Change to WaitAndReset to avoid overriding signal * remove unnecessory signal/wait at the end of bg thread * Return Session.Run result as a std::future * export model with torch.no_grad() * Handle bg thread's early return in Forward call * Removed duplicated Yield kernel * Silence "CUDA kernel missing log" * Add missing transforms, clear iobinding (#6532) * revert ortmodule.py to a working state first * Apply ortmodule.py change from dev branch * Rename to YieldOp Co-authored-by: Sherlock Huang <bahuang@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net> Co-authored-by: ashbhandare <ash.bhandare@gmail.com> Co-authored-by: Sherlock <baihan.huang@gmail.com> |
||
|---|---|---|
| .. | ||
| onnxruntime/core | ||