mirror of
https://github.com/saymrwulf/onnxruntime.git
synced 2026-06-16 01:33:39 +00:00
* ortmodule v0.2 * use pt module for eval * get user outputs in yield op * pass output grads to yield output without copy * Disable mem_pattern for ORTModule * Avoid allocating output buffer for Yield op * Change to WaitAndReset to avoid overriding signal * remove unnecessory signal/wait at the end of bg thread * Return Session.Run result as a std::future * export model with torch.no_grad() * Handle bg thread's early return in Forward call * Removed duplicated Yield kernel * Silence "CUDA kernel missing log" * Add missing transforms, clear iobinding (#6532) * revert ortmodule.py to a working state first * Apply ortmodule.py change from dev branch * Rename to YieldOp Co-authored-by: Sherlock Huang <bahuang@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net> Co-authored-by: ashbhandare <ash.bhandare@gmail.com> Co-authored-by: Sherlock <baihan.huang@gmail.com> |
||
|---|---|---|
| .. | ||
| backend | ||
| datasets | ||
| tools | ||
| training | ||
| __init__.py | ||
| _ld_preload.py | ||
| _pybind_state.py | ||
| dlpack.h | ||
| dlpack_convertor.cc | ||
| dlpack_convertor.h | ||
| exported_symbols.lst | ||
| onnxruntime_inference_collection.py | ||
| onnxruntime_pybind_exceptions.h | ||
| onnxruntime_pybind_mlvalue.cc | ||
| onnxruntime_pybind_mlvalue.h | ||
| onnxruntime_pybind_state.cc | ||
| onnxruntime_pybind_state_common.cc | ||
| onnxruntime_pybind_state_common.h | ||
| onnxruntime_validation.py | ||
| pybind.def | ||
| version_script.lds | ||