Mirror of https://github.com/saymrwulf/stable-baselines3.git (synced 2026-05-16 21:10:08 +00:00)
* add kl divergence wrapper
* add test
* update changelog
* black lint
* remove unused import
* Fix ent coef loading for SAC (#429)
* Fix ent coef loading for SAC
* Better fix and add comment
* add 'distribution' to base Distribution class
* add sample test
* revert to plain pytorch implementation
* black reformat
* Update docs/misc/changelog.rst
  Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
* Doc update (custom policy + fix her example) (#436)
* isort and black reformat
* float -> bool tensor
* add sanity test
* more concise kl code
* remove outdated comment
* all -> allclose assertion
  Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
* Fix PyTorch warning
* Update gSDE entropy test
* Update entropy test

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
Co-authored-by: Antonin Raffin <antonin.raffin@dlr.de>

| Name |
|---|
| .. |
| changelog.rst |
| projects.rst |