stable-baselines3/stable_baselines3/a2c
Anssi 2cd6a4f93b
Match performance with stable-baselines (discrete case) (#110)
* Fix storing correct episode dones

* Fix number of filters in NatureCNN network

* Add TF-like RMSprop for matching performance with sb2

* Remove stuff that was accidentally included

* Reformat

* Clarify variable naming

* Update changelog

* Add comment on RMSprop implementations to A2C

* Add test for RMSpropTFLike

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2020-08-03 22:22:51 +02:00
..
__init__.py Auto-formatting with black and isort (#97) 2020-07-16 16:12:16 +02:00
a2c.py Match performance with stable-baselines (discrete case) (#110) 2020-08-03 22:22:51 +02:00
policies.py Auto-formatting with black and isort (#97) 2020-07-16 16:12:16 +02:00