stable-baselines3/docs/modules
Anssi 2cd6a4f93b
Match performance with stable-baselines (discrete case) (#110)
* Fix storing correct episode dones

* Fix number of filters in NatureCNN network

* Add TF-like RMSprop for matching performance with sb2

* Remove stuff that was accidentally included

* Reformat

* Clarify variable naming

* Update changelog

* Add comment on RMSprop implementations to A2C

* Add test for RMSpropTFLike

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2020-08-03 22:22:51 +02:00
a2c.rst   Match performance with stable-baselines (discrete case) (#110)  2020-08-03 22:22:51 +02:00
base.rst  Review of code (A2C, PPO and refactoring) (#35)                 2020-06-09 13:54:18 +02:00
ddpg.rst  Implement DDPG (#92)                                            2020-07-16 14:14:22 +02:00
dqn.rst   Implement DQN (#28)                                             2020-06-29 11:16:54 +02:00
ppo.rst   Update ppo.rst (#94)                                            2020-07-10 10:38:35 +02:00
sac.rst   Review of code (A2C, PPO and refactoring) (#35)                 2020-06-09 13:54:18 +02:00
td3.rst   Review of code (A2C, PPO and refactoring) (#35)                 2020-06-09 13:54:18 +02:00