stable-baselines3/stable_baselines3
Costa Huang d2ebd2eeaa
Allow PPO to turn off advantage normalization (#763)
* Allow PPO to turn of advantage normalization

* update changelog

* Add a test case

* Update test and sanity check

* Fix tests

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2022-02-22 15:29:21 +01:00
..
a2c Add timeout handling for on-policy algorithms (#658) 2021-11-16 17:19:16 +01:00
common Pin gym version (#782) 2022-02-21 23:12:54 +01:00
ddpg System info helper (#613) 2021-10-18 10:43:56 +02:00
dqn Remove explict forward calls (#753) 2022-02-06 22:27:12 +02:00
her Dictionary Observations (#243) 2021-05-11 12:29:30 +02:00
ppo Allow PPO to turn off advantage normalization (#763) 2022-02-22 15:29:21 +01:00
sac Remove explict forward calls (#753) 2022-02-06 22:27:12 +02:00
td3 Remove explict forward calls (#753) 2022-02-06 22:27:12 +02:00
__init__.py System info helper (#613) 2021-10-18 10:43:56 +02:00
py.typed Rename to stable-baselines3 2020-05-05 15:02:35 +02:00
version.txt Pin gym version (#782) 2022-02-21 23:12:54 +01:00