stable-baselines3/stable_baselines3
Nicholas Goldowsky-Dill 1cd6ae42d5
Fix reward of SimpleMultiObsEnv to always be float (#1676)
* Fix reward of SimpleMultiObsEnv to always be float

Previously the reward was sometimes returned as an int.

* changelog

* Update changelog.rst

* Update version.txt

* Fix type annotation

* Fix import

---------

Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>
Co-authored-by: Antonin Raffin <antonin.raffin@ensta.org>
2023-09-16 08:56:04 +02:00
..
a2c Release v2.0.0 (#1571) 2023-06-23 12:21:58 +02:00
common Fix reward of SimpleMultiObsEnv to always be float (#1676) 2023-09-16 08:56:04 +02:00
ddpg Upgrade black formatting (#1310) 2023-02-02 11:58:41 +01:00
dqn Fix to use float64 actions for off policy algorithms (#1572) 2023-07-24 16:38:03 +02:00
her Fixes HER mixed ordering of desired_goal and achieved_goal (#1570) 2023-06-21 16:27:06 +02:00
ppo Release v2.0.0 (#1571) 2023-06-23 12:21:58 +02:00
sac Release v2.0.0 (#1571) 2023-06-23 12:21:58 +02:00
td3 Release v2.0.0 (#1571) 2023-06-23 12:21:58 +02:00
__init__.py Add Gymnasium support (#1327) 2023-04-14 13:13:59 +02:00
py.typed Rename to stable-baselines3 2020-05-05 15:02:35 +02:00
version.txt Fix reward of SimpleMultiObsEnv to always be float (#1676) 2023-09-16 08:56:04 +02:00