stable-baselines3/stable_baselines3
Timo Kaufmann 09e9fc42eb
Use consistent logging keys (#605)
* Use a consistent key to log the total timesteps

This changes the timestep logging key of on-policy algorithms from
`time/total_timesteps` to `time/total timesteps` (note the
underscore/space). The off-policy algorithms and the eval callback
already use the latter, so this behavior is more consistent.

* Use underscores instead of spaces in logging keys

Most keys already followed this policy and consistent behavior is
friendlier to new users.

* Minor edit and bump version

Co-authored-by: Antonin Raffin <antonin.raffin@ensta.org>
2021-10-12 13:17:30 +02:00
..
a2c Avoid putting target networks into training mode (#553) 2021-08-30 17:42:41 +02:00
common Use consistent logging keys (#605) 2021-10-12 13:17:30 +02:00
ddpg Fix type annotations (#522) 2021-07-29 13:02:09 +02:00
dqn Use consistent logging keys (#605) 2021-10-12 13:17:30 +02:00
her Dictionary Observations (#243) 2021-05-11 12:29:30 +02:00
ppo Avoid putting target networks into training mode (#553) 2021-08-30 17:42:41 +02:00
sac Remove sde_net_arch + Simplify policy (#584) 2021-09-28 22:32:54 +03:00
td3 Avoid putting target networks into training mode (#553) 2021-08-30 17:42:41 +02:00
__init__.py Dictionary Observations (#243) 2021-05-11 12:29:30 +02:00
py.typed Rename to stable-baselines3 2020-05-05 15:02:35 +02:00
version.txt Use consistent logging keys (#605) 2021-10-12 13:17:30 +02:00