stable-baselines3

mirror of https://github.com/saymrwulf/stable-baselines3.git synced 2026-07-21 19:19:00 +00:00

Antonin RAFFIN 507ed1762e Multiprocessing support for off policy algorithms (#439)	2021-12-01 22:30:09 +01:00
* Add multi-env training support for SAC
* Fix for dict obs
* Pytype fixes
* Fix assert on number of envs
* Remove for loop
* Add support for Dict obs
* Start cleanup
* Update doc and bug fix
* Add support for vectorized action noise and add multi env example for off-policy
* Update version
* Bug fix with VecNormalize
* Update README table
* Update variable names
* Update changelog and version
* Update doc and fix for `gradient_steps=-1`
* Add test for `gradient_steps=-1`
* Disable pytype pyi errors
* Fix for DQN
* Update comment on deepcopy
* Remove episode_reward field
* Fix RolloutReturn
* Avoid modification by reference
* Fix error message
Co-authored-by: Anssi <kaneran21@hotmail.com>
__init__.py	Dictionary Observations (#243)	2021-05-11 12:29:30 +02:00
policies.py	Avoid putting target networks into training mode (#553)	2021-08-30 17:42:41 +02:00
td3.py	Multiprocessing support for off policy algorithms (#439)	2021-12-01 22:30:09 +01:00