stable-baselines3

mirror of https://github.com/saymrwulf/stable-baselines3.git synced 2026-06-29 03:31:08 +00:00

History

Antonin RAFFIN 507ed1762e Multiprocessing support for off policy algorithms (#439 ) * Add multi-env training support for SAC * Fix for dict obs * Pytype fixes * Fix assert on number of envs * Remove for loop * Add support for Dict obs * Start cleanup * Update doc and bug fix * Add support for vectorized action noise and add multi env example for off-policy * Update version * Bug fix with VecNormalize * Update README table * Update variable names * Update changelog and version * Update doc and fix for `gradient_steps=-1` * Add test for `gradient_steps=-1` * Disable pytype pyi errors * Fix for DQN * Update comment on deepcopy * Remove episode_reward field * Fix RolloutReturn * Avoid modification by reference * Fix error message Co-authored-by: Anssi <kaneran21@hotmail.com>		2021-12-01 22:30:09 +01:00
..
a2c.rst	Docs examples warning - issue #526 (#530 )	2021-08-09 16:23:25 +03:00
base.rst	Review of code (A2C, PPO and refactoring) (#35 )	2020-06-09 13:54:18 +02:00
ddpg.rst	Multiprocessing support for off policy algorithms (#439 )	2021-12-01 22:30:09 +01:00
dqn.rst	Multiprocessing support for off policy algorithms (#439 )	2021-12-01 22:30:09 +01:00
her.rst	Docs examples warning - issue #526 (#530 )	2021-08-09 16:23:25 +03:00
ppo.rst	Doc fix environment mixup (#588 )	2021-09-29 10:16:59 +02:00
sac.rst	Multiprocessing support for off policy algorithms (#439 )	2021-12-01 22:30:09 +01:00
td3.rst	Multiprocessing support for off policy algorithms (#439 )	2021-12-01 22:30:09 +01:00