stable-baselines3/docs/modules
Antonin RAFFIN 507ed1762e
Multiprocessing support for off policy algorithms (#439)
* Add multi-env training support for SAC

* Fix for dict obs

* Pytype fixes

* Fix assert on number of envs

* Remove for loop

* Add support for Dict obs

* Start cleanup

* Update doc and bug fix

* Add support for vectorized action noise
and add multi env example for off-policy

* Update version

* Bug fix with VecNormalize

* Update README table

* Update variable names

* Update changelog and version

* Update doc and fix for `gradient_steps=-1`

* Add test for `gradient_steps=-1`

* Disable pytype pyi errors

* Fix for DQN

* Update comment on deepcopy

* Remove episode_reward field

* Fix RolloutReturn

* Avoid modification by reference

* Fix error message

Co-authored-by: Anssi <kaneran21@hotmail.com>
2021-12-01 22:30:09 +01:00
..
a2c.rst Docs examples warning - issue #526 (#530) 2021-08-09 16:23:25 +03:00
base.rst Review of code (A2C, PPO and refactoring) (#35) 2020-06-09 13:54:18 +02:00
ddpg.rst Multiprocessing support for off policy algorithms (#439) 2021-12-01 22:30:09 +01:00
dqn.rst Multiprocessing support for off policy algorithms (#439) 2021-12-01 22:30:09 +01:00
her.rst Docs examples warning - issue #526 (#530) 2021-08-09 16:23:25 +03:00
ppo.rst Doc fix environment mixup (#588) 2021-09-29 10:16:59 +02:00
sac.rst Multiprocessing support for off policy algorithms (#439) 2021-12-01 22:30:09 +01:00
td3.rst Multiprocessing support for off policy algorithms (#439) 2021-12-01 22:30:09 +01:00