stable-baselines3/stable_baselines3/td3
Antonin RAFFIN 507ed1762e
Multiprocessing support for off policy algorithms (#439)
* Add multi-env training support for SAC

* Fix for dict obs

* Pytype fixes

* Fix assert on number of envs

* Remove for loop

* Add support for Dict obs

* Start cleanup

* Update doc and bug fix

* Add support for vectorized action noise
and add multi env example for off-policy

* Update version

* Bug fix with VecNormalize

* Update README table

* Update variable names

* Update changelog and version

* Update doc and fix for `gradient_steps=-1`

* Add test for `gradient_steps=-1`

* Disable pytype pyi errors

* Fix for DQN

* Update comment on deepcopy

* Remove episode_reward field

* Fix RolloutReturn

* Avoid modification by reference

* Fix error message

Co-authored-by: Anssi <kaneran21@hotmail.com>
2021-12-01 22:30:09 +01:00
..
__init__.py Dictionary Observations (#243) 2021-05-11 12:29:30 +02:00
policies.py Avoid putting target networks into training mode (#553) 2021-08-30 17:42:41 +02:00
td3.py Multiprocessing support for off policy algorithms (#439) 2021-12-01 22:30:09 +01:00