stable-baselines3/stable_baselines3/common
Antonin RAFFIN 5ff176b2f1
Implement DDPG (#92)
* Add DDPG + TD3 with any number of critics

* Allow any number of critics for SAC

* Update doc

* [ci skip] Update DDPG example

* Remove unused parameter

* Add DDPG to identity test

* Fix computation with n_critics=1,3

* Update doc

* Apply suggestions from code review

Co-authored-by: Adam Gleave <adam@gleave.me>

* Update docstrings for off-policy algos

* Add check for sde

Co-authored-by: Adam Gleave <adam@gleave.me>
2020-07-16 14:14:22 +02:00
..
vec_env Fixed SubprocVecEnv close. (#68) 2020-06-20 18:01:37 +02:00
__init__.py More doc + sync VecEnvs + atari 2020-05-07 16:08:23 +02:00
atari_wrappers.py Implement DQN (#28) 2020-06-29 11:16:54 +02:00
base_class.py Implement DDPG (#92) 2020-07-16 14:14:22 +02:00
bit_flipping_env.py Sync with Stable-Baselines 2020-05-05 16:28:38 +02:00
buffers.py Implement DQN (#28) 2020-06-29 11:16:54 +02:00
callbacks.py Review of code (A2C, PPO and refactoring) (#35) 2020-06-09 13:54:18 +02:00
cmd_util.py Implement DQN (#28) 2020-06-29 11:16:54 +02:00
distributions.py Address minor issues after clarification by @araffin 2020-07-07 18:39:55 -07:00
env_checker.py Build the doc 2020-05-07 17:35:29 +02:00
evaluation.py Review of code (A2C, PPO and refactoring) (#35) 2020-06-09 13:54:18 +02:00
identity_env.py Add flake8 linter and Github CI (#19) 2020-05-12 17:55:01 +02:00
logger.py Tensorboard integration (#30) 2020-06-01 11:55:44 +02:00
monitor.py Update doc 2020-05-08 13:09:38 +02:00
noise.py Implement DDPG (#92) 2020-07-16 14:14:22 +02:00
off_policy_algorithm.py Implement DDPG (#92) 2020-07-16 14:14:22 +02:00
on_policy_algorithm.py Fix typing, key error 2020-07-02 21:35:06 -07:00
policies.py Ignore errors from new pytype version (#107) 2020-07-16 11:54:37 +02:00
preprocessing.py Support for MultiBinary / MultiDiscrete spaces (#13) 2020-05-18 14:42:13 +02:00
results_plotter.py Rename to stable-baselines3 2020-05-05 15:02:35 +02:00
running_mean_std.py Rename to stable-baselines3 2020-05-05 15:02:35 +02:00
save_util.py Automatically create paths for saved objects (#80) 2020-07-03 01:14:21 +03:00
torch_layers.py Review of code (A2C, PPO and refactoring) (#35) 2020-06-09 13:54:18 +02:00
type_aliases.py Review type aliases 2020-07-02 18:51:10 -07:00
utils.py Implement DQN (#28) 2020-06-29 11:16:54 +02:00