stable-baselines3/docs/guide
Anssi 44f8218df0
Review of code (A2C, PPO and refactoring) (#35)
* Split torch module code into torch_layers file

* Updated reference to CNN

* Change 'CxWxH' to 'CxHxW', as per common notion

* Fix missing import in policies.py

* Move PPOPolicy to OnlineActorCriticPolicy

* Create OnPolicyRLModel from PPO, and make A2C and PPO inherit

* Update A2C optimizer comment

* Clean weight init scales for clarity

* Fix A2C log_interval default parameter

* Rename 'progress' to 'progress_remaining

* Rename 'Models' to 'Algorithms'

* Rename 'OnlineActorCriticPolicy' to 'ActorCriticPolicy'

* Move static functions out from BaseAlgorithm

* Move on/off_policy base algorithms to their own files

* Add  files for A2C/PPO

* Fix docs

* Fix pytype

* Update documentation on OnPolicyAlgorithm

* Add proper doctstring for on_policy rollout gathering

* Add bit clarification on the mlppolicy/cnnpolicy naming

* Move static function is_vectorized_policies to utils.py

* Checking docstrings, pep8 fixes

* Update changelog

* Clean changelog

* Remove policy warnings for sac/td3

* Add monitor_wrapper for OnPolicyAlgorithm. Clean tb logging variables. Add parameter keywords to OffPolicyAlgorithm super init

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2020-06-09 13:54:18 +02:00
..
algos.rst Support for MultiBinary / MultiDiscrete spaces (#13) 2020-05-18 14:42:13 +02:00
callbacks.rst Review of code (A2C, PPO and refactoring) (#35) 2020-06-09 13:54:18 +02:00
checking_nan.rst Add base doc 2020-05-07 10:10:51 +02:00
custom_env.rst Add base doc 2020-05-07 10:10:51 +02:00
custom_policy.rst More doc + sync VecEnvs + atari 2020-05-07 16:08:23 +02:00
developer.rst Review of code (A2C, PPO and refactoring) (#35) 2020-06-09 13:54:18 +02:00
examples.rst Update doc 2020-05-19 10:40:52 +02:00
install.rst Tensorboard integration (#30) 2020-06-01 11:55:44 +02:00
migration.rst Add base doc 2020-05-07 10:10:51 +02:00
quickstart.rst [ci skip] Simplify quickstart example 2020-05-11 15:32:01 +02:00
rl.rst Add base doc 2020-05-07 10:10:51 +02:00
rl_tips.rst Update doc 2020-05-19 10:40:52 +02:00
rl_zoo.rst Update doc (add rl zoo) 2020-05-08 11:58:43 +02:00
tensorboard.rst Tensorboard integration (#30) 2020-06-01 11:55:44 +02:00
vec_envs.rst Add base doc 2020-05-07 10:10:51 +02:00