stable-baselines3

mirror of https://github.com/saymrwulf/stable-baselines3.git synced 2026-07-07 17:15:54 +00:00

History

Anssi 44f8218df0 Review of code (A2C, PPO and refactoring) (#35 ) * Split torch module code into torch_layers file * Updated reference to CNN * Change 'CxWxH' to 'CxHxW', as per common notion * Fix missing import in policies.py * Move PPOPolicy to OnlineActorCriticPolicy * Create OnPolicyRLModel from PPO, and make A2C and PPO inherit * Update A2C optimizer comment * Clean weight init scales for clarity * Fix A2C log_interval default parameter * Rename 'progress' to 'progress_remaining * Rename 'Models' to 'Algorithms' * Rename 'OnlineActorCriticPolicy' to 'ActorCriticPolicy' * Move static functions out from BaseAlgorithm * Move on/off_policy base algorithms to their own files * Add files for A2C/PPO * Fix docs * Fix pytype * Update documentation on OnPolicyAlgorithm * Add proper doctstring for on_policy rollout gathering * Add bit clarification on the mlppolicy/cnnpolicy naming * Move static function is_vectorized_policies to utils.py * Checking docstrings, pep8 fixes * Update changelog * Clean changelog * Remove policy warnings for sac/td3 * Add monitor_wrapper for OnPolicyAlgorithm. Clean tb logging variables. Add parameter keywords to OffPolicyAlgorithm super init Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>		2020-06-09 13:54:18 +02:00
..
algos.rst	Support for MultiBinary / MultiDiscrete spaces (#13 )	2020-05-18 14:42:13 +02:00
callbacks.rst	Review of code (A2C, PPO and refactoring) (#35 )	2020-06-09 13:54:18 +02:00
checking_nan.rst	Add base doc	2020-05-07 10:10:51 +02:00
custom_env.rst	Add base doc	2020-05-07 10:10:51 +02:00
custom_policy.rst	More doc + sync VecEnvs + atari	2020-05-07 16:08:23 +02:00
developer.rst	Review of code (A2C, PPO and refactoring) (#35 )	2020-06-09 13:54:18 +02:00
examples.rst	Update doc	2020-05-19 10:40:52 +02:00
install.rst	Tensorboard integration (#30 )	2020-06-01 11:55:44 +02:00
migration.rst	Add base doc	2020-05-07 10:10:51 +02:00
quickstart.rst	[ci skip] Simplify quickstart example	2020-05-11 15:32:01 +02:00
rl.rst	Add base doc	2020-05-07 10:10:51 +02:00
rl_tips.rst	Update doc	2020-05-19 10:40:52 +02:00
rl_zoo.rst	Update doc (add rl zoo)	2020-05-08 11:58:43 +02:00
tensorboard.rst	Tensorboard integration (#30 )	2020-06-01 11:55:44 +02:00
vec_envs.rst	Add base doc	2020-05-07 10:10:51 +02:00