stable-baselines3

mirror of https://github.com/saymrwulf/stable-baselines3.git synced 2026-05-16 21:10:08 +00:00

Author	SHA1	Message	Date
Antonin RAFFIN	e3875b50a1	Stable-Baselines3 v1.0 (#354) * Bump version and update doc * Fix name * Apply suggestions from code review Co-authored-by: Adam Gleave <adam@gleave.me> * Update docs/index.rst Co-authored-by: Adam Gleave <adam@gleave.me> * Update wording for RL zoo Co-authored-by: Adam Gleave <adam@gleave.me>	2021-03-17 14:20:31 +01:00
Anssi	e2b6f5460f	Avoid transposing channel-first envs (#213) * Add test for channel-first environments * Add support for channel-first envs, including more tests * Update changelog * Run black * Run black, again * Improve NatureCNN error message * Update image checks and FrameStack wrapper * Update tests * Update docs * Run isort * Reformat * Fixes: avoid breaking changes for non-image env * Add additional checks * Update docstring Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>	2020-11-03 12:34:09 +01:00
Anssi	44f8218df0	Review of code (A2C, PPO and refactoring) (#35) * Split torch module code into torch_layers file * Updated reference to CNN * Change 'CxWxH' to 'CxHxW', as per common notion * Fix missing import in policies.py * Move PPOPolicy to OnlineActorCriticPolicy * Create OnPolicyRLModel from PPO, and make A2C and PPO inherit * Update A2C optimizer comment * Clean weight init scales for clarity * Fix A2C log_interval default parameter * Rename 'progress' to 'progress_remaining' * Rename 'Models' to 'Algorithms' * Rename 'OnlineActorCriticPolicy' to 'ActorCriticPolicy' * Move static functions out from BaseAlgorithm * Move on/off_policy base algorithms to their own files * Add files for A2C/PPO * Fix docs * Fix pytype * Update documentation on OnPolicyAlgorithm * Add proper docstring for on_policy rollout gathering * Add bit clarification on the mlppolicy/cnnpolicy naming * Move static function is_vectorized_policies to utils.py * Checking docstrings, pep8 fixes * Update changelog * Clean changelog * Remove policy warnings for sac/td3 * Add monitor_wrapper for OnPolicyAlgorithm. Clean tb logging variables. Add parameter keywords to OffPolicyAlgorithm super init Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>	2020-06-09 13:54:18 +02:00
Antonin RAFFIN	15ff6d47ee	Documentation update and style fixes (#21) * Update doc: add gSDE * Fix codestyle * Remove travis script * Add lint check to gitlab	2020-05-15 13:54:06 +02:00
Antonin RAFFIN	b02afd6ee3	Doc update (#15)	2020-05-11 12:28:43 +02:00
Antonin RAFFIN	f23212e3b2	Add developer guide	2020-05-08 16:20:21 +02:00

6 commits