stable-baselines3

mirror of https://github.com/saymrwulf/stable-baselines3.git synced 2026-05-18 21:30:19 +00:00

Author	SHA1	Message	Date
Quentin Gallouédec	d5d1a02c15	Allow model trained with python3.7 to be loaded with python3.8+ without the `custom_objects` workaround (#1123 ) * Fix loading * Remove documentation note * Update changelog * Revert save_format change * Add test for errors while unpickling * Update version and cleanup Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>	2022-10-17 17:33:47 +02:00
Quentin Gallouédec	98e786f744	Clarify and standardize verbosity documentation (#1056 ) * Standardize the use of verbosity: > to >= * Make verbose docstring more specific * Update changelog	2022-09-09 16:46:28 +02:00
Ram Rachum	d64bcb401a	Fix exception cause in base_class.py (#940 )	2022-06-21 20:58:02 +01:00
Yifei Cheng	44e53ff811	Enable force_zip64 (#839 ) * Enable force_zip64 * mark tests as expensive * Update changelog Co-authored-by: Antonin Raffin <antonin.raffin@ensta.org>	2022-03-28 10:35:33 +02:00
Antonin RAFFIN	1564a85081	System info helper (#613 ) * Add `system_env_info` * Add `print_system_info` to load and store system info at save time * Remove TODO * Rename to `get_system_info` * Import as sb3 for consistency * Update changelog * Add warning for old SB3 versions * Use underscore litteral for more clarity	2021-10-18 10:43:56 +02:00
Antonin RAFFIN	be86883f36	Fix type annotations (#522 ) * Fix type annotations * Add citation file * Update CITATION.cff * Add note about tb logging Co-authored-by: Anssi <kaneran21@hotmail.com>	2021-07-29 13:02:09 +02:00
Antonin RAFFIN	c62e9259db	Add custom objects support + bug fix (#336 ) * Add support for custom objects * Add python 3.8 to the CI * Bump version * PyType fixes * [ci skip] Fix typo * Add note about slow-down + fix typos * Minor edits to the doc * Bug fix for DQN * Update test * Add test for custom objects	2021-03-06 15:17:43 +02:00
Antonin RAFFIN	723b341c61	Fix for saving big replay buffer, use pickle protocol>=4 (#239 )	2020-11-24 16:13:00 +02:00
Antonin RAFFIN	a1e055695c	Improve typing coverage (#175 ) * Improve typing coverage * Even more types * Fixes * Update changelog * Unified docstrings * Improve error messages for unsupported spaces	2020-10-07 10:51:49 +02:00
Antonin RAFFIN	55912576ed	Cleanup docstring types (#169 ) * Cleanup docstring types * Update style * Test with js hack * Revert "Test with js hack" This reverts commit d091f438e8851ab8d01b66628e06a104f5e5ec69. * Fix types * Fix typo * Update CONTRIBUTING example	2020-10-02 20:05:55 +03:00
Antonin RAFFIN	44a723eecb	Fix loading of old versions and update changelog (#165 )	2020-09-24 16:05:36 +02:00
Anssi	9855486488	Get/set parameters and review of saving and loading (#138 ) * Update comments and docstrings * Rename get_torch_variables to private and update docs * Clarify documentation on data, params and tensors * Make excluded_save_params private and update docs * Update get_torch_variable_names to get_torch_save_params for description * Simplify saving code and update docs on params vs tensors * Rename saved item tensors to pytorch_variables for clarity * Reformat * Fix a typo * Add get/set_parameters, update tests accordingly * Use f-strings for formatting * Fix load docstring * Reorganize functions in BaseClass * Update changelog * Add library version to the stored models * Actually run isort this time * Fix flake8 complaints and also fix testing code * Fix isort * ...and black * Fix set_random_seed Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org> Co-authored-by: Antonin Raffin <antonin.raffin@dlr.de>	2020-09-24 14:28:27 +02:00
liorcohen5	f5104a5efc	Allow to set a device when loading a model (#154 ) * Added a 'device' keyword argument to BaseAlgorithm.load(). Edited the save and load test to also test the load method with all possible devices. Added the changes to the changelog * improved the load test to ensure that the model loads to the correct device. * improved the test: now the correctness is improved. If the get_device policy would change, it wouldn't break the test. * Update tests/test_save_load.py @araffin's suggestion during the PR process Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org> * Update tests/test_save_load.py Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org> * Bug fixes: when comparing devices, comparing only device type since get_device() doesn't provide device index. Now the code loads all of the model parameters from the saved state dict straight into the required device. (fixed load_from_zip_file). * PR fixes: bug fix - a non-related test failed when running on GPU. updated the assertion to consider only types of devices. Also corrected a related bug in 'get_device()' method. * Update changelog.rst Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>	2020-09-20 19:13:18 +02:00
Antonin RAFFIN	15d32c6a4a	Update black version + update docker image (#151 ) * Update docker image * Update black and reformat	2020-08-27 23:02:59 +02:00
Antonin RAFFIN	23afedb254	Auto-formatting with black and isort (#97 ) * Add auto formatting with black and isort * Reformat code * Ignore typing errors * Add note about line length * Add minimum version for isort * Add commit-checks * Update docker image * Fixed lost import (during last merge) * Fix opencv dependency	2020-07-16 16:12:16 +02:00
Stelios Tymvios	4aa66ed34a	Automatically create paths for saved objects (#80 ) * automatically create paths for saved objects * Minor Corrections, more tests * linting * typing * Correct mode checking * corrected tests to reflect new verbose functionality	2020-07-03 01:14:21 +03:00
Anssi	44f8218df0	Review of code (A2C, PPO and refactoring) (#35 ) * Split torch module code into torch_layers file * Updated reference to CNN * Change 'CxWxH' to 'CxHxW', as per common notion * Fix missing import in policies.py * Move PPOPolicy to OnlineActorCriticPolicy * Create OnPolicyRLModel from PPO, and make A2C and PPO inherit * Update A2C optimizer comment * Clean weight init scales for clarity * Fix A2C log_interval default parameter * Rename 'progress' to 'progress_remaining * Rename 'Models' to 'Algorithms' * Rename 'OnlineActorCriticPolicy' to 'ActorCriticPolicy' * Move static functions out from BaseAlgorithm * Move on/off_policy base algorithms to their own files * Add files for A2C/PPO * Fix docs * Fix pytype * Update documentation on OnPolicyAlgorithm * Add proper doctstring for on_policy rollout gathering * Add bit clarification on the mlppolicy/cnnpolicy naming * Move static function is_vectorized_policies to utils.py * Checking docstrings, pep8 fixes * Update changelog * Clean changelog * Remove policy warnings for sac/td3 * Add monitor_wrapper for OnPolicyAlgorithm. Clean tb logging variables. Add parameter keywords to OffPolicyAlgorithm super init Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>	2020-06-09 13:54:18 +02:00
Antonin RAFFIN	d542732c8d	Rename to stable-baselines3	2020-05-05 15:02:35 +02:00

18 commits