stable-baselines3/stable_baselines3/common
Anand Balakrishnan 59af0c1b01
CheckpointCallback can now save replay buffer and VecNormalize (#1030)
* CheckpointCallback now saves replay buffer (if present)

* VecNormalize stats are saved at checkpoints

* Make checkpointing replay buffer and VecNormalize opt-in

* Edit changelog

* Add documentation for new parameters

* Update docs/misc/changelog.rst

* Add documentation for new parameters

* Implement suggested edits

* Reformat code

* Fix git conflict

* Add .pkl suffix to VecNormalize checkpoints

* Add tests for new CheckpointCallback params

* Merge CheckpointCallback tests

* Update test and add helper for checkpoint path

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2022-08-25 10:57:51 +02:00
..
envs Upgrade code to Python 3.7+ syntax using pyupgrade (#887) 2022-04-25 13:01:38 +03:00
sb2_compat Upgrade code to Python 3.7+ syntax using pyupgrade (#887) 2022-04-25 13:01:38 +03:00
vec_env Fix synchronization bug with EvalCallback (#907) 2022-05-08 21:54:34 +03:00
__init__.py Update docs (custom policy, type hints) (#167) 2020-09-29 20:41:14 +03:00
atari_wrappers.py Upgrade code to Python 3.7+ syntax using pyupgrade (#887) 2022-04-25 13:01:38 +03:00
base_class.py Fixed missing verbose parameter passing (#1011) 2022-08-16 13:32:32 +02:00
buffers.py Support for device=auto buffers and set it as default value (#1009) 2022-08-16 17:54:55 +02:00
callbacks.py CheckpointCallback can now save replay buffer and VecNormalize (#1030) 2022-08-25 10:57:51 +02:00
distributions.py Handling multi-dimensional action spaces (#971) 2022-08-06 14:19:20 +02:00
env_checker.py Fix exception cause in base_class.py (#940) 2022-06-21 20:58:02 +01:00
env_util.py Added wrapper_kwargs argument to make_vec_env (#448) 2021-05-23 11:33:34 +02:00
evaluation.py Fix evaluation script for recurrent policies (#678) 2021-11-30 13:49:06 +01:00
logger.py Support hparams logging to tensorboard (#984) 2022-08-22 22:06:54 +02:00
monitor.py Upgrade code to Python 3.7+ syntax using pyupgrade (#887) 2022-04-25 13:01:38 +03:00
noise.py Fix exception cause in base_class.py (#940) 2022-06-21 20:58:02 +01:00
off_policy_algorithm.py Use higher resolution time_ns() and avoid division by zero (#979) 2022-07-25 23:02:53 +02:00
on_policy_algorithm.py Use higher resolution time_ns() and avoid division by zero (#979) 2022-07-25 23:02:53 +02:00
policies.py Handling multi-dimensional action spaces (#971) 2022-08-06 14:19:20 +02:00
preprocessing.py Documentation update (#450) 2021-05-23 13:13:11 +02:00
results_plotter.py Fix default arguments + add bugbear (#363) 2021-03-25 11:35:21 +02:00
running_mean_std.py Upgrade code to Python 3.7+ syntax using pyupgrade (#887) 2022-04-25 13:01:38 +03:00
save_util.py Fix exception cause in base_class.py (#940) 2022-06-21 20:58:02 +01:00
torch_layers.py Replace "nature" with "Nature" (magazine) to reduce confusion (#965) 2022-07-15 22:48:27 +02:00
type_aliases.py Fix DictReplayBuffer.next_observations type (#1013) 2022-08-16 10:53:22 +02:00
utils.py Include running_mean and running_val when updating target networks (#1004) 2022-08-23 10:20:43 +02:00