stable-baselines3/docs/guide
Quentin Gallouédec c5adad82b2
Multiprocessing support for HerReplayBuffer (#704)
* IM compat. modif from old fork

* mp her working, without offline sampling

* update readme and doc

* fix discrete action/obs space case

* handle offline sampling

* fix pos to be consistent with the old version

* improve typing and docstring

* fix discrete obs special case

* new her, using episode uid

* deal with full buffer

* offline not implemented

* info storage; compute_reward as arg; offline sampling error

* offline sampling; timeout_termination; fix last_trans detection

* rm max_episode_length from tests

* fix loading and loading test

* Fix episode sampling strategy

* Episode interrupted not valid

* Typo

* Fix infos sampling, next_obs desired goals, offline sampling

* update tests for multienvs

* speed up code

* handle timeout sampling when samping

* give up ep_uid for ep_start and ep_lenght

* speed up sampling

* Improve docstring

* Typos and renaming

* Fix typing

* Fix linter warnings

* Renaming + add note

* fix reward type

* Fix future sampling strategy

* Fix future goal selection strategy

* env_fn as lambda

* Re-fix linter warnings

* Formatting

* Fix offline sampling

* restore the initial performance budget

* Remove max_episode_length for HerReplayBuffer kwargs

* SubprcVecEnv compat test

* Dedicated SubrocVecEnv test rm n_envs from parametrization

* Back to using the env arg instead of compute_reward

* Up VecEnv import

* fix lint warnings

* fix docstring

* Fix device issue

* actor_loss_modifier in SAV and TD3

* Merge RewardModifier and ActorLossModifier into Surgeon

* update surgeon for rnd

* fix uninteded merge

* fix uninteded merge

* fix unintended merge

* Rm unintended merge

* Fix KeyError

* Remove useless `all_inds`

* Minor docstring format

* Fix hint

* speedup!

* Speedup again

* speedup

* np.nonzero

* fix env normalization

* flat sampling for speedup

* typo

* drop online

* format

* remove observation from env_cheker (see #1335)

* update changelog

* default device to "auto"

* add comment for info storage

* add comment for ep_start and ep_length attributes

* a[b][c] to a[b, c]

* comment flatnonzero and unravel_index

* update _sample_goals docstring

* Fix future gaol sampling for split episode

* add informative error message for learning_starts too small

* use keyword arg for env

* try fix pytye

* Update stable_baselines3/common/off_policy_algorithm.py

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>

* Add `copy_info_dict` option

* Ignore pytype

* Update changelog

* Rename variables and improve documentation

* Ignore new bug bear rule

* Add note about future strategy

* Add deprecation warning

* Fix bug trying to pickle buffer kwargs

---------

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2023-03-20 12:03:57 +01:00
..
algos.rst Multiprocessing support for HerReplayBuffer (#704) 2023-03-20 12:03:57 +01:00
callbacks.rst Add progress bar callback and argument (#1095) 2022-10-06 18:17:31 +02:00
checking_nan.rst Fix typo in documentation (#1177) 2022-11-15 15:00:03 +01:00
custom_env.rst Standardize the use of from gym import spaces (#1240) 2023-01-02 14:51:11 +01:00
custom_policy.rst Add documentation about default network architecture (#1353) 2023-03-02 14:14:57 +01:00
developer.rst Stable-Baselines3 v1.0 (#354) 2021-03-17 14:20:31 +01:00
examples.rst Multiprocessing support for HerReplayBuffer (#704) 2023-03-20 12:03:57 +01:00
export.rst Fix support of image like normalized inputs (#1214) 2022-12-20 13:18:28 +01:00
imitation.rst Link to full imitation docs (#1106) 2022-10-10 21:36:30 -07:00
install.rst Update doc about Gymnasium support (#1382) 2023-03-14 12:43:19 +01:00
integrations.rst env_id consistency in tests (#1224) 2022-12-20 16:01:26 +01:00
migration.rst Multiprocessing support for HerReplayBuffer (#704) 2023-03-20 12:03:57 +01:00
quickstart.rst Drop gym.GoalEnv and other minor changes initally from #780 (#1184) 2022-11-28 18:22:31 +01:00
rl.rst Add base doc 2020-05-07 10:10:51 +02:00
rl_tips.rst Updated minor grammar error (#1041) 2022-08-31 18:04:15 +02:00
rl_zoo.rst Removed shared layers in mlp_extractor (#1292) 2023-01-23 14:55:19 +01:00
save_format.rst System info helper (#613) 2021-10-18 10:43:56 +02:00
sb3_contrib.rst Update doc: SB3 Contrib RecurrentPPO (#927) 2022-05-31 18:11:16 +02:00
tensorboard.rst Fix test_vec_normalize.py, test_tensorboard.py and common/monitor.py type hint (#1194) 2023-01-13 18:28:22 +01:00
vec_envs.rst Refactor observation stacking (#1238) 2023-02-06 22:41:59 +01:00