stable-baselines3

mirror of https://github.com/saymrwulf/stable-baselines3.git synced 2026-07-02 03:55:39 +00:00

History

Quentin Gallouédec c5adad82b2 Multiprocessing support for HerReplayBuffer (#704 ) * IM compat. modif from old fork * mp her working, without offline sampling * update readme and doc * fix discrete action/obs space case * handle offline sampling * fix pos to be consistent with the old version * improve typing and docstring * fix discrete obs special case * new her, using episode uid * deal with full buffer * offline not implemented * info storage; compute_reward as arg; offline sampling error * offline sampling; timeout_termination; fix last_trans detection * rm max_episode_length from tests * fix loading and loading test * Fix episode sampling strategy * Episode interrupted not valid * Typo * Fix infos sampling, next_obs desired goals, offline sampling * update tests for multienvs * speed up code * handle timeout sampling when samping * give up ep_uid for ep_start and ep_lenght * speed up sampling * Improve docstring * Typos and renaming * Fix typing * Fix linter warnings * Renaming + add note * fix reward type * Fix future sampling strategy * Fix future goal selection strategy * env_fn as lambda * Re-fix linter warnings * Formatting * Fix offline sampling * restore the initial performance budget * Remove max_episode_length for HerReplayBuffer kwargs * SubprcVecEnv compat test * Dedicated SubrocVecEnv test rm n_envs from parametrization * Back to using the env arg instead of compute_reward * Up VecEnv import * fix lint warnings * fix docstring * Fix device issue * actor_loss_modifier in SAV and TD3 * Merge RewardModifier and ActorLossModifier into Surgeon * update surgeon for rnd * fix uninteded merge * fix uninteded merge * fix unintended merge * Rm unintended merge * Fix KeyError * Remove useless `all_inds` * Minor docstring format * Fix hint * speedup! * Speedup again * speedup * np.nonzero * fix env normalization * flat sampling for speedup * typo * drop online * format * remove observation from env_cheker (see #1335) * update changelog * default device to "auto" * add comment for info storage * add comment for ep_start and ep_length attributes * a[b][c] to a[b, c] * comment flatnonzero and unravel_index * update _sample_goals docstring * Fix future gaol sampling for split episode * add informative error message for learning_starts too small * use keyword arg for env * try fix pytye * Update stable_baselines3/common/off_policy_algorithm.py Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org> * Add `copy_info_dict` option * Ignore pytype * Update changelog * Rename variables and improve documentation * Ignore new bug bear rule * Add note about future strategy * Add deprecation warning * Fix bug trying to pickle buffer kwargs --------- Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>		2023-03-20 12:03:57 +01:00
..
algos.rst	Multiprocessing support for HerReplayBuffer (#704 )	2023-03-20 12:03:57 +01:00
callbacks.rst	Add progress bar callback and argument (#1095 )	2022-10-06 18:17:31 +02:00
checking_nan.rst	Fix typo in documentation (#1177 )	2022-11-15 15:00:03 +01:00
custom_env.rst	Standardize the use of `from gym import spaces` (#1240 )	2023-01-02 14:51:11 +01:00
custom_policy.rst	Add documentation about default network architecture (#1353 )	2023-03-02 14:14:57 +01:00
developer.rst
examples.rst	Multiprocessing support for HerReplayBuffer (#704 )	2023-03-20 12:03:57 +01:00
export.rst	Fix support of image like normalized inputs (#1214 )	2022-12-20 13:18:28 +01:00
imitation.rst	Link to full imitation docs (#1106 )	2022-10-10 21:36:30 -07:00
install.rst	Update doc about Gymnasium support (#1382 )	2023-03-14 12:43:19 +01:00
integrations.rst	`env_id` consistency in tests (#1224 )	2022-12-20 16:01:26 +01:00
migration.rst	Multiprocessing support for HerReplayBuffer (#704 )	2023-03-20 12:03:57 +01:00
quickstart.rst	Drop `gym.GoalEnv` and other minor changes initally from #780 (#1184 )	2022-11-28 18:22:31 +01:00
rl.rst
rl_tips.rst	Updated minor grammar error (#1041 )	2022-08-31 18:04:15 +02:00
rl_zoo.rst	Removed shared layers in mlp_extractor (#1292 )	2023-01-23 14:55:19 +01:00
save_format.rst	System info helper (#613 )	2021-10-18 10:43:56 +02:00
sb3_contrib.rst	Update doc: SB3 Contrib RecurrentPPO (#927 )	2022-05-31 18:11:16 +02:00
tensorboard.rst	Fix `test_vec_normalize.py`, `test_tensorboard.py` and `common/monitor.py` type hint (#1194 )	2023-01-13 18:28:22 +01:00
vec_envs.rst	Refactor observation stacking (#1238 )	2023-02-06 22:41:59 +01:00