stable-baselines3/docs/guide
Alex Pasquali 6d55a09f81
Updated custom policy docs to better explain the `mlp_extractor`'s dimensions (#1196)
* Updated custom policy docs

Better explained how the dimensions of the mlp_extractor work, including the action net and the value net after the layers specified in net_arch.

* Improved custom policy doc

Section: Custom Network Architecture.
Explained with greater detail that an action net and a value net will be added on top of the net_arch.

* Improved custom policy doc

Section: Custom Network Architecture.
Merged a comment into a note

* Alignment

Co-authored-by: Quentin GALLOUÉDEC <gallouedec.quentin@gmail.com>
2022-12-12 16:19:51 +01:00
..
algos.rst Update doc: SB3 Contrib RecurrentPPO (#927) 2022-05-31 18:11:16 +02:00
callbacks.rst Add progress bar callback and argument (#1095) 2022-10-06 18:17:31 +02:00
checking_nan.rst Fix typo in documentation (#1177) 2022-11-15 15:00:03 +01:00
custom_env.rst Standardized the use of `"` for string representation (#1086) 2022-10-03 15:15:39 +02:00
custom_policy.rst Updated custom policy docs to better explain the `mlp_extractor`'s dimensions (#1196) 2022-12-12 16:19:51 +01:00
developer.rst Stable-Baselines3 v1.0 (#354) 2021-03-17 14:20:31 +01:00
examples.rst Drop gym.GoalEnv and other minor changes initally from #780 (#1184) 2022-11-28 18:22:31 +01:00
export.rst Update export doc (fixes + add torch jit) (#1074) 2022-09-30 14:30:40 +02:00
imitation.rst Link to full imitation docs (#1106) 2022-10-10 21:36:30 -07:00
install.rst Fix url in docs (#1000) 2022-08-05 17:54:48 +02:00
integrations.rst Fix integration documentation (#1135) 2022-10-24 13:20:58 +02:00
migration.rst Deprecate create_eval_env, eval_env and eval_freq parameter (#1082) 2022-10-10 15:39:38 +02:00
quickstart.rst Drop gym.GoalEnv and other minor changes initally from #780 (#1184) 2022-11-28 18:22:31 +01:00
rl.rst Add base doc 2020-05-07 10:10:51 +02:00
rl_tips.rst Updated minor grammar error (#1041) 2022-08-31 18:04:15 +02:00
rl_zoo.rst Release v1.6.2 (#1103) 2022-10-10 16:37:11 +02:00
save_format.rst System info helper (#613) 2021-10-18 10:43:56 +02:00
sb3_contrib.rst Update doc: SB3 Contrib RecurrentPPO (#927) 2022-05-31 18:11:16 +02:00
tensorboard.rst Standardized the use of `"` for string representation (#1086) 2022-10-03 15:15:39 +02:00
vec_envs.rst Documentation fixes (#514) 2021-07-18 20:51:41 +02:00