2020-05-07 08:10:51 +00:00
.. Stable Baselines3 documentation master file, created by
2019-09-26 09:46:40 +00:00
sphinx-quickstart on Thu Sep 26 11:06:54 2019.
You can adapt this file completely to your liking, but it should at least
contain the root `toctree` directive.
2020-12-21 15:17:24 +00:00
Stable-Baselines3 Docs - Reliable Reinforcement Learning Implementations
========================================================================
2019-09-26 09:46:40 +00:00
2020-12-21 15:17:24 +00:00
`Stable Baselines3 (SB3) <https://github.com/DLR-RM/stable-baselines3> `_ is a set of reliable implementations of reinforcement learning algorithms in PyTorch.
2020-05-08 11:09:38 +00:00
It is the next major version of `Stable Baselines <https://github.com/hill-a/stable-baselines> `_ .
2020-05-07 08:10:51 +00:00
Github repository: https://github.com/DLR-RM/stable-baselines3
2019-09-26 09:46:40 +00:00
2021-12-01 17:55:21 +00:00
Paper: https://jmlr.org/papers/volume22/20-1364/20-1364.pdf
2021-03-17 13:20:31 +00:00
RL Baselines3 Zoo (training framework for SB3): https://github.com/DLR-RM/rl-baselines3-zoo
2019-09-26 09:46:40 +00:00
2021-03-17 13:20:31 +00:00
RL Baselines3 Zoo provides a collection of pre-trained agents, scripts for training, evaluating agents, tuning hyperparameters, plotting results and recording videos.
2019-09-26 09:46:40 +00:00
2020-12-21 15:17:24 +00:00
SB3 Contrib (experimental RL code, latest algorithms): https://github.com/Stable-Baselines-Team/stable-baselines3-contrib
2019-09-26 09:46:40 +00:00
2020-05-07 08:10:51 +00:00
Main Features
--------------
- Unified structure for all algorithms
- PEP8 compliant (unified code style)
- Documented functions and classes
- Tests, high code coverage and type hints
- Clean code
2020-06-01 09:55:44 +00:00
- Tensorboard support
2020-10-28 08:55:16 +00:00
- **The performance of each algorithm was tested** (see *Results* section in their respective page)
2020-05-07 08:10:51 +00:00
2019-09-26 09:46:40 +00:00
.. toctree ::
:maxdepth: 2
:caption: User Guide
2020-05-07 08:10:51 +00:00
guide/install
2019-09-26 09:46:40 +00:00
guide/quickstart
2020-05-07 08:10:51 +00:00
guide/rl_tips
guide/rl
2020-05-07 14:08:23 +00:00
guide/algos
guide/examples
2019-09-26 09:46:40 +00:00
guide/vec_envs
2020-05-07 08:10:51 +00:00
guide/custom_env
2020-05-07 14:08:23 +00:00
guide/custom_policy
2020-05-07 08:10:51 +00:00
guide/callbacks
2020-06-01 09:55:44 +00:00
guide/tensorboard
2022-01-18 14:10:25 +00:00
guide/integrations
2020-05-08 09:58:43 +00:00
guide/rl_zoo
2020-12-21 15:17:24 +00:00
guide/sb3_contrib
2020-10-24 16:33:26 +00:00
guide/imitation
2020-05-07 08:10:51 +00:00
guide/migration
guide/checking_nan
2020-05-08 14:20:21 +00:00
guide/developer
2020-10-28 08:55:16 +00:00
guide/save_format
guide/export
2019-09-26 09:46:40 +00:00
.. toctree ::
:maxdepth: 1
:caption: RL Algorithms
modules/base
2020-01-20 15:19:35 +00:00
modules/a2c
2020-07-16 12:14:22 +00:00
modules/ddpg
modules/dqn
2020-10-22 09:56:43 +00:00
modules/her
2019-09-26 09:46:40 +00:00
modules/ppo
modules/sac
modules/td3
2020-05-07 08:10:51 +00:00
.. toctree ::
:maxdepth: 1
:caption: Common
2020-05-08 11:09:38 +00:00
common/atari_wrappers
2020-10-22 09:05:52 +00:00
common/env_util
2021-05-11 10:29:30 +00:00
common/envs
2020-05-07 08:10:51 +00:00
common/distributions
common/evaluation
common/env_checker
2020-05-08 11:09:38 +00:00
common/monitor
common/logger
common/noise
common/utils
2019-09-26 09:46:40 +00:00
.. toctree ::
:maxdepth: 1
:caption: Misc
misc/changelog
2020-05-07 08:10:51 +00:00
misc/projects
2019-09-26 09:46:40 +00:00
2020-05-05 13:02:35 +00:00
Citing Stable Baselines3
2020-05-05 14:28:38 +00:00
------------------------
2019-09-26 09:46:40 +00:00
To cite this project in publications:
.. code-block :: bibtex
2021-12-01 17:55:21 +00:00
@article{stable-baselines3,
author = {Antonin Raffin and Ashley Hill and Adam Gleave and Anssi Kanervisto and Maximilian Ernestus and Noah Dormann},
title = {Stable-Baselines3: Reliable Reinforcement Learning Implementations},
journal = {Journal of Machine Learning Research},
year = {2021},
volume = {22},
number = {268},
pages = {1-8},
url = {http://jmlr.org/papers/v22/20-1364.html}
}
2019-09-26 09:46:40 +00:00
2020-08-23 11:27:52 +00:00
Contributing
------------
To any interested in making the rl baselines better, there are still some improvements
that need to be done.
You can check issues in the `repo <https://github.com/DLR-RM/stable-baselines3/issues> `_ .
If you want to contribute, please read `CONTRIBUTING.md <https://github.com/DLR-RM/stable-baselines3/blob/master/CONTRIBUTING.md> `_ first.
2019-09-26 09:46:40 +00:00
Indices and tables
-------------------
* :ref: `genindex`
* :ref: `search`
* :ref: `modindex`