mirror of
https://github.com/saymrwulf/stable-baselines3.git
synced 2026-06-01 23:30:53 +00:00
35 lines
885 B
Markdown
35 lines
885 B
Markdown
<img src="docs/\_static/img/logo.png" align="right" width="40%"/>
|
|
|
|
[](https://travis-ci.com/hill-a/stable-baselines) [](https://stable-baselines.readthedocs.io/en/master/?badge=master)
|
|
|
|
# Torchy Baselines
|
|
|
|
PyTorch version of [Stable Baselines](https://github.com/hill-a/stable-baselines), a set of improved implementations of reinforcement learning algorithms.
|
|
|
|
## Implemented Algorithms
|
|
|
|
- CEM-RL (with TD3)
|
|
- PPO
|
|
- SAC
|
|
- TD3
|
|
|
|
|
|
## Roadmap
|
|
|
|
TODO:
|
|
- save/load
|
|
- predict
|
|
- flexible mlp
|
|
- logger
|
|
- better monitor wrapper?
|
|
- A2C
|
|
|
|
Later:
|
|
- get_parameters / set_parameters
|
|
- CNN policies + normalization
|
|
- tensorboard support
|
|
- DQN
|
|
- TRPO
|
|
- ACER
|
|
- DDPG
|
|
- HER -> use stable-baselines because does not depends on tf?
|