mirror of
https://github.com/saymrwulf/stable-baselines3.git
synced 2026-05-18 21:30:19 +00:00
898 B
898 B
Torchy Baselines
PyTorch version of Stable Baselines, a set of improved implementations of reinforcement learning algorithms.
TODO:
- save/load
- predict
- better rescale (min + action * range)
- documentation
- flexible mlp
- logger
- better monitor wrapper?
- automatic choice for action distribution
Later:
- get_parameters / set_parameters
- CNN policies + normalization
- tensorboard support
- DQN
- TRPO
- A2C
- ACER
- HER -> use stable-baselines because does not depends on tf?