stable-baselines3/README.md
2019-09-25 13:20:06 +02:00

898 B

Build Status Documentation Status

Torchy Baselines

PyTorch version of Stable Baselines, a set of improved implementations of reinforcement learning algorithms.

TODO:

  • save/load
  • predict
  • better rescale (min + action * range)
  • documentation
  • flexible mlp
  • logger
  • better monitor wrapper?
  • automatic choice for action distribution

Later:

  • get_parameters / set_parameters
  • CNN policies + normalization
  • tensorboard support
  • DQN
  • TRPO
  • A2C
  • ACER
  • HER -> use stable-baselines because does not depends on tf?