PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Antonin Raffin b4dc9d4e4d Add doc
2019-09-26 11:46:40 +02:00
docs Add doc 2019-09-26 11:46:40 +02:00
scripts Bug fixes + add evaluate script 2019-09-06 10:44:55 +02:00
tests Add docstrings 2019-09-24 15:30:58 +02:00
torchy_baselines Add doc 2019-09-26 11:46:40 +02:00
.coveragerc Bug fixes + add evaluate script 2019-09-06 10:44:55 +02:00
.gitignore Refactor: CEM-RL closer to TD3 implementation 2019-09-09 13:43:46 +02:00
LICENSE Init: TD3 2019-09-05 17:29:41 +02:00
README.md Update collect rollout 2019-09-25 13:20:06 +02:00
setup.cfg Bug fixes + add evaluate script 2019-09-06 10:44:55 +02:00
setup.py Working SAC 2019-09-24 14:15:12 +02:00


Torchy Baselines

PyTorch version of Stable Baselines, a set of improved implementations of reinforcement learning algorithms.

TODO:

  • save/load
  • predict
  • better rescale (min + action * range)
  • documentation
  • flexible mlp
  • logger
  • better monitor wrapper?
  • automatic choice for action distribution
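
A minimal sketch of the "better rescale" item above, assuming the policy outputs actions normalized to [0, 1] and that "range" means `high - low` for each action dimension. The function names `rescale_action` and `normalize_action` are hypothetical and not part of the current code base; a real implementation would more likely operate on NumPy arrays or tensors.

```python
from typing import List, Sequence


def rescale_action(
    action: Sequence[float], low: Sequence[float], high: Sequence[float]
) -> List[float]:
    """Map a normalized action in [0, 1] to the bounds [low, high].

    Implements `min + action * range` per dimension, with range = high - low.
    (Hypothetical helper, assumes the action is already in [0, 1].)
    """
    return [lo + a * (hi - lo) for a, lo, hi in zip(action, low, high)]


def normalize_action(
    action: Sequence[float], low: Sequence[float], high: Sequence[float]
) -> List[float]:
    """Inverse mapping: bring an action from [low, high] back to [0, 1].

    Assumes high > low in every dimension (no zero-width range).
    """
    return [(a - lo) / (hi - lo) for a, lo, hi in zip(action, low, high)]


low, high = [-2.0, -2.0, 0.0], [2.0, 2.0, 10.0]
scaled = rescale_action([0.0, 0.5, 1.0], low, high)
restored = normalize_action(scaled, low, high)
```

If the policy instead outputs actions in [-1, 1] (for example after a tanh), the action would first need to be shifted to [0, 1] with `(action + 1) / 2` before applying the formula.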

Later:

  • get_parameters / set_parameters
  • CNN policies + normalization
  • tensorboard support
  • DQN
  • TRPO
  • A2C
  • ACER
  • HER -> use the stable-baselines implementation, since it does not depend on TensorFlow?