PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Find a file
2019-12-19 15:28:36 +01:00
docs undo changes to conf.py 2019-11-21 14:52:29 +01:00
scripts Bug fixes + add evaluate script 2019-09-06 10:44:55 +02:00
tests Test for differential entropy 2019-12-18 13:45:56 +01:00
torchy_baselines Use HardTanh to relax the constrain 2019-12-19 11:59:00 +01:00
.coveragerc Bug fixes + add evaluate script 2019-09-06 10:44:55 +02:00
.gitignore Refactor: CEM-RL closer to TD3 implementation 2019-09-09 13:43:46 +02:00
LICENSE Init: TD3 2019-09-05 17:29:41 +02:00
README.md Update README (roadmap moved to github) 2019-12-19 15:28:36 +01:00
setup.cfg Add flexible mlp 2019-10-17 13:32:25 +02:00
setup.py Bump version 2019-12-05 16:44:27 +01:00

Build Status Documentation Status

Torchy Baselines

PyTorch version of Stable Baselines, a set of improved implementations of reinforcement learning algorithms.

Implemented Algorithms

  • A2C

  • CEM-RL (with TD3)

  • PPO

  • SAC

  • TD3

  • SDE support for A2C, PPO, SAC and TD3.

Roadmap

  • cf github Roadmap