saymrwulf/stable-baselines3

mirror of https://github.com/saymrwulf/stable-baselines3.git synced 2026-05-18 21:30:19 +00:00

Antonin Raffin 322399e8fe Update collect rollout

2019-09-25 13:20:06 +02:00

Torchy Baselines

PyTorch version of Stable Baselines, a set of improved implementations of reinforcement learning algorithms.

TODO:

save/load
predict
better rescale (min + action * range)
documentation
flexible MLP
logger
better monitor wrapper?
automatic choice for action distribution
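
The rescaling formula in the TODO list above (min + action * range) can be sketched as follows. This is a minimal, dependency-free illustration, not the project's implementation: it assumes the policy outputs actions normalized to [0, 1], and the function names `rescale_action` and `unscale_action` are hypothetical.

```python
from typing import List, Sequence


def rescale_action(
    action: Sequence[float], low: Sequence[float], high: Sequence[float]
) -> List[float]:
    """Map a normalized action in [0, 1] to [low, high], per dimension.

    Implements `min + action * range`, where range = high - low.
    Assumption: the incoming action is already in [0, 1].
    """
    if not (len(action) == len(low) == len(high)):
        raise ValueError("action, low and high must have the same length")
    return [lo + a * (hi - lo) for a, lo, hi in zip(action, low, high)]


def unscale_action(
    scaled: Sequence[float], low: Sequence[float], high: Sequence[float]
) -> List[float]:
    """Inverse mapping: bring an action from [low, high] back to [0, 1]."""
    if not (len(scaled) == len(low) == len(high)):
        raise ValueError("scaled, low and high must have the same length")
    return [(x - lo) / (hi - lo) for x, lo, hi in zip(scaled, low, high)]


if __name__ == "__main__":
    # Hypothetical bounds for a 3-dimensional continuous action space.
    low = [-2.0, -2.0, 0.0]
    high = [2.0, 2.0, 10.0]
    env_action = rescale_action([0.0, 0.5, 1.0], low, high)
    print(env_action)
    print(unscale_action(env_action, low, high))
```

If the policy instead outputs actions in [-1, 1] (for example after a tanh squashing), the normalized value would first need to be shifted to [0, 1] with `(action + 1) / 2` before applying the same formula.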

Later:

get_parameters / set_parameters
CNN policies + normalization
TensorBoard support
DQN
TRPO
A2C
ACER
HER -> reuse the stable-baselines implementation, since it does not depend on TensorFlow?