From a5c3418765e371cdfd831711110339dad58c8fa6 Mon Sep 17 00:00:00 2001
From: Antonin Raffin
Date: Thu, 19 Dec 2019 15:28:36 +0100
Subject: [PATCH] Update README (roadmap moved to github)

---
 README.md | 24 +++---------------------
 1 file changed, 3 insertions(+), 21 deletions(-)

diff --git a/README.md b/README.md
index 8ed3ae3..d8f5fa6 100644
--- a/README.md
+++ b/README.md
@@ -14,27 +14,9 @@ PyTorch version of [Stable Baselines](https://github.com/hill-a/stable-baselines
 - SAC
 - TD3
 
+- SDE support for A2C, PPO, SAC and TD3.
+
 ## Roadmap
 
-TODO:
-- better predict
-- complete logger
-- Refactor: buffer with numpy array instead of pytorch
-- Refactor: remove duplicated code for evaluation
-- double check the shape of log prob
-- try squashing both mean and output when using SAC + SDE
-- plotting? -> zoo
-
-Later:
-- get_parameters / set_parameters
-- SDE: use [affine transform](https://www.tensorflow.org/probability/api_docs/python/tfp/bijectors/Affine)
-  to scale the noise after a tanh transform?
-- Use MultivariateNormal with full covariance matrix?
-- CNN policies + normalization
-- tensorboard support
-- DQN
-- TRPO
-- ACER
-- DDPG
-- HER -> use stable-baselines because does not depends on tf?
+- cf github Roadmap