Commit graph

30 commits

Author SHA1 Message Date
Antonin Raffin
ad32aa60f3 Add sde scheduler 2019-11-18 16:03:08 +01:00
Antonin Raffin
d8a7556d84 Merge branch 'feat/sde' into feat/offpolicy-sde 2019-11-18 15:14:05 +01:00
Antonin Raffin
b9c20d443d Update doc + add test for tanh bijector 2019-11-18 15:04:07 +01:00
Antonin Raffin
5d353d598c Start cleanup + update docstrings 2019-11-18 14:09:31 +01:00
Antonin Raffin
fb64072859 Update sde test 2019-11-15 11:07:49 +01:00
Antonin Raffin
cdb62a93fe Bug fix for off-policy normalization
Now working properly
2019-11-15 11:00:31 +01:00
Antonin Raffin
5278a6f3f8 Testing off policy normalization 2019-11-14 14:35:00 +01:00
Antonin Raffin
db87e0d36a Quick and dirty SDE version for TD3 2019-11-07 17:31:52 +01:00
Antonin Raffin
72a6f18e43 Add sde test + fix random seed 2019-10-31 14:14:30 +01:00
Antonin Raffin
42d50ed09b Add expln 2019-10-29 15:15:54 +01:00
Antonin Raffin
c15b4bda1e Add first draft of SDE 2019-10-28 18:24:13 +01:00
Antonin Raffin
0ad743c85d Add A2C 2019-10-25 10:59:15 +02:00
Antonin RAFFIN
53898f3d1a Add flexible mlp 2019-10-17 13:32:25 +02:00
Antonin Raffin
ef50bb81e8 Add support for categorical distribution 2019-10-08 13:06:38 +02:00
Antonin Raffin
37ab9d10f1 Rescale actions and add action noise 2019-10-07 16:26:03 +02:00
Antonin Raffin
32648d9029 Add docstrings 2019-09-24 15:30:58 +02:00
Antonin Raffin
d22caac616 Working SAC 2019-09-24 14:15:12 +02:00
Antonin RAFFIN
2469ff3859 Reformat 2019-09-21 17:17:09 +02:00
Antonin Raffin
a9b8276efb Attempt to fix loss of perf because of VecEnvs 2019-09-20 18:06:08 +02:00
Antonin Raffin
0e727a5f72 Full compat for VecEnv + bug fixes for cuda 2019-09-20 16:43:19 +02:00
Antonin Raffin
56053bc692 Add stable-baselines VecEnvs 2019-09-20 15:18:25 +02:00
Antonin RAFFIN
fe8b415cbf First sign of life 2019-09-19 16:21:28 +02:00
Antonin RAFFIN
e1c1d5c4ab Bug fixes (not working yet) 2019-09-18 22:12:32 +02:00
Antonin RAFFIN
6bb7e183d2 Running PPO (not working yet) 2019-09-18 15:35:17 +02:00
Antonin Raffin
5e3a84d551 Refactor policies 2019-09-12 11:19:06 +02:00
Antonin Raffin
5e38080937 Code cleanup 2019-09-06 14:04:40 +02:00
Antonin Raffin
d4e2dc8a9c Add CEM-RL 2019-09-06 14:01:10 +02:00
Antonin Raffin
90882ee846 Fixes for python 2 + env from string 2019-09-06 11:46:25 +02:00
Antonin Raffin
9cf289b997 Bug fixes + add evaluate script 2019-09-06 10:44:55 +02:00
Antonin Raffin
46d8d9725b Init: TD3 2019-09-05 17:29:41 +02:00