Antonin Raffin
|
ad32aa60f3
|
Add sde scheduler
|
2019-11-18 16:03:08 +01:00 |
|
Antonin Raffin
|
d8a7556d84
|
Merge branch 'feat/sde' into feat/offpolicy-sde
|
2019-11-18 15:14:05 +01:00 |
|
Antonin Raffin
|
b9c20d443d
|
Update doc + add test for tanh bijector
|
2019-11-18 15:04:07 +01:00 |
|
Antonin Raffin
|
5d353d598c
|
Start cleanup + update docstrings
|
2019-11-18 14:09:31 +01:00 |
|
Antonin Raffin
|
fb64072859
|
Update sde test
|
2019-11-15 11:07:49 +01:00 |
|
Antonin Raffin
|
cdb62a93fe
|
Bug fix for off-policy normalization
Now working properly
|
2019-11-15 11:00:31 +01:00 |
|
Antonin Raffin
|
5278a6f3f8
|
Testing off policy normalization
|
2019-11-14 14:35:00 +01:00 |
|
Antonin Raffin
|
db87e0d36a
|
Quick and dirty SDE version for TD3
|
2019-11-07 17:31:52 +01:00 |
|
Antonin Raffin
|
72a6f18e43
|
Add sde test + fix random seed
|
2019-10-31 14:14:30 +01:00 |
|
Antonin Raffin
|
42d50ed09b
|
Add expln
|
2019-10-29 15:15:54 +01:00 |
|
Antonin Raffin
|
c15b4bda1e
|
Add first draft of SDE
|
2019-10-28 18:24:13 +01:00 |
|
Antonin Raffin
|
0ad743c85d
|
Add A2C
|
2019-10-25 10:59:15 +02:00 |
|
Antonin RAFFIN
|
53898f3d1a
|
Add flexible mlp
|
2019-10-17 13:32:25 +02:00 |
|
Antonin Raffin
|
ef50bb81e8
|
Add support for categorical distribution
|
2019-10-08 13:06:38 +02:00 |
|
Antonin Raffin
|
37ab9d10f1
|
Rescale actions and add action noise
|
2019-10-07 16:26:03 +02:00 |
|
Antonin Raffin
|
32648d9029
|
Add docstrings
|
2019-09-24 15:30:58 +02:00 |
|
Antonin Raffin
|
d22caac616
|
Working SAC
|
2019-09-24 14:15:12 +02:00 |
|
Antonin RAFFIN
|
2469ff3859
|
Reformat
|
2019-09-21 17:17:09 +02:00 |
|
Antonin Raffin
|
a9b8276efb
|
Attempt to fix loss of perf because of VecEnvs
|
2019-09-20 18:06:08 +02:00 |
|
Antonin Raffin
|
0e727a5f72
|
Full compat for VecEnv + bug fixes for cuda
|
2019-09-20 16:43:19 +02:00 |
|
Antonin Raffin
|
56053bc692
|
Add stable-baselines VecEnvs
|
2019-09-20 15:18:25 +02:00 |
|
Antonin RAFFIN
|
fe8b415cbf
|
First sign of life
|
2019-09-19 16:21:28 +02:00 |
|
Antonin RAFFIN
|
e1c1d5c4ab
|
Bug fixes (not working yet)
|
2019-09-18 22:12:32 +02:00 |
|
Antonin RAFFIN
|
6bb7e183d2
|
Running PPO (not working yet)
|
2019-09-18 15:35:17 +02:00 |
|
Antonin Raffin
|
5e3a84d551
|
Refactor policies
|
2019-09-12 11:19:06 +02:00 |
|
Antonin Raffin
|
5e38080937
|
Code cleanup
|
2019-09-06 14:04:40 +02:00 |
|
Antonin Raffin
|
d4e2dc8a9c
|
Add CEM-RL
|
2019-09-06 14:01:10 +02:00 |
|
Antonin Raffin
|
90882ee846
|
Fixes for python 2 + env from string
|
2019-09-06 11:46:25 +02:00 |
|
Antonin Raffin
|
9cf289b997
|
Bug fixes + add evaluate script
|
2019-09-06 10:44:55 +02:00 |
|
Antonin Raffin
|
46d8d9725b
|
Init: TD3
|
2019-09-05 17:29:41 +02:00 |
|