Antonin Raffin
|
d8a7556d84
|
Merge branch 'feat/sde' into feat/offpolicy-sde
|
2019-11-18 15:14:05 +01:00 |
|
Antonin Raffin
|
ef59a7e431
|
Update version + add docstring
|
2019-11-18 15:11:19 +01:00 |
|
Antonin Raffin
|
cdb62a93fe
|
Bug fix for off-policy normalization
Now working properly
|
2019-11-15 11:00:31 +01:00 |
|
Antonin Raffin
|
5278a6f3f8
|
Testing off policy normalization
|
2019-11-14 14:35:00 +01:00 |
|
Antonin Raffin
|
a08382faab
|
Add sde update for TD3
|
2019-11-12 18:37:13 +01:00 |
|
Antonin Raffin
|
715865a0fe
|
Add noise clipping
|
2019-11-08 13:17:38 +01:00 |
|
Antonin Raffin
|
f4546837c3
|
Add std to logger
|
2019-11-07 17:41:28 +01:00 |
|
Antonin Raffin
|
db87e0d36a
|
Quick and dirty SDE version for TD3
|
2019-11-07 17:31:52 +01:00 |
|
Antonin Raffin
|
72a6f18e43
|
Add sde test + fix random seed
|
2019-10-31 14:14:30 +01:00 |
|
Antonin Raffin
|
df1e7aa000
|
Add docstring
|
2019-10-28 17:42:39 +01:00 |
|
Antonin Raffin
|
d67822718c
|
Add learning rate schedule
|
2019-10-28 16:47:13 +01:00 |
|
Antonin RAFFIN
|
3bc746c6ee
|
Add logger for PPO
|
2019-10-17 13:44:48 +02:00 |
|
Antonin Raffin
|
b5656531d1
|
Enable logger for SAC/TD3 + refactor
|
2019-10-10 13:47:13 +02:00 |
|
Antonin Raffin
|
4d0c033bf2
|
Bug fix when randomly sampling actions
|
2019-10-07 16:36:48 +02:00 |
|
Antonin Raffin
|
37ab9d10f1
|
Rescale actions and add action noise
|
2019-10-07 16:26:03 +02:00 |
|
Antonin Raffin
|
322399e8fe
|
Update collect rollout
|
2019-09-25 13:20:06 +02:00 |
|
Antonin Raffin
|
6bfbb7198a
|
Rename seed
|
2019-09-24 16:59:47 +02:00 |
|
Antonin Raffin
|
32648d9029
|
Add docstrings
|
2019-09-24 15:30:58 +02:00 |
|
Antonin RAFFIN
|
2469ff3859
|
Reformat
|
2019-09-21 17:17:09 +02:00 |
|
Antonin RAFFIN
|
bcdd99d22c
|
Fix deterministic run
|
2019-09-21 15:53:28 +02:00 |
|
Antonin Raffin
|
0e727a5f72
|
Full compat for VecEnv + bug fixes for cuda
|
2019-09-20 16:43:19 +02:00 |
|
Antonin Raffin
|
255ff10bff
|
PPO VecEnv compat
|
2019-09-20 15:19:04 +02:00 |
|
Antonin RAFFIN
|
cc4380eccd
|
Add eval env and clip vf
|
2019-09-19 17:18:41 +02:00 |
|
Antonin Raffin
|
54dd7ea60d
|
Start PPO
|
2019-09-18 13:10:27 +02:00 |
|
Antonin Raffin
|
f04754afec
|
Refactor for collecting rollout
|
2019-09-12 14:00:55 +02:00 |
|
Antonin Raffin
|
5e3a84d551
|
Refactor policies
|
2019-09-12 11:19:06 +02:00 |
|
Antonin Raffin
|
90882ee846
|
Fixes for python 2 + env from string
|
2019-09-06 11:46:25 +02:00 |
|
Antonin Raffin
|
904742714d
|
Fixes for python 2
|
2019-09-06 11:43:02 +02:00 |
|
Antonin Raffin
|
68028c71a1
|
Seed env + fix max action
|
2019-09-06 11:09:56 +02:00 |
|
Antonin Raffin
|
9cf289b997
|
Bug fixes + add evaluate script
|
2019-09-06 10:44:55 +02:00 |
|
Antonin Raffin
|
46d8d9725b
|
Init: TD3
|
2019-09-05 17:29:41 +02:00 |
|