Commit graph

22 commits

Author          SHA1        Message                                      Date
Antonin Raffin  df1e7aa000  Add docstring                                2019-10-28 17:42:39 +01:00
Antonin Raffin  d67822718c  Add learning rate schedule                   2019-10-28 16:47:13 +01:00
Antonin RAFFIN  3bc746c6ee  Add logger for PPO                           2019-10-17 13:44:48 +02:00
Antonin Raffin  b5656531d1  Enable logger for SAC/TD3 + refactor         2019-10-10 13:47:13 +02:00
Antonin Raffin  4d0c033bf2  Bug fix when randomly sampling actions       2019-10-07 16:36:48 +02:00
Antonin Raffin  37ab9d10f1  Rescale actions and add action noise         2019-10-07 16:26:03 +02:00
Antonin Raffin  322399e8fe  Update collect rollout                       2019-09-25 13:20:06 +02:00
Antonin Raffin  6bfbb7198a  Rename seed                                  2019-09-24 16:59:47 +02:00
Antonin Raffin  32648d9029  Add docstrings                               2019-09-24 15:30:58 +02:00
Antonin RAFFIN  2469ff3859  Reformat                                     2019-09-21 17:17:09 +02:00
Antonin RAFFIN  bcdd99d22c  Fix deterministic run                        2019-09-21 15:53:28 +02:00
Antonin Raffin  0e727a5f72  Full compat for VecEnv + bug fixes for cuda  2019-09-20 16:43:19 +02:00
Antonin Raffin  255ff10bff  PPO VecEnv compat                            2019-09-20 15:19:04 +02:00
Antonin RAFFIN  cc4380eccd  Add eval env and clip vf                     2019-09-19 17:18:41 +02:00
Antonin Raffin  54dd7ea60d  Start PPO                                    2019-09-18 13:10:27 +02:00
Antonin Raffin  f04754afec  Refactor for collecting rollout              2019-09-12 14:00:55 +02:00
Antonin Raffin  5e3a84d551  Refactor policies                            2019-09-12 11:19:06 +02:00
Antonin Raffin  90882ee846  Fixes for python 2 + env from string         2019-09-06 11:46:25 +02:00
Antonin Raffin  904742714d  Fixes for python 2                           2019-09-06 11:43:02 +02:00
Antonin Raffin  68028c71a1  Seed env + fix max action                    2019-09-06 11:09:56 +02:00
Antonin Raffin  9cf289b997  Bug fixes + add evaluate script              2019-09-06 10:44:55 +02:00
Antonin Raffin  46d8d9725b  Init: TD3                                    2019-09-05 17:29:41 +02:00