Author | Commit | Message | Date
Antonin Raffin | dbaa5daca6 | Add logger and Monitor wrapper | 2019-10-10 13:41:54 +02:00
Antonin Raffin | ef50bb81e8 | Add support for categorical distribution | 2019-10-08 13:06:38 +02:00
Antonin Raffin | 4d0c033bf2 | Bug fix when randomly sampling actions | 2019-10-07 16:36:48 +02:00
Antonin Raffin | 37ab9d10f1 | Rescale actions and add action noise | 2019-10-07 16:26:03 +02:00
Antonin RAFFIN | 12f854e1aa | Fix learning starts | 2019-10-01 21:56:37 +02:00
Antonin Raffin | 440166fe26 | Add a parameter to disable ortho init | 2019-09-26 16:29:47 +02:00
Antonin Raffin | b4dc9d4e4d | Add doc | 2019-09-26 11:46:40 +02:00
Antonin Raffin | 70e5de1d1b | Update SAC defaults | 2019-09-25 17:07:54 +02:00
Antonin Raffin | 0e4fc9c0ac | Bug fix SAC | 2019-09-25 13:30:08 +02:00
Antonin Raffin | 322399e8fe | Update collect rollout | 2019-09-25 13:20:06 +02:00
Antonin Raffin | 6bfbb7198a | Rename seed | 2019-09-24 16:59:47 +02:00
Antonin Raffin | 32648d9029 | Add docstrings | 2019-09-24 15:30:58 +02:00
Antonin Raffin | f4fe1362f0 | Renaming | 2019-09-24 14:53:03 +02:00
Antonin Raffin | d22caac616 | Working SAC | 2019-09-24 14:15:12 +02:00
Antonin RAFFIN | 98e9560913 | Remove note | 2019-09-22 21:33:35 +02:00
Antonin RAFFIN | 1bd2e42995 | Add a note for squashed gaussian | 2019-09-22 14:15:22 +02:00
Antonin RAFFIN | b157b4465a | Add logo | 2019-09-22 13:57:18 +02:00
Antonin RAFFIN | 7627a8644c | Add roadmap | 2019-09-22 13:43:01 +02:00
Antonin RAFFIN | 8adb8f9931 | Change default dist to gaussian | 2019-09-22 12:56:27 +02:00
Antonin RAFFIN | ddaafcbc36 | Refactor: add distributions | 2019-09-22 12:52:49 +02:00
Antonin RAFFIN | 70e1d673a9 | Separate policy and value net | 2019-09-21 18:12:06 +02:00
Antonin RAFFIN | 2469ff3859 | Reformat | 2019-09-21 17:17:09 +02:00
Antonin RAFFIN | 3ececcd3a9 | Add tensorboard example | 2019-09-21 17:09:26 +02:00
Antonin RAFFIN | e8ddd1f901 | Improve initialization | 2019-09-21 16:48:51 +02:00
Antonin RAFFIN | dfe1ab9690 | Revert buffer update | 2019-09-21 16:03:22 +02:00
Antonin RAFFIN | a196306d9e | Update replay buffer | 2019-09-21 15:54:26 +02:00
Antonin RAFFIN | bcdd99d22c | Fix deterministic run | 2019-09-21 15:53:28 +02:00
Antonin Raffin | a9b8276efb | Attempt to fix loss of perf because of VecEnvs | 2019-09-20 18:06:08 +02:00
Antonin Raffin | 0e727a5f72 | Full compat for VecEnv + bug fixes for cuda | 2019-09-20 16:43:19 +02:00
Antonin Raffin | 255ff10bff | PPO VecEnv compat | 2019-09-20 15:19:04 +02:00
Antonin Raffin | 56053bc692 | Add stable-baselines VecEnvs | 2019-09-20 15:18:25 +02:00
Antonin RAFFIN | cc4380eccd | Add eval env and clip vf | 2019-09-19 17:18:41 +02:00
Antonin RAFFIN | fe8b415cbf | First sign of life | 2019-09-19 16:21:28 +02:00
Antonin RAFFIN | ad089f5b19 | Add explained variance | 2019-09-19 11:43:27 +02:00
Antonin RAFFIN | 26f0c8d8e5 | Refactor buffer | 2019-09-19 11:43:15 +02:00
Antonin RAFFIN | 149148d4c7 | Bug fix actor forward | 2019-09-18 23:55:41 +02:00
Antonin RAFFIN | 525fe43552 | Bug fix rollout buffer | 2019-09-18 23:48:47 +02:00
Antonin RAFFIN | e1c1d5c4ab | Bug fixes (not working yet) | 2019-09-18 22:12:32 +02:00
Antonin RAFFIN | 6bb7e183d2 | Running PPO (not working yet) | 2019-09-18 15:35:17 +02:00
Antonin Raffin | 54dd7ea60d | Start PPO | 2019-09-18 13:10:27 +02:00
Antonin Raffin | 2a660e9a41 | Update closer to original implementation for CEMRL | 2019-09-12 15:38:15 +02:00
Antonin Raffin | f04754afec | Refactor for collecting rollout | 2019-09-12 14:00:55 +02:00
Antonin Raffin | 5e3a84d551 | Refactor policies | 2019-09-12 11:19:06 +02:00
Antonin Raffin | c3c87f8311 | Add update styles for CEM-RL | 2019-09-10 13:07:15 +02:00
Antonin Raffin | d22b66fc10 | Fixes for CUDA support | 2019-09-09 16:45:55 +02:00
Antonin Raffin | d333abe963 | Remove stdout flush | 2019-09-09 13:51:18 +02:00
Antonin Raffin | 12431b0e92 | Refactor: CEM-RL closer to TD3 implementation | 2019-09-09 13:43:46 +02:00
Antonin Raffin | 6cce61d183 | Fix random exploration | 2019-09-06 14:11:27 +02:00
Antonin Raffin | 5e38080937 | Code cleanup | 2019-09-06 14:04:40 +02:00
Antonin Raffin | d4e2dc8a9c | Add CEM-RL | 2019-09-06 14:01:10 +02:00