Antonin Raffin
|
9345b85cfc
|
Update changelog and README
|
2020-01-22 17:23:42 +01:00 |
|
Antonin Raffin
|
9e250b6818
|
Build doc
|
2020-01-20 16:19:35 +01:00 |
|
Antonin Raffin
|
a5c3418765
|
Update README (roadmap moved to github)
|
2019-12-19 15:28:36 +01:00 |
|
Antonin Raffin
|
d63cef7693
|
Add gradient clipping for SAC
|
2019-12-06 18:32:57 +01:00 |
|
Antonin Raffin
|
233f346d53
|
Update todos
|
2019-12-06 17:46:56 +01:00 |
|
Antonin Raffin
|
fe67a98711
|
Log more values
|
2019-11-26 17:44:06 +01:00 |
|
Antonin Raffin
|
d0003ee4ec
|
Enable kwargs for proba dist
|
2019-11-25 14:00:21 +01:00 |
|
Antonin Raffin
|
604a19fbc3
|
Cleanup + update doc
|
2019-11-22 13:33:12 +01:00 |
|
Antonin Raffin
|
cdb62a93fe
|
Bug fix for off-policy normalization
Now working properly
|
2019-11-15 11:00:31 +01:00 |
|
Antonin Raffin
|
5278a6f3f8
|
Testing off policy normalization
|
2019-11-14 14:35:00 +01:00 |
|
Antonin Raffin
|
da325a0ba7
|
Solve NaN issue and reduce number of parameters
|
2019-11-13 13:02:37 +01:00 |
|
Antonin Raffin
|
0ad743c85d
|
Add A2C
|
2019-10-25 10:59:15 +02:00 |
|
Antonin Raffin
|
ef50bb81e8
|
Add support for categorical distribution
|
2019-10-08 13:06:38 +02:00 |
|
Antonin Raffin
|
4d0c033bf2
|
Bug fix when randomly sampling actions
|
2019-10-07 16:36:48 +02:00 |
|
Antonin Raffin
|
440166fe26
|
Add a parameter to disable ortho init
|
2019-09-26 16:29:47 +02:00 |
|
Antonin Raffin
|
322399e8fe
|
Update collect rollout
|
2019-09-25 13:20:06 +02:00 |
|
Antonin Raffin
|
d22caac616
|
Working SAC
|
2019-09-24 14:15:12 +02:00 |
|
Antonin RAFFIN
|
b157b4465a
|
Add logo
|
2019-09-22 13:57:18 +02:00 |
|
Antonin RAFFIN
|
7627a8644c
|
Add roadmap
|
2019-09-22 13:43:01 +02:00 |
|
Antonin Raffin
|
46d8d9725b
|
Init: TD3
|
2019-09-05 17:29:41 +02:00 |
|
Raffin, Antonin
|
ad6076bb7a
|
Initial commit
|
2019-09-05 13:26:14 +02:00 |
|