Raffin, Antonin
|
7f7b288dce
|
Merge pull request #59 from Antonin-Raffin/feat/logger
Improve logging
|
2020-03-16 14:57:36 +01:00 |
|
Antonin RAFFIN
|
b37c23c149
|
Bump version and fix
|
2020-03-16 14:05:21 +01:00 |
|
Antonin RAFFIN
|
c3187604bc
|
Code cleanup: rename lr to lr_schedule + typing
|
2020-03-16 14:01:32 +01:00 |
|
Antonin RAFFIN
|
a67bb75438
|
Type td3 policies
|
2020-03-16 13:31:06 +01:00 |
|
Antonin RAFFIN
|
d4ddb3d021
|
Update plotter
|
2020-03-16 12:04:57 +01:00 |
|
Antonin Raffin
|
cf89cac3e9
|
Update a2c logging
|
2020-03-13 11:48:16 +01:00 |
|
Antonin Raffin
|
29d7018265
|
Add better logging for SAC and PPO
|
2020-03-13 11:43:12 +01:00 |
|
Antonin Raffin
|
c39421fa64
|
Fix colors in results plotter
|
2020-03-13 10:59:16 +01:00 |
|
Raffin, Antonin
|
bfbe96c167
|
Merge pull request #57 from Antonin-Raffin/misc/improvements
Misc improvements
|
2020-03-12 15:42:05 +01:00 |
|
Antonin Raffin
|
70e601c03c
|
Improve code and bump version
|
2020-03-12 15:34:35 +01:00 |
|
Antonin Raffin
|
765d8fc5b2
|
Fix event callback
|
2020-03-12 13:24:11 +01:00 |
|
Antonin Raffin
|
b64873ffff
|
Sync callbacks
|
2020-03-12 12:34:25 +01:00 |
|
Antonin Raffin
|
18f38f8cf5
|
Reformat
|
2020-03-12 11:12:10 +01:00 |
|
Antonin Raffin
|
037986a91d
|
Add test for expln
|
2020-03-11 16:35:13 +01:00 |
|
Antonin Raffin
|
c5e5812894
|
Finish typing A2C and PPO
|
2020-03-11 13:01:42 +01:00 |
|
Antonin Raffin
|
90d1558534
|
Type and reorder arguments
|
2020-03-11 12:45:21 +01:00 |
|
Antonin Raffin
|
7e3736ed56
|
Type A2C and PPO init
|
2020-03-10 18:17:47 +01:00 |
|
Antonin Raffin
|
35d0d2b320
|
More typing
|
2020-03-10 18:09:45 +01:00 |
|
Antonin Raffin
|
6ebad92e1b
|
Remove default seed and bump dependencies
|
2020-03-10 17:43:54 +01:00 |
|
Antonin Raffin
|
80fb62e22d
|
Bump version
|
2020-03-10 17:10:15 +01:00 |
|
Antonin Raffin
|
f159a4a9f2
|
Bug fix for A2C
|
2020-03-10 17:08:39 +01:00 |
|
Antonin Raffin
|
20ee8cb68d
|
Update changelog and add more namedtuples
|
2020-03-10 16:55:13 +01:00 |
|
Antonin Raffin
|
fb4e66213d
|
Use NamedTuple for buffers
|
2020-03-10 16:43:10 +01:00 |
|
Antonin Raffin
|
1e81f38d66
|
Update changelog
|
2020-03-09 19:05:22 +01:00 |
|
Antonin Raffin
|
67894dab9f
|
Add clip_mean parameter
|
2020-03-09 19:02:40 +01:00 |
|
Antonin Raffin
|
26ccf499b3
|
Use normal sampling for SAC
|
2020-02-21 14:50:28 +01:00 |
|
Antonin Raffin
|
809a3d3d38
|
Release 0.2.0
|
2020-02-14 14:39:24 +01:00 |
|
Raffin, Antonin
|
f8e39953a6
|
Merge pull request #52 from Antonin-Raffin/refactor/predict
Refactor predict method
|
2020-02-14 14:34:44 +01:00 |
|
Antonin Raffin
|
af46aa19d1
|
Add copyright notice
|
2020-02-14 14:33:41 +01:00 |
|
Antonin Raffin
|
4392759057
|
Comment unused code
|
2020-02-14 14:15:55 +01:00 |
|
Antonin Raffin
|
e31b139c47
|
Add test for predict method
|
2020-02-14 14:03:41 +01:00 |
|
Antonin Raffin
|
8b559d71ab
|
Remove deprecated monitor format and improve tests
|
2020-02-14 13:42:16 +01:00 |
|
Antonin Raffin
|
a2b1bf06d3
|
Add squash_output attribute to policy
|
2020-02-14 11:12:07 +01:00 |
|
Antonin Raffin
|
aa8b4eb22a
|
Reformat and type the distributions
|
2020-02-13 13:46:22 +01:00 |
|
Antonin Raffin
|
f1a4fa2d3f
|
Improve predict method
|
2020-02-12 15:25:05 +01:00 |
|
Antonin Raffin
|
9caea35a11
|
Add results plotter
|
2020-02-12 14:31:15 +01:00 |
|
Antonin Raffin
|
7bafdb3a67
|
Add get_vec_normalize_env()
|
2020-02-12 11:34:29 +01:00 |
|
Raffin, Antonin
|
cbb0843201
|
Merge pull request #51 from Antonin-Raffin/fix/entropy-squashed
Fix entropy loss for squashed Gaussian and VecEnv seeding
|
2020-02-11 17:46:56 +01:00 |
|
Antonin Raffin
|
240833ffef
|
Add type aliases for buffer samples
|
2020-02-11 17:33:22 +01:00 |
|
Antonin Raffin
|
2ce31c1e21
|
Fix entropy loss for squashed Gaussian and VecEnv seeding
|
2020-02-11 17:22:03 +01:00 |
|
Raffin, Antonin
|
02a080f647
|
Merge pull request #50 from Antonin-Raffin/refactor/off-policy
Add Off Policy base class
|
2020-02-11 16:48:34 +01:00 |
|
Antonin Raffin
|
2afcf395b9
|
Update tests
|
2020-02-11 16:42:25 +01:00 |
|
Antonin Raffin
|
b7dcc8d58e
|
Add extend method
|
2020-02-11 16:40:44 +01:00 |
|
Antonin Raffin
|
8eb82c86e3
|
Save last mean reward
|
2020-02-11 13:22:44 +01:00 |
|
Antonin Raffin
|
75a86881b3
|
Add save/load for replay buffer
|
2020-02-05 13:10:02 +01:00 |
|
Antonin Raffin
|
31a862c3a9
|
Log success rate
|
2020-02-04 13:24:09 +01:00 |
|
Antonin Raffin
|
8acac6b0f4
|
Update docstring
|
2020-02-03 18:31:13 +01:00 |
|
Antonin Raffin
|
16121cf2b8
|
Create OffPolicyRLModel
|
2020-02-03 18:18:41 +01:00 |
|
Raffin, Antonin
|
9d52a7d7d6
|
Merge pull request #49 from Antonin-Raffin/refactor/buffers
Refactor buffers
|
2020-02-03 16:03:24 +01:00 |
|
Antonin Raffin
|
d850a35311
|
Update tests
|
2020-02-03 15:57:37 +01:00 |
|