Commit graph

298 commits

Author SHA1 Message Date
Raffin, Antonin
7f7b288dce Merge pull request #59 from Antonin-Raffin/feat/logger (Improve logging) 2020-03-16 14:57:36 +01:00
Antonin RAFFIN
b37c23c149 Bump version and fix 2020-03-16 14:05:21 +01:00
Antonin RAFFIN
c3187604bc Code cleanup: rename lr to lr_schedule + typing 2020-03-16 14:01:32 +01:00
Antonin RAFFIN
a67bb75438 Type td3 policies 2020-03-16 13:31:06 +01:00
Antonin RAFFIN
d4ddb3d021 Update plotter 2020-03-16 12:04:57 +01:00
Antonin Raffin
cf89cac3e9 Update a2c logging 2020-03-13 11:48:16 +01:00
Antonin Raffin
29d7018265 Add better logging for SAC and PPO 2020-03-13 11:43:12 +01:00
Antonin Raffin
c39421fa64 Fix colors in results plotter 2020-03-13 10:59:16 +01:00
Raffin, Antonin
bfbe96c167 Merge pull request #57 from Antonin-Raffin/misc/improvements (Misc improvements) 2020-03-12 15:42:05 +01:00
Antonin Raffin
70e601c03c Improve code and bump version 2020-03-12 15:34:35 +01:00
Antonin Raffin
765d8fc5b2 Fix event callback 2020-03-12 13:24:11 +01:00
Antonin Raffin
b64873ffff Sync callbacks 2020-03-12 12:34:25 +01:00
Antonin Raffin
18f38f8cf5 Reformat 2020-03-12 11:12:10 +01:00
Antonin Raffin
037986a91d Add test for expln 2020-03-11 16:35:13 +01:00
Antonin Raffin
c5e5812894 Finish typing A2C and PPO 2020-03-11 13:01:42 +01:00
Antonin Raffin
90d1558534 Type and reorder arguments 2020-03-11 12:45:21 +01:00
Antonin Raffin
7e3736ed56 Type A2C and PPO init 2020-03-10 18:17:47 +01:00
Antonin Raffin
35d0d2b320 More typing 2020-03-10 18:09:45 +01:00
Antonin Raffin
6ebad92e1b Remove default seed and bump dependencies 2020-03-10 17:43:54 +01:00
Antonin Raffin
80fb62e22d Bump version 2020-03-10 17:10:15 +01:00
Antonin Raffin
f159a4a9f2 Bug fix for A2C 2020-03-10 17:08:39 +01:00
Antonin Raffin
20ee8cb68d Update changelog and add more namedtuples 2020-03-10 16:55:13 +01:00
Antonin Raffin
fb4e66213d Use NamedTuple for buffers 2020-03-10 16:43:10 +01:00
Antonin Raffin
1e81f38d66 Update changelog 2020-03-09 19:05:22 +01:00
Antonin Raffin
67894dab9f Add clip_mean parameter 2020-03-09 19:02:40 +01:00
Antonin Raffin
26ccf499b3 Use normal sampling for SAC 2020-02-21 14:50:28 +01:00
Antonin Raffin
809a3d3d38 Release 0.2.0 2020-02-14 14:39:24 +01:00
Raffin, Antonin
f8e39953a6 Merge pull request #52 from Antonin-Raffin/refactor/predict (Refactor predict method) 2020-02-14 14:34:44 +01:00
Antonin Raffin
af46aa19d1 Add copyright notice 2020-02-14 14:33:41 +01:00
Antonin Raffin
4392759057 Comment unused code 2020-02-14 14:15:55 +01:00
Antonin Raffin
e31b139c47 Add test for predict method 2020-02-14 14:03:41 +01:00
Antonin Raffin
8b559d71ab Remove deprecated monitor format and improve tests 2020-02-14 13:42:16 +01:00
Antonin Raffin
a2b1bf06d3 Add squash_output attribute to policy 2020-02-14 11:12:07 +01:00
Antonin Raffin
aa8b4eb22a Reformat and type the distributions 2020-02-13 13:46:22 +01:00
Antonin Raffin
f1a4fa2d3f Improve predict method 2020-02-12 15:25:05 +01:00
Antonin Raffin
9caea35a11 Add results plotter 2020-02-12 14:31:15 +01:00
Antonin Raffin
7bafdb3a67 Add get_vec_normalize_env() 2020-02-12 11:34:29 +01:00
Raffin, Antonin
cbb0843201 Merge pull request #51 from Antonin-Raffin/fix/entropy-squashed (Fix entropy loss for squashed Gaussian and VecEnv seeding) 2020-02-11 17:46:56 +01:00
Antonin Raffin
240833ffef Add type aliases for buffer samples 2020-02-11 17:33:22 +01:00
Antonin Raffin
2ce31c1e21 Fix entropy loss for squashed Gaussian and VecEnv seeding 2020-02-11 17:22:03 +01:00
Raffin, Antonin
02a080f647 Merge pull request #50 from Antonin-Raffin/refactor/off-policy (Add Off Policy base class) 2020-02-11 16:48:34 +01:00
Antonin Raffin
2afcf395b9 Update tests 2020-02-11 16:42:25 +01:00
Antonin Raffin
b7dcc8d58e Add extend method 2020-02-11 16:40:44 +01:00
Antonin Raffin
8eb82c86e3 Save last mean reward 2020-02-11 13:22:44 +01:00
Antonin Raffin
75a86881b3 Add save/load for replay buffer 2020-02-05 13:10:02 +01:00
Antonin Raffin
31a862c3a9 Log success rate 2020-02-04 13:24:09 +01:00
Antonin Raffin
8acac6b0f4 Update docstring 2020-02-03 18:31:13 +01:00
Antonin Raffin
16121cf2b8 Create OffPolicyRLModel 2020-02-03 18:18:41 +01:00
Raffin, Antonin
9d52a7d7d6 Merge pull request #49 from Antonin-Raffin/refactor/buffers (Refactor buffers) 2020-02-03 16:03:24 +01:00
Antonin Raffin
d850a35311 Update tests 2020-02-03 15:57:37 +01:00