Commit graph

311 commits

Author SHA1 Message Date
Raffin, Antonin
fc7b1e80f1 Merge pull request #63 from Antonin-Raffin/feat/preprocessing: Refactor preprocessing 2020-03-25 17:00:12 +01:00
Antonin RAFFIN
fa599c65a6 Add support for Discrete observation spaces 2020-03-25 16:42:05 +01:00
Antonin RAFFIN
52d2cd6a1b Remove gym warnings 2020-03-25 15:54:58 +01:00
Antonin RAFFIN
72a88a8d92 Fix type hint for activation fn 2020-03-24 10:10:37 +01:00
Antonin RAFFIN
ba18258af6 Add proper preprocessing 2020-03-23 17:15:30 +01:00
Antonin RAFFIN
4b2092f55a Remove base network 2020-03-23 15:31:14 +01:00
Antonin RAFFIN
dcb54b5301 Remove CEMRL 2020-03-23 14:48:38 +01:00
Antonin RAFFIN
b96a081e5f Add check for image space 2020-03-20 11:20:57 +01:00
Antonin RAFFIN
57b37513b6 Refactor handling of obs and action space + remove duplicated code 2020-03-20 10:09:09 +01:00
Antonin RAFFIN
7251b9d2c2 Release v0.3.0 2020-03-19 11:11:36 +01:00
Raffin, Antonin
326b7a31c7 Merge pull request #61 from Antonin-Raffin/feat/more-tests: More tests and sync SB 2020-03-19 11:08:27 +01:00
Antonin RAFFIN
fd9e73cfb8 Fix entropy computation 2020-03-19 10:19:48 +01:00
Antonin RAFFIN
9485b90a41 Sync predict with SB and add version file 2020-03-18 15:11:19 +01:00
Raffin, Antonin
7f7b288dce Merge pull request #59 from Antonin-Raffin/feat/logger: Improve logging 2020-03-16 14:57:36 +01:00
Antonin RAFFIN
b37c23c149 Bump version and fix 2020-03-16 14:05:21 +01:00
Antonin RAFFIN
c3187604bc Code cleanup: rename lr to lr_schedule + typing 2020-03-16 14:01:32 +01:00
Antonin RAFFIN
a67bb75438 Type td3 policies 2020-03-16 13:31:06 +01:00
Antonin RAFFIN
d4ddb3d021 Update plotter 2020-03-16 12:04:57 +01:00
Antonin Raffin
cf89cac3e9 Update a2c logging 2020-03-13 11:48:16 +01:00
Antonin Raffin
29d7018265 Add better logging for SAC and PPO 2020-03-13 11:43:12 +01:00
Antonin Raffin
c39421fa64 Fix colors in results plotter 2020-03-13 10:59:16 +01:00
Raffin, Antonin
bfbe96c167 Merge pull request #57 from Antonin-Raffin/misc/improvements: Misc improvements 2020-03-12 15:42:05 +01:00
Antonin Raffin
70e601c03c Improve code and bump version 2020-03-12 15:34:35 +01:00
Antonin Raffin
765d8fc5b2 Fix event callback 2020-03-12 13:24:11 +01:00
Antonin Raffin
b64873ffff Sync callbacks 2020-03-12 12:34:25 +01:00
Antonin Raffin
18f38f8cf5 Reformat 2020-03-12 11:12:10 +01:00
Antonin Raffin
037986a91d Add test for expln 2020-03-11 16:35:13 +01:00
Antonin Raffin
c5e5812894 Finish typing A2C and PPO 2020-03-11 13:01:42 +01:00
Antonin Raffin
90d1558534 Type and reorder arguments 2020-03-11 12:45:21 +01:00
Antonin Raffin
7e3736ed56 Type A2C and PPO init 2020-03-10 18:17:47 +01:00
Antonin Raffin
35d0d2b320 More typing 2020-03-10 18:09:45 +01:00
Antonin Raffin
6ebad92e1b Remove default seed and bump dependencies 2020-03-10 17:43:54 +01:00
Antonin Raffin
80fb62e22d Bump version 2020-03-10 17:10:15 +01:00
Antonin Raffin
f159a4a9f2 Bug fix for A2C 2020-03-10 17:08:39 +01:00
Antonin Raffin
20ee8cb68d Update changelog and add more namedtuples 2020-03-10 16:55:13 +01:00
Antonin Raffin
fb4e66213d Use NamedTuple for buffers 2020-03-10 16:43:10 +01:00
Antonin Raffin
1e81f38d66 Update changelog 2020-03-09 19:05:22 +01:00
Antonin Raffin
67894dab9f Add clip_mean parameter 2020-03-09 19:02:40 +01:00
Antonin Raffin
26ccf499b3 Use normal sampling for SAC 2020-02-21 14:50:28 +01:00
Antonin Raffin
809a3d3d38 Release 0.2.0 2020-02-14 14:39:24 +01:00
Raffin, Antonin
f8e39953a6 Merge pull request #52 from Antonin-Raffin/refactor/predict: Refactor predict method 2020-02-14 14:34:44 +01:00
Antonin Raffin
af46aa19d1 Add copyright notice 2020-02-14 14:33:41 +01:00
Antonin Raffin
4392759057 Comment unused code 2020-02-14 14:15:55 +01:00
Antonin Raffin
e31b139c47 Add test for predict method 2020-02-14 14:03:41 +01:00
Antonin Raffin
8b559d71ab Remove deprecated monitor format and improve tests 2020-02-14 13:42:16 +01:00
Antonin Raffin
a2b1bf06d3 Add squash_output attribute to policy 2020-02-14 11:12:07 +01:00
Antonin Raffin
aa8b4eb22a Reformat and type the distributions 2020-02-13 13:46:22 +01:00
Antonin Raffin
f1a4fa2d3f Improve predict method 2020-02-12 15:25:05 +01:00
Antonin Raffin
9caea35a11 Add results plotter 2020-02-12 14:31:15 +01:00
Antonin Raffin
7bafdb3a67 Add get_vec_normalize_env() 2020-02-12 11:34:29 +01:00