Commit graph

64 commits

Author SHA1 Message Date
Antonin RAFFIN
e6ff4bbd6c Update setup 2020-05-07 16:24:19 +02:00
Antonin RAFFIN
aa66012764 Update requirements 2020-05-07 16:21:33 +02:00
Antonin RAFFIN
c5f29a3931 Fix docs 2020-05-07 16:15:32 +02:00
Antonin RAFFIN
8046a24719 More doc + sync VecEnvs + atari 2020-05-07 16:08:23 +02:00
Antonin RAFFIN
73afaf157c Add version.txt to package 2020-05-07 12:19:29 +02:00
Antonin RAFFIN
98e05cf578 Add dockerfile + update doc 2020-05-07 11:08:49 +02:00
Antonin RAFFIN
d17f29c8ad Add base doc 2020-05-07 10:10:51 +02:00
Antonin RAFFIN
a3f9efe04a Update doc 2020-05-05 17:41:57 +02:00
Antonin RAFFIN
580317158b Update changelog 2020-05-05 17:21:56 +02:00
Antonin RAFFIN
0481fbe727 Update changelog 2020-05-05 16:54:33 +02:00
Antonin RAFFIN
2c34a4d694 Sync with Stable-Baselines 2020-05-05 16:28:38 +02:00
Antonin RAFFIN
d542732c8d Rename to stable-baselines3 2020-05-05 15:02:35 +02:00
Antonin RAFFIN
88cee2ba55 Add type hints and f-strings to logger 2020-05-05 14:49:32 +02:00
Antonin RAFFIN
041f2bc59a Cleanup, bug fixes + more tests 2020-04-22 13:14:22 +02:00
Antonin RAFFIN
8aac9e819d Add VecTransposeImage and fix for SAC 2020-04-21 20:41:58 +02:00
Antonin RAFFIN
93c2a01f91 Start CNN support (failing for SAC) 2020-04-21 16:22:46 +02:00
Antonin RAFFIN
f347474e6a Independent save/load for policies 2020-04-20 15:59:44 +02:00
Antonin RAFFIN
17f9246257 Add get_device util and fix squash_output 2020-04-20 15:43:11 +02:00
Antonin RAFFIN
aa1026ee87 Added `optimizer and optimizer_kwargs to policy_kwargs` 2020-04-17 15:13:45 +02:00
Antonin RAFFIN
0e44cdce44 Fixed `reset_num_timesteps` behavior 2020-04-17 12:36:27 +02:00
Antonin RAFFIN
08a22c4834 Release 0.4.0 2020-04-14 18:13:51 +02:00
Antonin RAFFIN
fdecd512db Add save/load weights for policies and refactor action distributions 2020-03-31 16:29:13 +02:00
Antonin RAFFIN
fa599c65a6 Add support for Discrete observation spaces 2020-03-25 16:42:05 +01:00
Antonin RAFFIN
72a88a8d92 Fix type hint for activation fn 2020-03-24 10:10:37 +01:00
Antonin RAFFIN
ba18258af6 Add proper preprocessing 2020-03-23 17:15:30 +01:00
Antonin RAFFIN
dcb54b5301 Remove CEMRL 2020-03-23 14:48:38 +01:00
Antonin RAFFIN
57b37513b6 Refactor handling of obs and action space + remove duplicated code 2020-03-20 10:09:09 +01:00
Antonin RAFFIN
7251b9d2c2 Release v0.3.0 2020-03-19 11:11:36 +01:00
Antonin RAFFIN
fd9e73cfb8 Fix entropy computation 2020-03-19 10:19:48 +01:00
Antonin RAFFIN
9485b90a41 Sync predict with SB and add version file 2020-03-18 15:11:19 +01:00
Antonin RAFFIN
c3187604bc Code cleanup: rename lr to lr_schedule + typing 2020-03-16 14:01:32 +01:00
Antonin Raffin
29d7018265 Add better logging for SAC and PPO 2020-03-13 11:43:12 +01:00
Antonin Raffin
c39421fa64 Fix colors in results plotter 2020-03-13 10:59:16 +01:00
Antonin Raffin
b64873ffff Sync callbacks 2020-03-12 12:34:25 +01:00
Antonin Raffin
037986a91d Add test for expln 2020-03-11 16:35:13 +01:00
Antonin Raffin
6ebad92e1b Remove default seed and bump dependencies 2020-03-10 17:43:54 +01:00
Antonin Raffin
20ee8cb68d Update changelog and add more namedtuples 2020-03-10 16:55:13 +01:00
Antonin Raffin
1e81f38d66 Update changelog 2020-03-09 19:05:22 +01:00
Antonin Raffin
26ccf499b3 Use normal sampling for SAC 2020-02-21 14:50:28 +01:00
Antonin Raffin
809a3d3d38 Release 0.2.0 2020-02-14 14:39:24 +01:00
Antonin Raffin
8b559d71ab Remove deprecated monitor format and improve tests 2020-02-14 13:42:16 +01:00
Antonin Raffin
f1a4fa2d3f Improve predict method 2020-02-12 15:25:05 +01:00
Antonin Raffin
9caea35a11 Add results plotter 2020-02-12 14:31:15 +01:00
Antonin Raffin
7bafdb3a67 Add get_vec_normalize_env() 2020-02-12 11:34:29 +01:00
Antonin Raffin
2ce31c1e21 Fix entropy loss for squashed Gaussian and VecEnv seeding 2020-02-11 17:22:03 +01:00
Antonin Raffin
b7dcc8d58e Add extend method 2020-02-11 16:40:44 +01:00
Antonin Raffin
75a86881b3 Add save/load for replay buffer 2020-02-05 13:10:02 +01:00
Antonin Raffin
d850a35311 Update tests 2020-02-03 15:57:37 +01:00
Antonin Raffin
c2318149dd Update changelog and version 2020-02-03 15:50:40 +01:00
Antonin Raffin
5d4e73544c Fix reset_num_timesteps 2020-01-31 13:16:28 +01:00