Commit graph

354 commits

Author SHA1 Message Date
Antonin RAFFIN
04d85ac2e2 Fix import in tests 2020-05-05 16:32:08 +02:00
Antonin RAFFIN
62bde9a970 Fix import 2020-05-05 16:30:26 +02:00
Antonin RAFFIN
2c34a4d694 Sync with Stable-Baselines 2020-05-05 16:28:38 +02:00
Antonin RAFFIN
d542732c8d Rename to stable-baselines3 2020-05-05 15:02:35 +02:00
Raffin, Antonin
4a2c247592 Merge pull request #74 from Antonin-Raffin/feat/open-source
Add type hints and f-strings to logger
2020-05-05 14:54:33 +02:00
Antonin RAFFIN
88cee2ba55 Add type hints and f-strings to logger 2020-05-05 14:49:32 +02:00
Raffin, Antonin
03bd086555 Merge pull request #72 from Antonin-Raffin/feat/cnn
Add CNN support
2020-05-05 11:51:14 +02:00
Antonin RAFFIN
5572e28e44 Update makefile 2020-04-30 11:26:28 +02:00
Antonin RAFFIN
559f43b522 Improve error messages 2020-04-27 11:12:19 +02:00
Antonin RAFFIN
52ad917a3a Fix f-string 2020-04-25 21:45:03 +02:00
Antonin RAFFIN
7ae54206ce Reformat and code cleanup 2020-04-23 15:18:21 +02:00
Antonin RAFFIN
71df3c7409 Add docstrings and missing types 2020-04-23 14:56:05 +02:00
Antonin RAFFIN
271a0a7818 Add sanity check for clip_range_vf 2020-04-23 09:59:09 +02:00
Antonin RAFFIN
02b452a9f9 Avoid double wrapping 2020-04-22 18:28:00 +02:00
Antonin RAFFIN
f38ddcb278 Allow any number of channels 2020-04-22 16:11:23 +02:00
Antonin RAFFIN
f3cb0688c4 Fix custom optimizer 2020-04-22 13:21:11 +02:00
Antonin RAFFIN
041f2bc59a Cleanup, bug fixes + more tests 2020-04-22 13:14:22 +02:00
Antonin RAFFIN
73fb8d1c63 Add CNN support for TD3 2020-04-22 11:05:46 +02:00
Antonin RAFFIN
8f4155180e Revert "Filter out features extractor weights"
This reverts commit 93f9de799add0874878a91fe9eaf8162321066b5.
2020-04-22 10:56:33 +02:00
Antonin RAFFIN
4b826f2e2d Filter out features extractor weights 2020-04-22 10:56:17 +02:00
Antonin RAFFIN
b289aca5fe Bug fixes at loading and predict time 2020-04-21 21:06:07 +02:00
Antonin RAFFIN
8aac9e819d Add VecTransposeImage and fix for SAC 2020-04-21 20:41:58 +02:00
Antonin RAFFIN
93c2a01f91 Start CNN support (failing for SAC) 2020-04-21 16:22:46 +02:00
Raffin, Antonin
6d2fc328c9 Merge pull request #71 from Antonin-Raffin/feat/full-policy-save
Independent policy save/load and bug fixes
2020-04-21 14:56:44 +02:00
Antonin RAFFIN
6bd2d87f33 Improve doc 2020-04-20 16:21:47 +02:00
Antonin RAFFIN
f347474e6a Independent save/load for policies 2020-04-20 15:59:44 +02:00
Antonin RAFFIN
864c976d4d Complete save/load for A2C/PPO policies 2020-04-20 15:50:46 +02:00
Antonin RAFFIN
a4df08cd28 Complete save/load for TD3 policy 2020-04-20 15:43:23 +02:00
Antonin RAFFIN
17f9246257 Add get_device util and fix squash_output 2020-04-20 15:43:11 +02:00
Antonin RAFFIN
aa1026ee87 Added `optimizer` and `optimizer_kwargs` to `policy_kwargs` 2020-04-17 15:13:45 +02:00
Antonin RAFFIN
0e44cdce44 Fixed `reset_num_timesteps` behavior 2020-04-17 12:36:27 +02:00
Antonin RAFFIN
08a22c4834 Release 0.4.0 2020-04-14 18:13:51 +02:00
Raffin, Antonin
cf840ed928 Merge pull request #65 from Antonin-Raffin/feat/policy-save-load
Policy save/load - Action dist refactor
2020-04-06 15:53:42 +02:00
Antonin RAFFIN
c2c5d0798f Fix: pass device for SAC 2020-04-06 15:17:30 +02:00
Antonin RAFFIN
c97dbb3173 Fix device arg that was not pass to children networks 2020-04-06 15:12:20 +02:00
Antonin RAFFIN
6e470d0f72 Fix: don't change the device when saving 2020-03-31 18:38:37 +02:00
Antonin RAFFIN
71ce9ef2f4 Add test for actor 2020-03-31 18:26:26 +02:00
Antonin RAFFIN
2c5e41ec47 Renaming (everything plural) for consistency 2020-03-31 18:13:03 +02:00
Antonin RAFFIN
5a731d04ee Fix docstring 2020-03-31 18:00:29 +02:00
Antonin RAFFIN
c264403816 Rename for consistency
+ add _predict to actors
+ improve sac actor code
2020-03-31 17:48:23 +02:00
Antonin RAFFIN
2bbf6a9462 Minor: remove comment 2020-03-31 16:40:53 +02:00
Antonin RAFFIN
fdecd512db Add save/load weights for policies and refactor action distributions 2020-03-31 16:29:13 +02:00
Antonin RAFFIN
b782f3a208 Fix for test failures 2020-03-31 10:18:56 +02:00
Raffin, Antonin
fc7b1e80f1 Merge pull request #63 from Antonin-Raffin/feat/preprocessing
Refactor preprocessing
2020-03-25 17:00:12 +01:00
Antonin RAFFIN
fa599c65a6 Add support for Discrete observation spaces 2020-03-25 16:42:05 +01:00
Antonin RAFFIN
52d2cd6a1b Remove gym warnings 2020-03-25 15:54:58 +01:00
Antonin RAFFIN
72a88a8d92 Fix type hint for activation fn 2020-03-24 10:10:37 +01:00
Antonin RAFFIN
ba18258af6 Add proper preprocessing 2020-03-23 17:15:30 +01:00
Antonin RAFFIN
4b2092f55a Remove base network 2020-03-23 15:31:14 +01:00
Antonin RAFFIN
dcb54b5301 Remove CEMRL 2020-03-23 14:48:38 +01:00