Antonin RAFFIN
|
041f2bc59a
|
Cleanup, bug fixes + more tests
|
2020-04-22 13:14:22 +02:00 |
|
Antonin RAFFIN
|
73fb8d1c63
|
Add CNN support for TD3
|
2020-04-22 11:05:46 +02:00 |
|
Antonin RAFFIN
|
8f4155180e
|
Revert "Filter out features extractor weights"
This reverts commit 93f9de799add0874878a91fe9eaf8162321066b5.
|
2020-04-22 10:56:33 +02:00 |
|
Antonin RAFFIN
|
4b826f2e2d
|
Filter out features extractor weights
|
2020-04-22 10:56:17 +02:00 |
|
Antonin RAFFIN
|
b289aca5fe
|
Bug fixes at loading and predict time
|
2020-04-21 21:06:07 +02:00 |
|
Antonin RAFFIN
|
8aac9e819d
|
Add VecTransposeImage and fix for SAC
|
2020-04-21 20:41:58 +02:00 |
|
Antonin RAFFIN
|
93c2a01f91
|
Start CNN support (failing for SAC)
|
2020-04-21 16:22:46 +02:00 |
|
Raffin, Antonin
|
6d2fc328c9
|
Merge pull request #71 from Antonin-Raffin/feat/full-policy-save
Independent policy save/load and bug fixes
|
2020-04-21 14:56:44 +02:00 |
|
Antonin RAFFIN
|
6bd2d87f33
|
Improve doc
|
2020-04-20 16:21:47 +02:00 |
|
Antonin RAFFIN
|
f347474e6a
|
Independent save/load for policies
|
2020-04-20 15:59:44 +02:00 |
|
Antonin RAFFIN
|
864c976d4d
|
Complete save/load for A2C/PPO policies
|
2020-04-20 15:50:46 +02:00 |
|
Antonin RAFFIN
|
a4df08cd28
|
Complete save/load for TD3 policy
|
2020-04-20 15:43:23 +02:00 |
|
Antonin RAFFIN
|
17f9246257
|
Add get_device util and fix squash_output
|
2020-04-20 15:43:11 +02:00 |
|
Antonin RAFFIN
|
aa1026ee87
|
Added `optimizer` and `optimizer_kwargs` to `policy_kwargs`
|
2020-04-17 15:13:45 +02:00 |
|
Antonin RAFFIN
|
0e44cdce44
|
Fixed `reset_num_timesteps` behavior
|
2020-04-17 12:36:27 +02:00 |
|
Antonin RAFFIN
|
08a22c4834
|
Release 0.4.0
|
2020-04-14 18:13:51 +02:00 |
|
Raffin, Antonin
|
cf840ed928
|
Merge pull request #65 from Antonin-Raffin/feat/policy-save-load
Policy save/load - Action dist refactor
|
2020-04-06 15:53:42 +02:00 |
|
Antonin RAFFIN
|
c2c5d0798f
|
Fix: pass device for SAC
|
2020-04-06 15:17:30 +02:00 |
|
Antonin RAFFIN
|
c97dbb3173
|
Fix device arg that was not passed to child networks
|
2020-04-06 15:12:20 +02:00 |
|
Antonin RAFFIN
|
6e470d0f72
|
Fix: don't change the device when saving
|
2020-03-31 18:38:37 +02:00 |
|
Antonin RAFFIN
|
71ce9ef2f4
|
Add test for actor
|
2020-03-31 18:26:26 +02:00 |
|
Antonin RAFFIN
|
2c5e41ec47
|
Renaming (everything plural) for consistency
|
2020-03-31 18:13:03 +02:00 |
|
Antonin RAFFIN
|
5a731d04ee
|
Fix docstring
|
2020-03-31 18:00:29 +02:00 |
|
Antonin RAFFIN
|
c264403816
|
Rename for consistency
+ add _predict to actors
+ improve sac actor code
|
2020-03-31 17:48:23 +02:00 |
|
Antonin RAFFIN
|
2bbf6a9462
|
Minor: remove comment
|
2020-03-31 16:40:53 +02:00 |
|
Antonin RAFFIN
|
fdecd512db
|
Add save/load weights for policies and refactor action distributions
|
2020-03-31 16:29:13 +02:00 |
|
Antonin RAFFIN
|
b782f3a208
|
Fix for test failures
|
2020-03-31 10:18:56 +02:00 |
|
Raffin, Antonin
|
fc7b1e80f1
|
Merge pull request #63 from Antonin-Raffin/feat/preprocessing
Refactor preprocessing
|
2020-03-25 17:00:12 +01:00 |
|
Antonin RAFFIN
|
fa599c65a6
|
Add support for Discrete observation spaces
|
2020-03-25 16:42:05 +01:00 |
|
Antonin RAFFIN
|
52d2cd6a1b
|
Remove gym warnings
|
2020-03-25 15:54:58 +01:00 |
|
Antonin RAFFIN
|
72a88a8d92
|
Fix type hint for activation fn
|
2020-03-24 10:10:37 +01:00 |
|
Antonin RAFFIN
|
ba18258af6
|
Add proper preprocessing
|
2020-03-23 17:15:30 +01:00 |
|
Antonin RAFFIN
|
4b2092f55a
|
Remove base network
|
2020-03-23 15:31:14 +01:00 |
|
Antonin RAFFIN
|
dcb54b5301
|
Remove CEMRL
|
2020-03-23 14:48:38 +01:00 |
|
Antonin RAFFIN
|
b96a081e5f
|
Add check for image space
|
2020-03-20 11:20:57 +01:00 |
|
Antonin RAFFIN
|
57b37513b6
|
Refactor handling of obs and action space
+ remove duplicated code
|
2020-03-20 10:09:09 +01:00 |
|
Antonin RAFFIN
|
7251b9d2c2
|
Release v0.3.0
|
2020-03-19 11:11:36 +01:00 |
|
Raffin, Antonin
|
326b7a31c7
|
Merge pull request #61 from Antonin-Raffin/feat/more-tests
More tests and sync SB
|
2020-03-19 11:08:27 +01:00 |
|
Antonin RAFFIN
|
fd9e73cfb8
|
Fix entropy computation
|
2020-03-19 10:19:48 +01:00 |
|
Antonin RAFFIN
|
9485b90a41
|
Sync predict with SB and add version file
|
2020-03-18 15:11:19 +01:00 |
|
Raffin, Antonin
|
7f7b288dce
|
Merge pull request #59 from Antonin-Raffin/feat/logger
Improve logging
|
2020-03-16 14:57:36 +01:00 |
|
Antonin RAFFIN
|
b37c23c149
|
Bump version and fix
|
2020-03-16 14:05:21 +01:00 |
|
Antonin RAFFIN
|
c3187604bc
|
Code cleanup: rename lr to lr_schedule + typing
|
2020-03-16 14:01:32 +01:00 |
|
Antonin RAFFIN
|
a67bb75438
|
Type td3 policies
|
2020-03-16 13:31:06 +01:00 |
|
Antonin RAFFIN
|
d4ddb3d021
|
Update plotter
|
2020-03-16 12:04:57 +01:00 |
|
Antonin Raffin
|
cf89cac3e9
|
Update a2c logging
|
2020-03-13 11:48:16 +01:00 |
|
Antonin Raffin
|
29d7018265
|
Add better logging for SAC and PPO
|
2020-03-13 11:43:12 +01:00 |
|
Antonin Raffin
|
c39421fa64
|
Fix colors in results plotter
|
2020-03-13 10:59:16 +01:00 |
|
Raffin, Antonin
|
bfbe96c167
|
Merge pull request #57 from Antonin-Raffin/misc/improvements
Misc improvements
|
2020-03-12 15:42:05 +01:00 |
|
Antonin Raffin
|
70e601c03c
|
Improve code and bump version
|
2020-03-12 15:34:35 +01:00 |
|