Antonin RAFFIN
041f2bc59a
Cleanup, bug fixes + more tests
2020-04-22 13:14:22 +02:00
Antonin RAFFIN
8aac9e819d
Add VecTransposeImage and fix for SAC
2020-04-21 20:41:58 +02:00
Antonin RAFFIN
93c2a01f91
Start CNN support (failing for SAC)
2020-04-21 16:22:46 +02:00
Antonin RAFFIN
f347474e6a
Independent save/load for policies
2020-04-20 15:59:44 +02:00
Antonin RAFFIN
17f9246257
Add get_device util and fix squash_output
2020-04-20 15:43:11 +02:00
Antonin RAFFIN
aa1026ee87
Added `optimizer` and `optimizer_kwargs` to `policy_kwargs`
2020-04-17 15:13:45 +02:00
Antonin RAFFIN
0e44cdce44
Fixed `reset_num_timesteps` behavior
2020-04-17 12:36:27 +02:00
Antonin RAFFIN
08a22c4834
Release 0.4.0
2020-04-14 18:13:51 +02:00
Antonin RAFFIN
fdecd512db
Add save/load weights for policies and refactor action distributions
2020-03-31 16:29:13 +02:00
Antonin RAFFIN
fa599c65a6
Add support for Discrete observation spaces
2020-03-25 16:42:05 +01:00
Antonin RAFFIN
72a88a8d92
Fix type hint for activation fn
2020-03-24 10:10:37 +01:00
Antonin RAFFIN
ba18258af6
Add proper preprocessing
2020-03-23 17:15:30 +01:00
Antonin RAFFIN
dcb54b5301
Remove CEMRL
2020-03-23 14:48:38 +01:00
Antonin RAFFIN
57b37513b6
Refactor handling of obs and action space
+ remove duplicated code
2020-03-20 10:09:09 +01:00
Antonin RAFFIN
7251b9d2c2
Release v0.3.0
2020-03-19 11:11:36 +01:00
Antonin RAFFIN
fd9e73cfb8
Fix entropy computation
2020-03-19 10:19:48 +01:00
Antonin RAFFIN
9485b90a41
Sync predict with SB and add version file
2020-03-18 15:11:19 +01:00
Antonin RAFFIN
c3187604bc
Code cleanup: rename lr to lr_schedule + typing
2020-03-16 14:01:32 +01:00
Antonin Raffin
29d7018265
Add better logging for SAC and PPO
2020-03-13 11:43:12 +01:00
Antonin Raffin
c39421fa64
Fix colors in results plotter
2020-03-13 10:59:16 +01:00
Antonin Raffin
b64873ffff
Sync callbacks
2020-03-12 12:34:25 +01:00
Antonin Raffin
037986a91d
Add test for expln
2020-03-11 16:35:13 +01:00
Antonin Raffin
6ebad92e1b
Remove default seed and bump dependencies
2020-03-10 17:43:54 +01:00
Antonin Raffin
20ee8cb68d
Update changelog and add more namedtuples
2020-03-10 16:55:13 +01:00
Antonin Raffin
1e81f38d66
Update changelog
2020-03-09 19:05:22 +01:00
Antonin Raffin
26ccf499b3
Use normal sampling for SAC
2020-02-21 14:50:28 +01:00
Antonin Raffin
809a3d3d38
Release 0.2.0
2020-02-14 14:39:24 +01:00
Antonin Raffin
8b559d71ab
Remove deprecated monitor format and improve tests
2020-02-14 13:42:16 +01:00
Antonin Raffin
f1a4fa2d3f
Improve predict method
2020-02-12 15:25:05 +01:00
Antonin Raffin
9caea35a11
Add results plotter
2020-02-12 14:31:15 +01:00
Antonin Raffin
7bafdb3a67
Add get_vec_normalize_env()
2020-02-12 11:34:29 +01:00
Antonin Raffin
2ce31c1e21
Fix entropy loss for squashed Gaussian and VecEnv seeding
2020-02-11 17:22:03 +01:00
Antonin Raffin
b7dcc8d58e
Add extend method
2020-02-11 16:40:44 +01:00
Antonin Raffin
75a86881b3
Add save/load for replay buffer
2020-02-05 13:10:02 +01:00
Antonin Raffin
c2318149dd
Update changelog and version
2020-02-03 15:50:40 +01:00
Antonin Raffin
5d4e73544c
Fix reset_num_timesteps
2020-01-31 13:16:28 +01:00
Antonin Raffin
6d59bfd4a0
Merge branch 'master' into feat/callbacks
2020-01-31 13:09:55 +01:00
Dormann, Noah
1f0dd60b97
Fix saving on GPU - Loading on CPU (#45)
* removed policy from save, changed th.loads to map to device
* found hack: catch the pickle exception and try th.load with mapping instead, otherwise raise an exception with more information -> loading cuda on cpu raises an exception -> leads to th.load with map being called
* deleted todo
* updated changelog
* start of saving refactor
* first working c
* all tests pass, save refactored
* - backwards compatibility not always
  - make pytest all passing
  - make typing all passing
* Fix and simplify the save method
* Remove unused param
* Fix backward compat
* Fix docstring
2020-01-31 13:06:55 +01:00
Antonin Raffin
98037352f5
Update changelog
2020-01-27 15:57:34 +01:00
Antonin Raffin
b66003cfb3
Add callback support
2020-01-27 14:32:31 +01:00
Antonin Raffin
0328a39d1b
Update changelog
2020-01-22 17:25:08 +01:00
Antonin Raffin
9345b85cfc
Update changelog and README
2020-01-22 17:23:42 +01:00
Antonin Raffin
9e250b6818
Build doc
2020-01-20 16:19:35 +01:00
Antonin Raffin
b4dc9d4e4d
Add doc
2019-09-26 11:46:40 +02:00