Antonin RAFFIN
041f2bc59a
Cleanup, bug fixes + more tests
2020-04-22 13:14:22 +02:00
Antonin RAFFIN
73fb8d1c63
Add CNN support for TD3
2020-04-22 11:05:46 +02:00
Antonin RAFFIN
8aac9e819d
Add VecTransposeImage and fix for SAC
2020-04-21 20:41:58 +02:00
Antonin RAFFIN
93c2a01f91
Start CNN support (failing for SAC)
2020-04-21 16:22:46 +02:00
Antonin RAFFIN
f347474e6a
Independent save/load for policies
2020-04-20 15:59:44 +02:00
Antonin RAFFIN
aa1026ee87
Added `optimizer and optimizer_kwargs to policy_kwargs`
2020-04-17 15:13:45 +02:00
Antonin RAFFIN
71ce9ef2f4
Add test for actor
2020-03-31 18:26:26 +02:00
Antonin RAFFIN
c264403816
Rename for consistency
...
+ add _predict to actors
+ improve sac actor code
2020-03-31 17:48:23 +02:00
Antonin RAFFIN
2bbf6a9462
Minor: remove comment
2020-03-31 16:40:53 +02:00
Antonin RAFFIN
fdecd512db
Add save/load weights for policies and refactor action distributions
2020-03-31 16:29:13 +02:00
Antonin RAFFIN
b782f3a208
Fix for test failures
2020-03-31 10:18:56 +02:00
Antonin RAFFIN
fa599c65a6
Add support for Discrete observation spaces
2020-03-25 16:42:05 +01:00
Antonin RAFFIN
4b2092f55a
Remove base network
2020-03-23 15:31:14 +01:00
Antonin RAFFIN
dcb54b5301
Remove CEMRL
2020-03-23 14:48:38 +01:00
Antonin RAFFIN
fd9e73cfb8
Fix entropy computation
2020-03-19 10:19:48 +01:00
Antonin RAFFIN
9485b90a41
Sync predict with SB and add version file
2020-03-18 15:11:19 +01:00
Antonin Raffin
70e601c03c
Improve code and bump version
2020-03-12 15:34:35 +01:00
Antonin Raffin
b64873ffff
Sync callbacks
2020-03-12 12:34:25 +01:00
Antonin Raffin
18f38f8cf5
Reformat
2020-03-12 11:12:10 +01:00
Antonin Raffin
037986a91d
Add test for expln
2020-03-11 16:35:13 +01:00
Antonin Raffin
4392759057
Comment unused code
2020-02-14 14:15:55 +01:00
Antonin Raffin
e31b139c47
Add test for predict method
2020-02-14 14:03:41 +01:00
Antonin Raffin
8b559d71ab
Remove deprecated monitor format and improve tests
2020-02-14 13:42:16 +01:00
Antonin Raffin
f1a4fa2d3f
Improve predict method
2020-02-12 15:25:05 +01:00
Antonin Raffin
7bafdb3a67
Add get_vec_normalize_env()
2020-02-12 11:34:29 +01:00
Antonin Raffin
2ce31c1e21
Fix entropy loss for squashed Gaussian and VecEnv seeding
2020-02-11 17:22:03 +01:00
Antonin Raffin
2afcf395b9
Update tests
2020-02-11 16:42:25 +01:00
Antonin Raffin
b7dcc8d58e
Add extend method
2020-02-11 16:40:44 +01:00
Antonin Raffin
75a86881b3
Add save/load for replay buffer
2020-02-05 13:10:02 +01:00
Antonin Raffin
d850a35311
Update tests
2020-02-03 15:57:37 +01:00
Antonin Raffin
ec657cc34e
Fix tests and change log_path behavior for EvalCallback
2020-01-31 13:42:04 +01:00
Antonin Raffin
6d59bfd4a0
Merge branch 'master' into feat/callbacks
2020-01-31 13:09:55 +01:00
Dormann, Noah
1f0dd60b97
Fix saving on GPU - Loading on CPU ( #45 )
...
* removed policy from save, changed th.loads to map to device
* found hack: catch pickle exception and trying th.load with mapping instead, otherwise raise exception with more information -> loading cuda on cpu raises exception -> leads to th.load with map being called
* deleted todo
* updated changelog
* start of saving refactor
* first working c
* all tests pass, save refactored
* - backwards compatibilty not always
- make pytest all passing
- make typing all passing
* Fixes and simplify the save method
* Remove unused param
* Fix backward compat
* Fix docstring
2020-01-31 13:06:55 +01:00
Antonin Raffin
b66003cfb3
Add callback support
2020-01-27 14:32:31 +01:00
Antonin Raffin
e5c6601726
Update VecNormalize (pickling) and improve tests
2020-01-20 11:58:16 +01:00
Antonin Raffin
89db65b1fb
Improve logger testing + add readers
2020-01-20 11:58:00 +01:00
Antonin Raffin
c542009641
Clean up code + bug fixes
2020-01-20 11:17:55 +01:00
Antonin Raffin
07345e5e27
Test for differential entropy
2019-12-18 13:45:56 +01:00
Antonin Raffin
0117cc37f4
Merge branch 'master' into feat/sde-features
2019-12-05 16:33:41 +01:00
Noah Dormann
88d4f44d55
added set_env test and set_env wrapping
2019-12-05 13:59:07 +01:00
Noah Dormann
8062ed6036
fixed load, to check if environment ist correctly
2019-12-05 13:36:19 +01:00
Noah Dormann
c3b0398d56
Changed load so it still works when env not saved
...
improved save function
2019-12-05 08:40:28 +01:00
Dormann, Noah
362bba73ba
adapted common style
...
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
2019-12-05 08:07:43 +01:00
Antonin Raffin
3cdd5f20af
Bug fix + add test for sde net arch
2019-12-02 14:14:48 +01:00
Antonin Raffin
21e655ecbf
Add test for SAC with different entropy temperature
2019-12-02 11:47:52 +01:00
Antonin RAFFIN
03a84f97ea
Add monte-carlo test for SDE distribution
2019-12-01 16:46:39 +01:00
Noah Dormann
c82025e673
Add Test for exclude/include feature of save
2019-11-28 16:07:15 +01:00
Noah Dormann
e95858784a
Formatted all files
2019-11-28 15:38:04 +01:00
Noah Dormann
9ff59eaf3d
Added attribute self.policy_class to prevent errors when using self.policy as class
2019-11-28 15:25:01 +01:00
Noah Dormann
e26564e0ec
Added function for setting up any attributes that weren't saved and thus not loaded
2019-11-28 13:35:16 +01:00