Antonin RAFFIN
d542732c8d
Rename to stable-baselines3
2020-05-05 15:02:35 +02:00
Antonin RAFFIN
f38ddcb278
Allow any number of channels
2020-04-22 16:11:23 +02:00
Antonin RAFFIN
041f2bc59a
Cleanup, bug fixes + more tests
2020-04-22 13:14:22 +02:00
Antonin RAFFIN
f347474e6a
Independent save/load for policies
2020-04-20 15:59:44 +02:00
Antonin RAFFIN
71ce9ef2f4
Add test for actor
2020-03-31 18:26:26 +02:00
Antonin RAFFIN
2bbf6a9462
Minor: remove comment
2020-03-31 16:40:53 +02:00
Antonin RAFFIN
fdecd512db
Add save/load weights for policies and refactor action distributions
2020-03-31 16:29:13 +02:00
Antonin RAFFIN
fa599c65a6
Add support for Discrete observation spaces
2020-03-25 16:42:05 +01:00
Antonin RAFFIN
dcb54b5301
Remove CEMRL
2020-03-23 14:48:38 +01:00
Antonin RAFFIN
9485b90a41
Sync predict with SB and add version file
2020-03-18 15:11:19 +01:00
Antonin Raffin
18f38f8cf5
Reformat
2020-03-12 11:12:10 +01:00
Antonin Raffin
e31b139c47
Add test for predict method
2020-02-14 14:03:41 +01:00
Antonin Raffin
f1a4fa2d3f
Improve predict method
2020-02-12 15:25:05 +01:00
Antonin Raffin
2afcf395b9
Update tests
2020-02-11 16:42:25 +01:00
Antonin Raffin
b7dcc8d58e
Add extend method
2020-02-11 16:40:44 +01:00
Antonin Raffin
75a86881b3
Add save/load for replay buffer
2020-02-05 13:10:02 +01:00
Antonin Raffin
ec657cc34e
Fix tests and change log_path behavior for EvalCallback
2020-01-31 13:42:04 +01:00
Dormann, Noah
1f0dd60b97
Fix saving on GPU - Loading on CPU ( #45 )
...
* removed policy from save, changed th.loads to map to device
* found hack: catch pickle exception and trying th.load with mapping instead, otherwise raise exception with more information -> loading cuda on cpu raises exception -> leads to th.load with map being called
* deleted todo
* updated changelog
* start of saving refactor
* first working c
* all tests pass, save refactored
* - backwards compatibilty not always
- make pytest all passing
- make typing all passing
* Fixes and simplify the save method
* Remove unused param
* Fix backward compat
* Fix docstring
2020-01-31 13:06:55 +01:00
Antonin Raffin
c542009641
Clean up code + bug fixes
2020-01-20 11:17:55 +01:00
Noah Dormann
88d4f44d55
added set_env test and set_env wrapping
2019-12-05 13:59:07 +01:00
Noah Dormann
8062ed6036
fixed load, to check if environment ist correctly
2019-12-05 13:36:19 +01:00
Noah Dormann
c3b0398d56
Changed load so it still works when env not saved
...
improved save function
2019-12-05 08:40:28 +01:00
Dormann, Noah
362bba73ba
adapted common style
...
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
2019-12-05 08:07:43 +01:00
Noah Dormann
c82025e673
Add Test for exclude/include feature of save
2019-11-28 16:07:15 +01:00
Noah Dormann
e95858784a
Formatted all files
2019-11-28 15:38:04 +01:00
Noah Dormann
9ff59eaf3d
Added attribute self.policy_class to prevent errors when using self.policy as class
2019-11-28 15:25:01 +01:00
Noah Dormann
e26564e0ec
Added function for setting up any attributes that weren't saved and thus not loaded
2019-11-28 13:35:16 +01:00
Noah Dormann
812cab84ac
Changed PPO deterministic
2019-11-28 11:20:40 +01:00
Noah Dormann
2d72f6d1b5
Added SAC, TD3, A2C
...
Missing CEMRL
2019-11-21 16:46:53 +01:00
Noah Dormann
775a50cc5c
saving all variables now added a2c support
2019-11-21 16:24:18 +01:00
Noah Dormann
526c37bf1f
refactored the assets in test_save_load
...
fixed base_class 'params.pth'
2019-11-21 15:44:57 +01:00
Noah Dormann
fb5f192fc4
Implemented Changes suggested from Antonin-Raffin
...
Added Optimizer saving
2019-11-21 14:39:44 +01:00
Noah Dormann
b20b70db48
Clean reformat
2019-11-21 11:51:47 +01:00
Noah Dormann
5bca52a87d
rearranged imports
2019-11-21 11:44:37 +01:00
Noah Dormann
4b6234a1c8
finished test_save_load.py test
2019-11-21 11:39:47 +01:00
Noah Dormann
cc744a48b5
first save and load features
2019-11-12 17:03:57 +01:00