Commit graph

449 commits

Author SHA1 Message Date
Noah Dormann
6928879f5a Refactored doc-strings 2019-11-28 16:30:13 +01:00
Noah Dormann
7ce610fade Deleted exact match parameter of load_parameters 2019-11-28 16:11:35 +01:00
Noah Dormann
c82025e673 Add Test for exclude/include feature of save 2019-11-28 16:07:15 +01:00
Noah Dormann
ee6f938ddc Added option to explicitly specify excluded parameters 2019-11-28 15:42:53 +01:00
Noah Dormann
e95858784a Formatted all files 2019-11-28 15:38:04 +01:00
Noah Dormann
751ccf85e7 _setup_model() is now called when model is loaded 2019-11-28 15:33:52 +01:00
Noah Dormann
9ff59eaf3d Added attribute self.policy_class to prevent errors when using self.policy as class 2019-11-28 15:25:01 +01:00
Noah Dormann
e26564e0ec Added function for setting up any attributes that weren't saved and thus not loaded 2019-11-28 13:35:16 +01:00
Noah Dormann
c75582dfbe resolving conflicts
# Conflicts:
#	torchy_baselines/a2c/a2c.py
#	torchy_baselines/ppo/ppo.py

Added optimizer params test
2019-11-28 12:12:06 +01:00
Noah Dormann
812cab84ac Changed PPO deterministic 2019-11-28 11:20:40 +01:00
Noah Dormann
b75ffe166a Cleared base_class.load description 2019-11-28 10:36:21 +01:00
Antonin Raffin
fbe29a7298 Track down autograd error
"Trying to backward through the graph a second time" -> added a comment
2019-11-27 17:29:47 +01:00
Antonin Raffin
fe67a98711 Log more values 2019-11-26 17:44:06 +01:00
Antonin Raffin
5483e02d1a Add SDE support for SAC 2019-11-26 15:26:12 +01:00
Antonin Raffin
d26fcf4566 Fix grad computation for sde test 2019-11-26 11:57:48 +01:00
Antonin Raffin
0885dbe74b Bug fix in choosing the distribution 2019-11-25 15:02:10 +01:00
Antonin Raffin
5d6649d92b Enable separate feature extraction for SDE 2019-11-25 14:54:13 +01:00
Antonin Raffin
d0003ee4ec Enable kwargs for proba dist 2019-11-25 14:00:21 +01:00
Antonin Raffin
5bbb14188d Unify A2C and TD3 SDE implementation 2019-11-25 13:19:33 +01:00
Antonin Raffin
c56865e10d Cleanup CEM, rename variables + add comments 2019-11-22 19:02:00 +01:00
Antonin Raffin
c47be0086e Add docstrings 2019-11-22 17:24:47 +01:00
Antonin Raffin
f0f2f10d1e Change default architecture 2019-11-22 17:24:14 +01:00
Antonin Raffin
03d2ab10f8 Fix clipped action when adapting noise with TD3 2019-11-22 15:04:34 +01:00
Antonin Raffin
604a19fbc3 Cleanup + update doc 2019-11-22 13:33:12 +01:00
Antonin Raffin
b84e5e9e27 Move flexible mlp to common 2019-11-22 13:06:41 +01:00
Antonin Raffin
ea3902cd32 Add doc for CEM-RL 2019-11-22 13:03:57 +01:00
Antonin Raffin
81a15414b0 Merge branch 'master' into feat/offpolicy-sde 2019-11-22 11:43:13 +01:00
Antonin Raffin
99ea0b3a54 Cleanup 2019-11-22 11:42:58 +01:00
Noah Dormann
4f8f936451 Don't save replay_buffer by default 2019-11-21 17:27:46 +01:00
Noah Dormann
cfb822aa91 Corrected test_run.py 2019-11-21 16:54:30 +01:00
Noah Dormann
924ba9aea6 cleaned comments on model specific get and load functions 2019-11-21 16:50:59 +01:00
Noah Dormann
03a0d437ef refactor 2019-11-21 16:49:49 +01:00
Noah Dormann
2d72f6d1b5 Added SAC, TD3, A2C
Missing CEMRL
2019-11-21 16:46:53 +01:00
Noah Dormann
775a50cc5c saving all variables now added a2c support 2019-11-21 16:24:18 +01:00
Noah Dormann
526c37bf1f refactored the assets in test_save_load
fixed base_class 'params.pth'
2019-11-21 15:44:57 +01:00
Noah Dormann
26f31fd25b corrected comment sections 2019-11-21 14:55:56 +01:00
Noah Dormann
d31a399140 undo changes to conf.py 2019-11-21 14:52:29 +01:00
Noah Dormann
17f84053b3 save implementation for a2c needed before uncommenting save and load test in test_run.py::test_onpolicy 2019-11-21 14:44:02 +01:00
Noah Dormann
fb5f192fc4 Implemented Changes suggested from Antonin-Raffin
Added Optimizer saving
2019-11-21 14:39:44 +01:00
Noah Dormann
a7655ca6e1 Reformated every file with PEP 8 errors 2019-11-21 13:01:03 +01:00
Raffin, Antonin
8b218b8359 Merge pull request #2 from Antonin-Raffin/feat/sde
Add State Dependent Exploration (SDE)
2019-11-21 12:50:42 +01:00
Noah Dormann
b20b70db48 Clean reformat 2019-11-21 11:51:47 +01:00
Noah Dormann
5bca52a87d rearranged imports 2019-11-21 11:44:37 +01:00
Noah Dormann
4b6234a1c8 finished test_save_load.py test 2019-11-21 11:39:47 +01:00
Antonin Raffin
ad32aa60f3 Add sde scheduler 2019-11-18 16:03:08 +01:00
Antonin Raffin
d8a7556d84 Merge branch 'feat/sde' into feat/offpolicy-sde 2019-11-18 15:14:05 +01:00
Antonin Raffin
ef59a7e431 Update version + add docstring 2019-11-18 15:11:19 +01:00
Antonin Raffin
b9c20d443d Update doc + add test for tanh bijector 2019-11-18 15:04:07 +01:00
Antonin Raffin
5d353d598c Start cleanup + update docstrings 2019-11-18 14:09:31 +01:00
Antonin Raffin
fb64072859 Update sde test 2019-11-15 11:07:49 +01:00