Noah Dormann
|
6928879f5a
|
Refactored doc-strings
|
2019-11-28 16:30:13 +01:00 |
|
Noah Dormann
|
7ce610fade
|
Deleted exact match parameter of load_parameters
|
2019-11-28 16:11:35 +01:00 |
|
Noah Dormann
|
c82025e673
|
Add Test for exclude/include feature of save
|
2019-11-28 16:07:15 +01:00 |
|
Noah Dormann
|
ee6f938ddc
|
Added option to explicitly specify excluded parameters
|
2019-11-28 15:42:53 +01:00 |
|
Noah Dormann
|
e95858784a
|
Formatted all files
|
2019-11-28 15:38:04 +01:00 |
|
Noah Dormann
|
751ccf85e7
|
_setup_model() is now called when model is loaded
|
2019-11-28 15:33:52 +01:00 |
|
Noah Dormann
|
9ff59eaf3d
|
Added attribute self.policy_class to prevent errors when using self.policy as class
|
2019-11-28 15:25:01 +01:00 |
|
Noah Dormann
|
e26564e0ec
|
Added function for setting up any attributes that weren't saved and thus not loaded
|
2019-11-28 13:35:16 +01:00 |
|
Noah Dormann
|
c75582dfbe
|
resolving conflicts
# Conflicts:
# torchy_baselines/a2c/a2c.py
# torchy_baselines/ppo/ppo.py
Added optimizer params test
|
2019-11-28 12:12:06 +01:00 |
|
Noah Dormann
|
812cab84ac
|
Changed PPO deterministic
|
2019-11-28 11:20:40 +01:00 |
|
Noah Dormann
|
b75ffe166a
|
Cleared base_class.load description
|
2019-11-28 10:36:21 +01:00 |
|
Antonin Raffin
|
fbe29a7298
|
Track down autograd error
"Trying to backward through the graph a second time" -> added a comment
|
2019-11-27 17:29:47 +01:00 |
|
Antonin Raffin
|
fe67a98711
|
Log more values
|
2019-11-26 17:44:06 +01:00 |
|
Antonin Raffin
|
5483e02d1a
|
Add SDE support for SAC
|
2019-11-26 15:26:12 +01:00 |
|
Antonin Raffin
|
d26fcf4566
|
Fix grad computation for sde test
|
2019-11-26 11:57:48 +01:00 |
|
Antonin Raffin
|
0885dbe74b
|
Bug fix in choosing the distribution
|
2019-11-25 15:02:10 +01:00 |
|
Antonin Raffin
|
5d6649d92b
|
Enable separate feature extraction for SDE
|
2019-11-25 14:54:13 +01:00 |
|
Antonin Raffin
|
d0003ee4ec
|
Enable kwargs for proba dist
|
2019-11-25 14:00:21 +01:00 |
|
Antonin Raffin
|
5bbb14188d
|
Unify A2C and TD3 SDE implementation
|
2019-11-25 13:19:33 +01:00 |
|
Antonin Raffin
|
c56865e10d
|
Cleanup CEM, rename variables + add comments
|
2019-11-22 19:02:00 +01:00 |
|
Antonin Raffin
|
c47be0086e
|
Add docstrings
|
2019-11-22 17:24:47 +01:00 |
|
Antonin Raffin
|
f0f2f10d1e
|
Change default architecture
|
2019-11-22 17:24:14 +01:00 |
|
Antonin Raffin
|
03d2ab10f8
|
Fix clipped action when adapting noise with TD3
|
2019-11-22 15:04:34 +01:00 |
|
Antonin Raffin
|
604a19fbc3
|
Cleanup + update doc
|
2019-11-22 13:33:12 +01:00 |
|
Antonin Raffin
|
b84e5e9e27
|
Move flexible mlp to common
|
2019-11-22 13:06:41 +01:00 |
|
Antonin Raffin
|
ea3902cd32
|
Add doc for CEM-RL
|
2019-11-22 13:03:57 +01:00 |
|
Antonin Raffin
|
81a15414b0
|
Merge branch 'master' into feat/offpolicy-sde
|
2019-11-22 11:43:13 +01:00 |
|
Antonin Raffin
|
99ea0b3a54
|
Cleanup
|
2019-11-22 11:42:58 +01:00 |
|
Noah Dormann
|
4f8f936451
|
Don't save replay_buffer by default
|
2019-11-21 17:27:46 +01:00 |
|
Noah Dormann
|
cfb822aa91
|
Corrected test_run.py
|
2019-11-21 16:54:30 +01:00 |
|
Noah Dormann
|
924ba9aea6
|
cleaned comments on model specific get and load functions
|
2019-11-21 16:50:59 +01:00 |
|
Noah Dormann
|
03a0d437ef
|
refactor
|
2019-11-21 16:49:49 +01:00 |
|
Noah Dormann
|
2d72f6d1b5
|
Added SAC, TD3, A2C
Missing CEMRL
|
2019-11-21 16:46:53 +01:00 |
|
Noah Dormann
|
775a50cc5c
|
saving all variables now; added a2c support
|
2019-11-21 16:24:18 +01:00 |
|
Noah Dormann
|
526c37bf1f
|
refactored the assets in test_save_load
fixed base_class 'params.pth'
|
2019-11-21 15:44:57 +01:00 |
|
Noah Dormann
|
26f31fd25b
|
corrected comment sections
|
2019-11-21 14:55:56 +01:00 |
|
Noah Dormann
|
d31a399140
|
undo changes to conf.py
|
2019-11-21 14:52:29 +01:00 |
|
Noah Dormann
|
17f84053b3
|
save implementation for a2c needed before uncommenting save and load test in test_run.py::test_onpolicy
|
2019-11-21 14:44:02 +01:00 |
|
Noah Dormann
|
fb5f192fc4
|
Implemented changes suggested by Antonin-Raffin
Added Optimizer saving
|
2019-11-21 14:39:44 +01:00 |
|
Noah Dormann
|
a7655ca6e1
|
Reformatted every file with PEP 8 errors
|
2019-11-21 13:01:03 +01:00 |
|
Raffin, Antonin
|
8b218b8359
|
Merge pull request #2 from Antonin-Raffin/feat/sde
Add State Dependent Exploration (SDE)
|
2019-11-21 12:50:42 +01:00 |
|
Noah Dormann
|
b20b70db48
|
Clean reformat
|
2019-11-21 11:51:47 +01:00 |
|
Noah Dormann
|
5bca52a87d
|
rearranged imports
|
2019-11-21 11:44:37 +01:00 |
|
Noah Dormann
|
4b6234a1c8
|
finished test_save_load.py test
|
2019-11-21 11:39:47 +01:00 |
|
Antonin Raffin
|
ad32aa60f3
|
Add sde scheduler
|
2019-11-18 16:03:08 +01:00 |
|
Antonin Raffin
|
d8a7556d84
|
Merge branch 'feat/sde' into feat/offpolicy-sde
|
2019-11-18 15:14:05 +01:00 |
|
Antonin Raffin
|
ef59a7e431
|
Update version + add docstring
|
2019-11-18 15:11:19 +01:00 |
|
Antonin Raffin
|
b9c20d443d
|
Update doc + add test for tanh bijector
|
2019-11-18 15:04:07 +01:00 |
|
Antonin Raffin
|
5d353d598c
|
Start cleanup + update docstrings
|
2019-11-18 14:09:31 +01:00 |
|
Antonin Raffin
|
fb64072859
|
Update sde test
|
2019-11-15 11:07:49 +01:00 |
|