Antonin Raffin
|
e5c6601726
|
Update VecNormalize (pickling) and improve tests
|
2020-01-20 11:58:16 +01:00 |
|
Antonin Raffin
|
89db65b1fb
|
Improve logger testing + add readers
|
2020-01-20 11:58:00 +01:00 |
|
Antonin Raffin
|
c542009641
|
Clean up code + bug fixes
|
2020-01-20 11:17:55 +01:00 |
|
Antonin Raffin
|
07345e5e27
|
Test for differential entropy
|
2019-12-18 13:45:56 +01:00 |
|
Antonin Raffin
|
0117cc37f4
|
Merge branch 'master' into feat/sde-features
|
2019-12-05 16:33:41 +01:00 |
|
Noah Dormann
|
88d4f44d55
|
added set_env test and set_env wrapping
|
2019-12-05 13:59:07 +01:00 |
|
Noah Dormann
|
8062ed6036
|
fixed load, to check if environment ist correctly
|
2019-12-05 13:36:19 +01:00 |
|
Noah Dormann
|
c3b0398d56
|
Changed load so it still works when env not saved
improved save function
|
2019-12-05 08:40:28 +01:00 |
|
Dormann, Noah
|
362bba73ba
|
adapted common style
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
|
2019-12-05 08:07:43 +01:00 |
|
Antonin Raffin
|
3cdd5f20af
|
Bug fix + add test for sde net arch
|
2019-12-02 14:14:48 +01:00 |
|
Antonin Raffin
|
21e655ecbf
|
Add test for SAC with different entropy temperature
|
2019-12-02 11:47:52 +01:00 |
|
Antonin RAFFIN
|
03a84f97ea
|
Add monte-carlo test for SDE distribution
|
2019-12-01 16:46:39 +01:00 |
|
Noah Dormann
|
c82025e673
|
Add Test for exclude/include feature of save
|
2019-11-28 16:07:15 +01:00 |
|
Noah Dormann
|
e95858784a
|
Formatted all files
|
2019-11-28 15:38:04 +01:00 |
|
Noah Dormann
|
9ff59eaf3d
|
Added attribute self.policy_class to prevent errors when using self.policy as class
|
2019-11-28 15:25:01 +01:00 |
|
Noah Dormann
|
e26564e0ec
|
Added function for setting up any attributes that weren't saved and thus not loaded
|
2019-11-28 13:35:16 +01:00 |
|
Noah Dormann
|
c75582dfbe
|
resolving conflicts
# Conflicts:
# torchy_baselines/a2c/a2c.py
# torchy_baselines/ppo/ppo.py
Added optimizer params test
|
2019-11-28 12:12:06 +01:00 |
|
Noah Dormann
|
812cab84ac
|
Changed PPO deterministic
|
2019-11-28 11:20:40 +01:00 |
|
Antonin Raffin
|
5483e02d1a
|
Add SDE support for SAC
|
2019-11-26 15:26:12 +01:00 |
|
Antonin Raffin
|
d26fcf4566
|
Fix grad computation for sde test
|
2019-11-26 11:57:48 +01:00 |
|
Antonin Raffin
|
0885dbe74b
|
Bug fix in choosing the distribution
|
2019-11-25 15:02:10 +01:00 |
|
Antonin Raffin
|
5d6649d92b
|
Enable separate feature extraction for SDE
|
2019-11-25 14:54:13 +01:00 |
|
Noah Dormann
|
cfb822aa91
|
Corrected test_run.py
|
2019-11-21 16:54:30 +01:00 |
|
Noah Dormann
|
2d72f6d1b5
|
Added SAC, TD3, A2C
Missing CEMRL
|
2019-11-21 16:46:53 +01:00 |
|
Noah Dormann
|
775a50cc5c
|
saving all variables now added a2c support
|
2019-11-21 16:24:18 +01:00 |
|
Noah Dormann
|
526c37bf1f
|
refactored the assets in test_save_load
fixed base_class 'params.pth'
|
2019-11-21 15:44:57 +01:00 |
|
Noah Dormann
|
17f84053b3
|
save implementation for a2c needed before uncommenting save and load test in test_run.py::test_onpolicy
|
2019-11-21 14:44:02 +01:00 |
|
Noah Dormann
|
fb5f192fc4
|
Implemented Changes suggested from Antonin-Raffin
Added Optimizer saving
|
2019-11-21 14:39:44 +01:00 |
|
Noah Dormann
|
a7655ca6e1
|
Reformated every file with PEP 8 errors
|
2019-11-21 13:01:03 +01:00 |
|
Noah Dormann
|
b20b70db48
|
Clean reformat
|
2019-11-21 11:51:47 +01:00 |
|
Noah Dormann
|
5bca52a87d
|
rearranged imports
|
2019-11-21 11:44:37 +01:00 |
|
Noah Dormann
|
4b6234a1c8
|
finished test_save_load.py test
|
2019-11-21 11:39:47 +01:00 |
|
Antonin Raffin
|
ad32aa60f3
|
Add sde scheduler
|
2019-11-18 16:03:08 +01:00 |
|
Antonin Raffin
|
d8a7556d84
|
Merge branch 'feat/sde' into feat/offpolicy-sde
|
2019-11-18 15:14:05 +01:00 |
|
Antonin Raffin
|
b9c20d443d
|
Update doc + add test for tanh bijector
|
2019-11-18 15:04:07 +01:00 |
|
Antonin Raffin
|
5d353d598c
|
Start cleanup + update docstrings
|
2019-11-18 14:09:31 +01:00 |
|
Antonin Raffin
|
fb64072859
|
Update sde test
|
2019-11-15 11:07:49 +01:00 |
|
Antonin Raffin
|
cdb62a93fe
|
Bug fix for off-policy normalization
Now working properly
|
2019-11-15 11:00:31 +01:00 |
|
Antonin Raffin
|
5278a6f3f8
|
Testing off policy normalization
|
2019-11-14 14:35:00 +01:00 |
|
Noah Dormann
|
cc744a48b5
|
first save and load features
|
2019-11-12 17:03:57 +01:00 |
|
Antonin Raffin
|
db87e0d36a
|
Quick and dirty SDE version for TD3
|
2019-11-07 17:31:52 +01:00 |
|
Antonin Raffin
|
72a6f18e43
|
Add sde test + fix random seed
|
2019-10-31 14:14:30 +01:00 |
|
Antonin Raffin
|
42d50ed09b
|
Add expln
|
2019-10-29 15:15:54 +01:00 |
|
Antonin Raffin
|
c15b4bda1e
|
Add first draft of SDE
|
2019-10-28 18:24:13 +01:00 |
|
Antonin Raffin
|
0ad743c85d
|
Add A2C
|
2019-10-25 10:59:15 +02:00 |
|
Antonin RAFFIN
|
53898f3d1a
|
Add flexible mlp
|
2019-10-17 13:32:25 +02:00 |
|
Antonin Raffin
|
ef50bb81e8
|
Add support for categorical distribution
|
2019-10-08 13:06:38 +02:00 |
|
Antonin Raffin
|
37ab9d10f1
|
Rescale actions and add action noise
|
2019-10-07 16:26:03 +02:00 |
|
Antonin Raffin
|
32648d9029
|
Add docstrings
|
2019-09-24 15:30:58 +02:00 |
|
Antonin Raffin
|
d22caac616
|
Working SAC
|
2019-09-24 14:15:12 +02:00 |
|