Dormann, Noah
|
362bba73ba
|
adapted common style
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
|
2019-12-05 08:07:43 +01:00 |
|
Noah Dormann
|
c82025e673
|
Add Test for exclude/include feature of save
|
2019-11-28 16:07:15 +01:00 |
|
Noah Dormann
|
e95858784a
|
Formatted all files
|
2019-11-28 15:38:04 +01:00 |
|
Noah Dormann
|
9ff59eaf3d
|
Added attribute self.policy_class to prevent errors when using self.policy as class
|
2019-11-28 15:25:01 +01:00 |
|
Noah Dormann
|
e26564e0ec
|
Added function for setting up any attributes that weren't saved and thus not loaded
|
2019-11-28 13:35:16 +01:00 |
|
Noah Dormann
|
c75582dfbe
|
resolving conflicts
# Conflicts:
# torchy_baselines/a2c/a2c.py
# torchy_baselines/ppo/ppo.py
Added optimizer params test
|
2019-11-28 12:12:06 +01:00 |
|
Noah Dormann
|
812cab84ac
|
Changed PPO deterministic
|
2019-11-28 11:20:40 +01:00 |
|
Noah Dormann
|
cfb822aa91
|
Corrected test_run.py
|
2019-11-21 16:54:30 +01:00 |
|
Noah Dormann
|
2d72f6d1b5
|
Added SAC, TD3, A2C
Missing CEMRL
|
2019-11-21 16:46:53 +01:00 |
|
Noah Dormann
|
775a50cc5c
|
Saving all variables now; added A2C support
|
2019-11-21 16:24:18 +01:00 |
|
Noah Dormann
|
526c37bf1f
|
refactored the assets in test_save_load
fixed base_class 'params.pth'
|
2019-11-21 15:44:57 +01:00 |
|
Noah Dormann
|
17f84053b3
|
save implementation for a2c needed before uncommenting save and load test in test_run.py::test_onpolicy
|
2019-11-21 14:44:02 +01:00 |
|
Noah Dormann
|
fb5f192fc4
|
Implemented changes suggested by Antonin Raffin
Added Optimizer saving
|
2019-11-21 14:39:44 +01:00 |
|
Noah Dormann
|
a7655ca6e1
|
Reformatted every file with PEP 8 errors
|
2019-11-21 13:01:03 +01:00 |
|
Noah Dormann
|
b20b70db48
|
Clean reformat
|
2019-11-21 11:51:47 +01:00 |
|
Noah Dormann
|
5bca52a87d
|
rearranged imports
|
2019-11-21 11:44:37 +01:00 |
|
Noah Dormann
|
4b6234a1c8
|
finished test_save_load.py test
|
2019-11-21 11:39:47 +01:00 |
|
Antonin Raffin
|
b9c20d443d
|
Update doc + add test for tanh bijector
|
2019-11-18 15:04:07 +01:00 |
|
Antonin Raffin
|
5d353d598c
|
Start cleanup + update docstrings
|
2019-11-18 14:09:31 +01:00 |
|
Noah Dormann
|
cc744a48b5
|
first save and load features
|
2019-11-12 17:03:57 +01:00 |
|
Antonin Raffin
|
72a6f18e43
|
Add sde test + fix random seed
|
2019-10-31 14:14:30 +01:00 |
|
Antonin Raffin
|
42d50ed09b
|
Add expln
|
2019-10-29 15:15:54 +01:00 |
|
Antonin Raffin
|
c15b4bda1e
|
Add first draft of SDE
|
2019-10-28 18:24:13 +01:00 |
|
Antonin Raffin
|
0ad743c85d
|
Add A2C
|
2019-10-25 10:59:15 +02:00 |
|
Antonin RAFFIN
|
53898f3d1a
|
Add flexible mlp
|
2019-10-17 13:32:25 +02:00 |
|
Antonin Raffin
|
ef50bb81e8
|
Add support for categorical distribution
|
2019-10-08 13:06:38 +02:00 |
|
Antonin Raffin
|
37ab9d10f1
|
Rescale actions and add action noise
|
2019-10-07 16:26:03 +02:00 |
|
Antonin Raffin
|
32648d9029
|
Add docstrings
|
2019-09-24 15:30:58 +02:00 |
|
Antonin Raffin
|
d22caac616
|
Working SAC
|
2019-09-24 14:15:12 +02:00 |
|
Antonin RAFFIN
|
2469ff3859
|
Reformat
|
2019-09-21 17:17:09 +02:00 |
|
Antonin Raffin
|
a9b8276efb
|
Attempt to fix loss of perf because of VecEnvs
|
2019-09-20 18:06:08 +02:00 |
|
Antonin Raffin
|
0e727a5f72
|
Full compat for VecEnv + bug fixes for cuda
|
2019-09-20 16:43:19 +02:00 |
|
Antonin Raffin
|
56053bc692
|
Add stable-baselines VecEnvs
|
2019-09-20 15:18:25 +02:00 |
|
Antonin RAFFIN
|
fe8b415cbf
|
First sign of life
|
2019-09-19 16:21:28 +02:00 |
|
Antonin RAFFIN
|
e1c1d5c4ab
|
Bug fixes (not working yet)
|
2019-09-18 22:12:32 +02:00 |
|
Antonin RAFFIN
|
6bb7e183d2
|
Running PPO (not working yet)
|
2019-09-18 15:35:17 +02:00 |
|
Antonin Raffin
|
5e3a84d551
|
Refactor policies
|
2019-09-12 11:19:06 +02:00 |
|
Antonin Raffin
|
5e38080937
|
Code cleanup
|
2019-09-06 14:04:40 +02:00 |
|
Antonin Raffin
|
d4e2dc8a9c
|
Add CEM-RL
|
2019-09-06 14:01:10 +02:00 |
|
Antonin Raffin
|
90882ee846
|
Fixes for python 2 + env from string
|
2019-09-06 11:46:25 +02:00 |
|
Antonin Raffin
|
9cf289b997
|
Bug fixes + add evaluate script
|
2019-09-06 10:44:55 +02:00 |
|
Antonin Raffin
|
46d8d9725b
|
Init: TD3
|
2019-09-05 17:29:41 +02:00 |
|