Antonin Raffin
|
bff0ca0ea8
|
Use HardTanh to relax the constraint
|
2019-12-19 11:59:00 +01:00 |
|
Antonin Raffin
|
c05c990285
|
Remove norm clipping
|
2019-12-18 16:56:51 +01:00 |
|
Antonin Raffin
|
e49d97bf98
|
Fix infs in SAC by bounding the mean
|
2019-12-18 13:45:33 +01:00 |
|
Antonin Raffin
|
57708a628c
|
Add value function for SDE + TD3
|
2019-12-17 15:01:08 +01:00 |
|
Antonin Raffin
|
1d6f9bf100
|
Add sample freq for SDE
|
2019-12-17 11:47:21 +01:00 |
|
Antonin Raffin
|
919dfee452
|
Try to clip grad norm
|
2019-12-17 11:14:44 +01:00 |
|
Antonin Raffin
|
d63cef7693
|
Add gradient clipping for SAC
|
2019-12-06 18:32:57 +01:00 |
|
Antonin Raffin
|
233f346d53
|
Update todos
|
2019-12-06 17:46:56 +01:00 |
|
Antonin Raffin
|
6c423add8d
|
Bump version
|
2019-12-05 16:44:27 +01:00 |
|
Antonin Raffin
|
0117cc37f4
|
Merge branch 'master' into feat/sde-features
|
2019-12-05 16:33:41 +01:00 |
|
Noah Dormann
|
aa67147796
|
clarified bytesIO use for load
|
2019-12-05 15:45:05 +01:00 |
|
Raffin, Antonin
|
bac9d4efed
|
Update torchy_baselines/common/base_class.py
|
2019-12-05 14:53:14 +01:00 |
|
Raffin, Antonin
|
695cdc63a4
|
Update torchy_baselines/common/base_class.py
|
2019-12-05 14:52:59 +01:00 |
|
Raffin, Antonin
|
424a554567
|
Update docstring
|
2019-12-05 14:50:11 +01:00 |
|
Raffin, Antonin
|
464dd773e6
|
Update comment
|
2019-12-05 14:46:02 +01:00 |
|
Raffin, Antonin
|
03ecb17ef6
|
Update error message
|
2019-12-05 14:41:39 +01:00 |
|
Noah Dormann
|
88d4f44d55
|
added set_env test and set_env wrapping
|
2019-12-05 13:59:07 +01:00 |
|
Noah Dormann
|
cf1d7118a5
|
replaced file with file_path
|
2019-12-05 13:44:02 +01:00 |
|
Noah Dormann
|
8062ed6036
|
fixed load to check if environment is correct
|
2019-12-05 13:36:19 +01:00 |
|
Noah Dormann
|
4b1bab7f85
|
implemented set_env method
|
2019-12-05 09:11:30 +01:00 |
|
Noah Dormann
|
8460bfe397
|
added some comments to _load_from_file
|
2019-12-05 08:56:04 +01:00 |
|
Noah Dormann
|
6560ae9952
|
using other_file instead of other_files
|
2019-12-05 08:52:26 +01:00 |
|
Noah Dormann
|
7c8d375bcb
|
added get_parameter_list function
|
2019-12-05 08:50:11 +01:00 |
|
Noah Dormann
|
ff7c4d24f4
|
deleted types in train_actor td3
|
2019-12-05 08:46:26 +01:00 |
|
Noah Dormann
|
4c0f6cbe53
|
update get_opt_parameters to remove duplicate code
|
2019-12-05 08:43:12 +01:00 |
|
Noah Dormann
|
c3b0398d56
|
Changed load so it still works when env not saved
improved save function
|
2019-12-05 08:40:28 +01:00 |
|
Dormann, Noah
|
bea2799691
|
comment refactoring
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
|
2019-12-05 08:10:39 +01:00 |
|
Dormann, Noah
|
a756f40223
|
comment refactoring
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
|
2019-12-05 08:10:25 +01:00 |
|
Dormann, Noah
|
fdb544e775
|
comment refactoring
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
|
2019-12-05 08:10:14 +01:00 |
|
Dormann, Noah
|
aa66d2f82e
|
comment refactoring
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
|
2019-12-05 08:09:51 +01:00 |
|
Dormann, Noah
|
85d3743224
|
added standard exclude parameters
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
|
2019-12-05 08:09:26 +01:00 |
|
Dormann, Noah
|
03bf513e5e
|
comment refactoring
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
|
2019-12-05 08:08:20 +01:00 |
|
Antonin Raffin
|
1ac1a7cad5
|
Reformat
|
2019-12-02 14:27:38 +01:00 |
|
Antonin Raffin
|
3cdd5f20af
|
Bug fix + add test for sde net arch
|
2019-12-02 14:14:48 +01:00 |
|
Antonin Raffin
|
8e9802784c
|
Revert previous changes in SAC + SDE
|
2019-12-02 14:06:30 +01:00 |
|
Antonin Raffin
|
4e39a0627c
|
Refactor: enable sde net arch for TD3 and SAC
|
2019-12-02 14:06:17 +01:00 |
|
Antonin Raffin
|
a2a8bbdf11
|
Sample n matrices for A2C/PPO when using SDE
|
2019-12-02 11:48:34 +01:00 |
|
Antonin Raffin
|
7a6a500398
|
Allow to use states directly as features for sde
|
2019-12-02 11:48:16 +01:00 |
|
Antonin RAFFIN
|
879191b26a
|
Bug fix in SAC with constant ent coeff + try batch sde matrices
|
2019-12-01 13:11:13 +01:00 |
|
Noah Dormann
|
6928879f5a
|
Refactored doc-strings
|
2019-11-28 16:30:13 +01:00 |
|
Noah Dormann
|
7ce610fade
|
Deleted exact match parameter of load_parameters
|
2019-11-28 16:11:35 +01:00 |
|
Noah Dormann
|
ee6f938ddc
|
Added option to explicitly specify excluded parameters
|
2019-11-28 15:42:53 +01:00 |
|
Noah Dormann
|
e95858784a
|
Formatted all files
|
2019-11-28 15:38:04 +01:00 |
|
Noah Dormann
|
751ccf85e7
|
_setup_model() is now called when model is loaded
|
2019-11-28 15:33:52 +01:00 |
|
Noah Dormann
|
9ff59eaf3d
|
Added attribute self.policy_class to prevent errors when using self.policy as class
|
2019-11-28 15:25:01 +01:00 |
|
Noah Dormann
|
e26564e0ec
|
Added function for setting up any attributes that weren't saved and thus not loaded
|
2019-11-28 13:35:16 +01:00 |
|
Noah Dormann
|
c75582dfbe
|
resolving conflicts
# Conflicts:
# torchy_baselines/a2c/a2c.py
# torchy_baselines/ppo/ppo.py
Added optimizer params test
|
2019-11-28 12:12:06 +01:00 |
|
Noah Dormann
|
812cab84ac
|
Changed PPO deterministic
|
2019-11-28 11:20:40 +01:00 |
|
Noah Dormann
|
b75ffe166a
|
Cleared base_class.load description
|
2019-11-28 10:36:21 +01:00 |
|
Antonin Raffin
|
fbe29a7298
|
Track down autograd error
"Trying to backward through the graph a second time" -> added a comment
|
2019-11-27 17:29:47 +01:00 |
|