Commit graph

174 commits

Author SHA1 Message Date
Antonin Raffin
bff0ca0ea8 Use HardTanh to relax the constrain 2019-12-19 11:59:00 +01:00
Antonin Raffin
c05c990285 Remove norm clipping 2019-12-18 16:56:51 +01:00
Antonin Raffin
e49d97bf98 Fix infs in SAC by bounding the mean 2019-12-18 13:45:33 +01:00
Antonin Raffin
57708a628c Add value function for SDE + TD3 2019-12-17 15:01:08 +01:00
Antonin Raffin
1d6f9bf100 Add sample freq for SDE 2019-12-17 11:47:21 +01:00
Antonin Raffin
919dfee452 Try to clip grad norm 2019-12-17 11:14:44 +01:00
Antonin Raffin
d63cef7693 Add gradient clipping for SAC 2019-12-06 18:32:57 +01:00
Antonin Raffin
233f346d53 Update todos 2019-12-06 17:46:56 +01:00
Antonin Raffin
6c423add8d Bump version 2019-12-05 16:44:27 +01:00
Antonin Raffin
0117cc37f4 Merge branch 'master' into feat/sde-features 2019-12-05 16:33:41 +01:00
Noah Dormann
aa67147796 clarified bytesIO use for load 2019-12-05 15:45:05 +01:00
Raffin, Antonin
bac9d4efed Update torchy_baselines/common/base_class.py 2019-12-05 14:53:14 +01:00
Raffin, Antonin
695cdc63a4 Update torchy_baselines/common/base_class.py 2019-12-05 14:52:59 +01:00
Raffin, Antonin
424a554567 Update docstring 2019-12-05 14:50:11 +01:00
Raffin, Antonin
464dd773e6 Update comment 2019-12-05 14:46:02 +01:00
Raffin, Antonin
03ecb17ef6 Update error message 2019-12-05 14:41:39 +01:00
Noah Dormann
88d4f44d55 added set_env test and set_env wrapping 2019-12-05 13:59:07 +01:00
Noah Dormann
cf1d7118a5 replaced file with file_path 2019-12-05 13:44:02 +01:00
Noah Dormann
8062ed6036 fixed load, to check if environment ist correctly 2019-12-05 13:36:19 +01:00
Noah Dormann
4b1bab7f85 implemented set_env method 2019-12-05 09:11:30 +01:00
Noah Dormann
8460bfe397 added some comments to _load_from_file 2019-12-05 08:56:04 +01:00
Noah Dormann
6560ae9952 using other_file instead of other_files 2019-12-05 08:52:26 +01:00
Noah Dormann
7c8d375bcb added get_parameter_list function 2019-12-05 08:50:11 +01:00
Noah Dormann
ff7c4d24f4 deleted types in train_actor td3 2019-12-05 08:46:26 +01:00
Noah Dormann
4c0f6cbe53 update get_opt_parameters to remove duplicate code 2019-12-05 08:43:12 +01:00
Noah Dormann
c3b0398d56 Changed load so it still works when env not saved
improved save function
2019-12-05 08:40:28 +01:00
Dormann, Noah
bea2799691 comment refactoring
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
2019-12-05 08:10:39 +01:00
Dormann, Noah
a756f40223 comment refactoring
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
2019-12-05 08:10:25 +01:00
Dormann, Noah
fdb544e775 comment refactoring
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
2019-12-05 08:10:14 +01:00
Dormann, Noah
aa66d2f82e comment refactoring
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
2019-12-05 08:09:51 +01:00
Dormann, Noah
85d3743224 added standart exclude parameters
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
2019-12-05 08:09:26 +01:00
Dormann, Noah
03bf513e5e comment refactoring
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
2019-12-05 08:08:20 +01:00
Antonin Raffin
1ac1a7cad5 Reformat 2019-12-02 14:27:38 +01:00
Antonin Raffin
3cdd5f20af Bug fix + add test for sde net arch 2019-12-02 14:14:48 +01:00
Antonin Raffin
8e9802784c Revert previous changes in SAC + SDE 2019-12-02 14:06:30 +01:00
Antonin Raffin
4e39a0627c Refactor: enable sde net arch for TD3 and SAC 2019-12-02 14:06:17 +01:00
Antonin Raffin
a2a8bbdf11 Sample n matrices for A2C/PPO when using SDE 2019-12-02 11:48:34 +01:00
Antonin Raffin
7a6a500398 Allow to use states directly as features for sde 2019-12-02 11:48:16 +01:00
Antonin RAFFIN
879191b26a Bug fix in SAC with constant ent coeff + try batch sde matrices 2019-12-01 13:11:13 +01:00
Noah Dormann
6928879f5a Refactored doc-strings 2019-11-28 16:30:13 +01:00
Noah Dormann
7ce610fade Deleted exact match parameter of load_parameters 2019-11-28 16:11:35 +01:00
Noah Dormann
ee6f938ddc Added option to explicitly specify excluded parameters 2019-11-28 15:42:53 +01:00
Noah Dormann
e95858784a Formatted all files 2019-11-28 15:38:04 +01:00
Noah Dormann
751ccf85e7 _setup_model() is now called when model is loaded 2019-11-28 15:33:52 +01:00
Noah Dormann
9ff59eaf3d Added attribute self.policy_class to prevent errors when using self.policy as class 2019-11-28 15:25:01 +01:00
Noah Dormann
e26564e0ec Added function for setting up any attributes that weren't saved and thus not loaded 2019-11-28 13:35:16 +01:00
Noah Dormann
c75582dfbe resolving conflicts
# Conflicts:
#	torchy_baselines/a2c/a2c.py
#	torchy_baselines/ppo/ppo.py

Added optimizer params test
2019-11-28 12:12:06 +01:00
Noah Dormann
812cab84ac Changed PPO deterministic 2019-11-28 11:20:40 +01:00
Noah Dormann
b75ffe166a Cleared base_class.load description 2019-11-28 10:36:21 +01:00
Antonin Raffin
fbe29a7298 Track down autograd error
"Trying to backward through the graph a second time" -> added a comment
2019-11-27 17:29:47 +01:00