Commit graph

197 commits

Author SHA1 Message Date
Antonin Raffin
a5c3418765 Update README (roadmap moved to github) 2019-12-19 15:28:36 +01:00
Antonin Raffin
bff0ca0ea8 Use HardTanh to relax the constrain 2019-12-19 11:59:00 +01:00
Antonin Raffin
c05c990285 Remove norm clipping 2019-12-18 16:56:51 +01:00
Antonin Raffin
07345e5e27 Test for differential entropy 2019-12-18 13:45:56 +01:00
Antonin Raffin
e49d97bf98 Fix infs in SAC by bounding the mean 2019-12-18 13:45:33 +01:00
Antonin Raffin
57708a628c Add value function for SDE + TD3 2019-12-17 15:01:08 +01:00
Antonin Raffin
1d6f9bf100 Add sample freq for SDE 2019-12-17 11:47:21 +01:00
Antonin Raffin
4957f05810 Merge branch 'master' into feat/sde-features 2019-12-17 11:15:22 +01:00
Antonin Raffin
919dfee452 Try to clip grad norm 2019-12-17 11:14:44 +01:00
Raffin, Antonin
8874b9dd6b Merge pull request #5 from Antonin-Raffin/feat/td3-sde
Off-Policy State Dependent Exploration
2019-12-17 11:09:01 +01:00
Antonin Raffin
d63cef7693 Add gradient clipping for SAC 2019-12-06 18:32:57 +01:00
Antonin Raffin
233f346d53 Update todos 2019-12-06 17:46:56 +01:00
Antonin Raffin
6c423add8d Bump version 2019-12-05 16:44:27 +01:00
Antonin Raffin
1f2b047ab3 Merge branch 'master' into feat/td3-sde 2019-12-05 16:35:57 +01:00
Antonin Raffin
0117cc37f4 Merge branch 'master' into feat/sde-features 2019-12-05 16:33:41 +01:00
Raffin, Antonin
2690fa4fba Merge pull request #3 from Antonin-Raffin/safe_load_modules
Save and load methods
2019-12-05 16:24:39 +01:00
Noah Dormann
aa67147796 clarified bytesIO use for load 2019-12-05 15:45:05 +01:00
Raffin, Antonin
bac9d4efed Update torchy_baselines/common/base_class.py 2019-12-05 14:53:14 +01:00
Raffin, Antonin
695cdc63a4 Update torchy_baselines/common/base_class.py 2019-12-05 14:52:59 +01:00
Raffin, Antonin
424a554567 Update docstring 2019-12-05 14:50:11 +01:00
Raffin, Antonin
464dd773e6 Update comment 2019-12-05 14:46:02 +01:00
Raffin, Antonin
03ecb17ef6 Update error message 2019-12-05 14:41:39 +01:00
Noah Dormann
88d4f44d55 added set_env test and set_env wrapping 2019-12-05 13:59:07 +01:00
Noah Dormann
cf1d7118a5 replaced file with file_path 2019-12-05 13:44:02 +01:00
Noah Dormann
8062ed6036 fixed load, to check if environment ist correctly 2019-12-05 13:36:19 +01:00
Noah Dormann
4b1bab7f85 implemented set_env method 2019-12-05 09:11:30 +01:00
Noah Dormann
8460bfe397 added some comments to _load_from_file 2019-12-05 08:56:04 +01:00
Noah Dormann
6560ae9952 using other_file instead of other_files 2019-12-05 08:52:26 +01:00
Noah Dormann
7c8d375bcb added get_parameter_list function 2019-12-05 08:50:11 +01:00
Noah Dormann
ff7c4d24f4 deleted types in train_actor td3 2019-12-05 08:46:26 +01:00
Noah Dormann
4c0f6cbe53 update get_opt_parameters to remove duplicate code 2019-12-05 08:43:12 +01:00
Noah Dormann
c3b0398d56 Changed load so it still works when env not saved
improved save function
2019-12-05 08:40:28 +01:00
Dormann, Noah
bea2799691 comment refactoring
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
2019-12-05 08:10:39 +01:00
Dormann, Noah
a756f40223 comment refactoring
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
2019-12-05 08:10:25 +01:00
Dormann, Noah
fdb544e775 comment refactoring
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
2019-12-05 08:10:14 +01:00
Dormann, Noah
aa66d2f82e comment refactoring
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
2019-12-05 08:09:51 +01:00
Dormann, Noah
85d3743224 added standart exclude parameters
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
2019-12-05 08:09:26 +01:00
Dormann, Noah
03bf513e5e comment refactoring
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
2019-12-05 08:08:20 +01:00
Dormann, Noah
362bba73ba adapted common style
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
2019-12-05 08:07:43 +01:00
Antonin Raffin
1ac1a7cad5 Reformat 2019-12-02 14:27:38 +01:00
Antonin Raffin
3cdd5f20af Bug fix + add test for sde net arch 2019-12-02 14:14:48 +01:00
Antonin Raffin
8e9802784c Revert previous changes in SAC + SDE 2019-12-02 14:06:30 +01:00
Antonin Raffin
4e39a0627c Refactor: enable sde net arch for TD3 and SAC 2019-12-02 14:06:17 +01:00
Antonin Raffin
a2a8bbdf11 Sample n matrices for A2C/PPO when using SDE 2019-12-02 11:48:34 +01:00
Antonin Raffin
7a6a500398 Allow to use states directly as features for sde 2019-12-02 11:48:16 +01:00
Antonin Raffin
21e655ecbf Add test for SAC with different entropy temperature 2019-12-02 11:47:52 +01:00
Antonin RAFFIN
03a84f97ea Add monte-carlo test for SDE distribution 2019-12-01 16:46:39 +01:00
Antonin RAFFIN
879191b26a Bug fix in SAC with constant ent coeff + try batch sde matrices 2019-12-01 13:11:13 +01:00
Noah Dormann
6928879f5a Refactored doc-strings 2019-11-28 16:30:13 +01:00
Noah Dormann
7ce610fade Deleted exact match parameter of load_parameters 2019-11-28 16:11:35 +01:00