Antonin Raffin
|
a5c3418765
|
Update README (roadmap moved to github)
|
2019-12-19 15:28:36 +01:00 |
|
Antonin Raffin
|
bff0ca0ea8
|
Use HardTanh to relax the constrain
|
2019-12-19 11:59:00 +01:00 |
|
Antonin Raffin
|
c05c990285
|
Remove norm clipping
|
2019-12-18 16:56:51 +01:00 |
|
Antonin Raffin
|
07345e5e27
|
Test for differential entropy
|
2019-12-18 13:45:56 +01:00 |
|
Antonin Raffin
|
e49d97bf98
|
Fix infs in SAC by bounding the mean
|
2019-12-18 13:45:33 +01:00 |
|
Antonin Raffin
|
57708a628c
|
Add value function for SDE + TD3
|
2019-12-17 15:01:08 +01:00 |
|
Antonin Raffin
|
1d6f9bf100
|
Add sample freq for SDE
|
2019-12-17 11:47:21 +01:00 |
|
Antonin Raffin
|
4957f05810
|
Merge branch 'master' into feat/sde-features
|
2019-12-17 11:15:22 +01:00 |
|
Antonin Raffin
|
919dfee452
|
Try to clip grad norm
|
2019-12-17 11:14:44 +01:00 |
|
Raffin, Antonin
|
8874b9dd6b
|
Merge pull request #5 from Antonin-Raffin/feat/td3-sde
Off-Policy State Dependent Exploration
|
2019-12-17 11:09:01 +01:00 |
|
Antonin Raffin
|
d63cef7693
|
Add gradient clipping for SAC
|
2019-12-06 18:32:57 +01:00 |
|
Antonin Raffin
|
233f346d53
|
Update todos
|
2019-12-06 17:46:56 +01:00 |
|
Antonin Raffin
|
6c423add8d
|
Bump version
|
2019-12-05 16:44:27 +01:00 |
|
Antonin Raffin
|
1f2b047ab3
|
Merge branch 'master' into feat/td3-sde
|
2019-12-05 16:35:57 +01:00 |
|
Antonin Raffin
|
0117cc37f4
|
Merge branch 'master' into feat/sde-features
|
2019-12-05 16:33:41 +01:00 |
|
Raffin, Antonin
|
2690fa4fba
|
Merge pull request #3 from Antonin-Raffin/safe_load_modules
Save and load methods
|
2019-12-05 16:24:39 +01:00 |
|
Noah Dormann
|
aa67147796
|
clarified bytesIO use for load
|
2019-12-05 15:45:05 +01:00 |
|
Raffin, Antonin
|
bac9d4efed
|
Update torchy_baselines/common/base_class.py
|
2019-12-05 14:53:14 +01:00 |
|
Raffin, Antonin
|
695cdc63a4
|
Update torchy_baselines/common/base_class.py
|
2019-12-05 14:52:59 +01:00 |
|
Raffin, Antonin
|
424a554567
|
Update docstring
|
2019-12-05 14:50:11 +01:00 |
|
Raffin, Antonin
|
464dd773e6
|
Update comment
|
2019-12-05 14:46:02 +01:00 |
|
Raffin, Antonin
|
03ecb17ef6
|
Update error message
|
2019-12-05 14:41:39 +01:00 |
|
Noah Dormann
|
88d4f44d55
|
added set_env test and set_env wrapping
|
2019-12-05 13:59:07 +01:00 |
|
Noah Dormann
|
cf1d7118a5
|
replaced file with file_path
|
2019-12-05 13:44:02 +01:00 |
|
Noah Dormann
|
8062ed6036
|
fixed load, to check if environment ist correctly
|
2019-12-05 13:36:19 +01:00 |
|
Noah Dormann
|
4b1bab7f85
|
implemented set_env method
|
2019-12-05 09:11:30 +01:00 |
|
Noah Dormann
|
8460bfe397
|
added some comments to _load_from_file
|
2019-12-05 08:56:04 +01:00 |
|
Noah Dormann
|
6560ae9952
|
using other_file instead of other_files
|
2019-12-05 08:52:26 +01:00 |
|
Noah Dormann
|
7c8d375bcb
|
added get_parameter_list function
|
2019-12-05 08:50:11 +01:00 |
|
Noah Dormann
|
ff7c4d24f4
|
deleted types in train_actor td3
|
2019-12-05 08:46:26 +01:00 |
|
Noah Dormann
|
4c0f6cbe53
|
update get_opt_parameters to remove duplicate code
|
2019-12-05 08:43:12 +01:00 |
|
Noah Dormann
|
c3b0398d56
|
Changed load so it still works when env not saved
improved save function
|
2019-12-05 08:40:28 +01:00 |
|
Dormann, Noah
|
bea2799691
|
comment refactoring
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
|
2019-12-05 08:10:39 +01:00 |
|
Dormann, Noah
|
a756f40223
|
comment refactoring
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
|
2019-12-05 08:10:25 +01:00 |
|
Dormann, Noah
|
fdb544e775
|
comment refactoring
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
|
2019-12-05 08:10:14 +01:00 |
|
Dormann, Noah
|
aa66d2f82e
|
comment refactoring
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
|
2019-12-05 08:09:51 +01:00 |
|
Dormann, Noah
|
85d3743224
|
added standart exclude parameters
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
|
2019-12-05 08:09:26 +01:00 |
|
Dormann, Noah
|
03bf513e5e
|
comment refactoring
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
|
2019-12-05 08:08:20 +01:00 |
|
Dormann, Noah
|
362bba73ba
|
adapted common style
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
|
2019-12-05 08:07:43 +01:00 |
|
Antonin Raffin
|
1ac1a7cad5
|
Reformat
|
2019-12-02 14:27:38 +01:00 |
|
Antonin Raffin
|
3cdd5f20af
|
Bug fix + add test for sde net arch
|
2019-12-02 14:14:48 +01:00 |
|
Antonin Raffin
|
8e9802784c
|
Revert previous changes in SAC + SDE
|
2019-12-02 14:06:30 +01:00 |
|
Antonin Raffin
|
4e39a0627c
|
Refactor: enable sde net arch for TD3 and SAC
|
2019-12-02 14:06:17 +01:00 |
|
Antonin Raffin
|
a2a8bbdf11
|
Sample n matrices for A2C/PPO when using SDE
|
2019-12-02 11:48:34 +01:00 |
|
Antonin Raffin
|
7a6a500398
|
Allow to use states directly as features for sde
|
2019-12-02 11:48:16 +01:00 |
|
Antonin Raffin
|
21e655ecbf
|
Add test for SAC with different entropy temperature
|
2019-12-02 11:47:52 +01:00 |
|
Antonin RAFFIN
|
03a84f97ea
|
Add monte-carlo test for SDE distribution
|
2019-12-01 16:46:39 +01:00 |
|
Antonin RAFFIN
|
879191b26a
|
Bug fix in SAC with constant ent coeff + try batch sde matrices
|
2019-12-01 13:11:13 +01:00 |
|
Noah Dormann
|
6928879f5a
|
Refactored doc-strings
|
2019-11-28 16:30:13 +01:00 |
|
Noah Dormann
|
7ce610fade
|
Deleted exact match parameter of load_parameters
|
2019-11-28 16:11:35 +01:00 |
|