Commit graph

203 commits

Author SHA1 Message Date
Antonin Raffin
fb57a6b80c Update docstring 2020-01-23 11:20:12 +01:00
Antonin Raffin
7265d9e352 Fix multiline f-string 2020-01-23 10:56:53 +01:00
Antonin Raffin
ff0eddfb17 Partially type base class 2020-01-22 17:51:27 +01:00
Antonin Raffin
44fce7c02a Fix typing errors and typos 2020-01-22 17:17:12 +01:00
Antonin Raffin
88f07bafb6 Convert format to f-strings 2020-01-22 16:39:25 +01:00
Antonin Raffin
37f9f13684 Revert all changes for python 2
+ Add makefile and pytype
2020-01-22 16:18:27 +01:00
Antonin Raffin
9e250b6818 Build doc 2020-01-20 16:19:35 +01:00
Antonin Raffin
b8df12afe2 Release v0.1.0 2020-01-20 13:01:14 +01:00
Antonin Raffin
0bed698ec5 Raise error for abstract methods 2020-01-20 12:57:40 +01:00
Antonin Raffin
e5c6601726 Update VecNormalize (pickling) and improve tests 2020-01-20 11:58:16 +01:00
Antonin Raffin
89db65b1fb Improve logger testing + add readers 2020-01-20 11:58:00 +01:00
Antonin Raffin
c542009641 Clean up code + bug fixes 2020-01-20 11:17:55 +01:00
Antonin Raffin
ea20721632 Add TODO 2020-01-15 15:58:45 +01:00
Antonin Raffin
03e853997a Add squash_output and expln as policy param for ppo and a2c 2020-01-15 13:21:20 +01:00
Antonin Raffin
60d5f4463d Add use_expln option for td3 2020-01-08 17:04:28 +01:00
Antonin Raffin
d3a718b94e Add extra dependency 2020-01-08 11:26:57 +01:00
Antonin Raffin
299ca007b5 Add comment about warmup phase 2020-01-07 17:36:26 +01:00
Antonin Raffin
8831eff163 Unify evaluation 2020-01-07 14:00:03 +01:00
Antonin RAFFIN
aa7b91333e Add seeding for subproc vecenv 2019-12-30 12:01:37 +01:00
Antonin RAFFIN
4a79f7e5a7 Print std reward for evaluation 2019-12-24 13:12:04 +01:00
Antonin RAFFIN
57c890f3e9 LeakyClip not working yet 2019-12-22 14:38:30 +01:00
Antonin RAFFIN
3a7508ac16 Fix double clip 2019-12-22 13:56:30 +01:00
Antonin Raffin
f6c475a44b Add use_expln as a policy argument 2019-12-20 18:10:24 +01:00
Antonin Raffin
7f34108ed6 Fix exp_ln computation 2019-12-20 18:02:01 +01:00
Antonin Raffin
9b3b34c9c4 Sample batch_size noise matrices for SAC 2019-12-20 11:28:44 +01:00
Antonin Raffin
161c608f9c Re-sample noise matrix for PPO 2019-12-20 11:28:20 +01:00
Antonin Raffin
e894f1f11b Add leakyclip 2019-12-19 18:20:02 +01:00
Antonin Raffin
69428346bd Bump version 2019-12-19 15:32:03 +01:00
Antonin Raffin
84ebc3d7da Relax the HardTanh limit 2019-12-19 15:28:51 +01:00
Antonin Raffin
bff0ca0ea8 Use HardTanh to relax the constrain 2019-12-19 11:59:00 +01:00
Antonin Raffin
c05c990285 Remove norm clipping 2019-12-18 16:56:51 +01:00
Antonin Raffin
e49d97bf98 Fix infs in SAC by bounding the mean 2019-12-18 13:45:33 +01:00
Antonin Raffin
57708a628c Add value function for SDE + TD3 2019-12-17 15:01:08 +01:00
Antonin Raffin
1d6f9bf100 Add sample freq for SDE 2019-12-17 11:47:21 +01:00
Antonin Raffin
919dfee452 Try to clip grad norm 2019-12-17 11:14:44 +01:00
Antonin Raffin
d63cef7693 Add gradient clipping for SAC 2019-12-06 18:32:57 +01:00
Antonin Raffin
233f346d53 Update todos 2019-12-06 17:46:56 +01:00
Antonin Raffin
6c423add8d Bump version 2019-12-05 16:44:27 +01:00
Antonin Raffin
0117cc37f4 Merge branch 'master' into feat/sde-features 2019-12-05 16:33:41 +01:00
Noah Dormann
aa67147796 clarified bytesIO use for load 2019-12-05 15:45:05 +01:00
Raffin, Antonin
bac9d4efed Update torchy_baselines/common/base_class.py 2019-12-05 14:53:14 +01:00
Raffin, Antonin
695cdc63a4 Update torchy_baselines/common/base_class.py 2019-12-05 14:52:59 +01:00
Raffin, Antonin
424a554567 Update docstring 2019-12-05 14:50:11 +01:00
Raffin, Antonin
464dd773e6 Update comment 2019-12-05 14:46:02 +01:00
Raffin, Antonin
03ecb17ef6 Update error message 2019-12-05 14:41:39 +01:00
Noah Dormann
88d4f44d55 added set_env test and set_env wrapping 2019-12-05 13:59:07 +01:00
Noah Dormann
cf1d7118a5 replaced file with file_path 2019-12-05 13:44:02 +01:00
Noah Dormann
8062ed6036 fixed load, to check if environment ist correctly 2019-12-05 13:36:19 +01:00
Noah Dormann
4b1bab7f85 implemented set_env method 2019-12-05 09:11:30 +01:00
Noah Dormann
8460bfe397 added some comments to _load_from_file 2019-12-05 08:56:04 +01:00