Commit graph

86 commits

Author SHA1 Message Date
Noah Dormann
4f8f936451 Don't save replay_buffer by default 2019-11-21 17:27:46 +01:00
Noah Dormann
cfb822aa91 Corrected test_run.py 2019-11-21 16:54:30 +01:00
Noah Dormann
924ba9aea6 cleaned comments on model specific get and load functions 2019-11-21 16:50:59 +01:00
Noah Dormann
03a0d437ef refactor 2019-11-21 16:49:49 +01:00
Noah Dormann
2d72f6d1b5 Added SAC, TD3, A2C
Missing CEMRL
2019-11-21 16:46:53 +01:00
Noah Dormann
775a50cc5c saving all variables now added a2c support 2019-11-21 16:24:18 +01:00
Noah Dormann
526c37bf1f refactored the assets in test_save_load
fixed base_class 'params.pth'
2019-11-21 15:44:57 +01:00
Noah Dormann
26f31fd25b corrected comment sections 2019-11-21 14:55:56 +01:00
Noah Dormann
d31a399140 undo changes to conf.py 2019-11-21 14:52:29 +01:00
Noah Dormann
17f84053b3 save implementation for a2c needed before uncommenting save and load test in test_run.py::test_onpolicy 2019-11-21 14:44:02 +01:00
Noah Dormann
fb5f192fc4 Implemented Changes suggested from Antonin-Raffin
Added Optimizer saving
2019-11-21 14:39:44 +01:00
Noah Dormann
a7655ca6e1 Reformated every file with PEP 8 errors 2019-11-21 13:01:03 +01:00
Noah Dormann
b20b70db48 Clean reformat 2019-11-21 11:51:47 +01:00
Noah Dormann
5bca52a87d rearranged imports 2019-11-21 11:44:37 +01:00
Noah Dormann
4b6234a1c8 finished test_save_load.py test 2019-11-21 11:39:47 +01:00
Noah Dormann
6cf80ccfe2 reordered imports 2019-11-12 17:12:10 +01:00
Noah Dormann
cc744a48b5 first save and load features 2019-11-12 17:03:57 +01:00
Raffin, Antonin
701daa8cb8 Merge pull request #1 from Antonin-Raffin/feat/a2c
Add A2C algorithm
2019-10-28 18:18:55 +01:00
Antonin Raffin
df1e7aa000 Add docstring 2019-10-28 17:42:39 +01:00
Antonin Raffin
d67822718c Add learning rate schedule 2019-10-28 16:47:13 +01:00
Antonin Raffin
799e30ff3d Bug fixes for A2C and PPO 2019-10-28 14:27:32 +01:00
Antonin Raffin
b150167bdd Update default hyperparams 2019-10-25 13:01:00 +02:00
Antonin Raffin
584f549fa1 Bug fix for discrete actions 2019-10-25 12:00:37 +02:00
Antonin Raffin
f8bcb8ee16 Update A2C params 2019-10-25 11:31:20 +02:00
Antonin Raffin
0ad743c85d Add A2C 2019-10-25 10:59:15 +02:00
Antonin RAFFIN
3bc746c6ee Add logger for PPO 2019-10-17 13:44:48 +02:00
Antonin RAFFIN
53898f3d1a Add flexible mlp 2019-10-17 13:32:25 +02:00
Antonin Raffin
64de9923d6 Buf fixes for python 2 2019-10-15 13:24:53 +02:00
Antonin Raffin
ab64ff464e Add tensorboard_log dummy arg 2019-10-14 11:09:22 +02:00
Antonin Raffin
b5656531d1 Enable logger for SAC/TD3 + refactor 2019-10-10 13:47:13 +02:00
Antonin Raffin
dbaa5daca6 Add logger and Monitor wrapper 2019-10-10 13:41:54 +02:00
Antonin Raffin
ef50bb81e8 Add support for categorical distribution 2019-10-08 13:06:38 +02:00
Antonin Raffin
4d0c033bf2 Bug fix when randomly sampling actions 2019-10-07 16:36:48 +02:00
Antonin Raffin
37ab9d10f1 Rescale actions and add action noise 2019-10-07 16:26:03 +02:00
Antonin RAFFIN
12f854e1aa Fix learning starts 2019-10-01 21:56:37 +02:00
Antonin Raffin
440166fe26 Add a parameter to disable ortho init 2019-09-26 16:29:47 +02:00
Antonin Raffin
b4dc9d4e4d Add doc 2019-09-26 11:46:40 +02:00
Antonin Raffin
70e5de1d1b Update SAC defaults 2019-09-25 17:07:54 +02:00
Antonin Raffin
0e4fc9c0ac Bug fix SAC 2019-09-25 13:30:08 +02:00
Antonin Raffin
322399e8fe Update collect rollout 2019-09-25 13:20:06 +02:00
Antonin Raffin
6bfbb7198a Rename seed 2019-09-24 16:59:47 +02:00
Antonin Raffin
32648d9029 Add docstrings 2019-09-24 15:30:58 +02:00
Antonin Raffin
f4fe1362f0 Renaming 2019-09-24 14:53:03 +02:00
Antonin Raffin
d22caac616 Working SAC 2019-09-24 14:15:12 +02:00
Antonin RAFFIN
98e9560913 Remove note 2019-09-22 21:33:35 +02:00
Antonin RAFFIN
1bd2e42995 Add a note for squashed gaussian 2019-09-22 14:15:22 +02:00
Antonin RAFFIN
b157b4465a Add logo 2019-09-22 13:57:18 +02:00
Antonin RAFFIN
7627a8644c Add roadmap 2019-09-22 13:43:01 +02:00
Antonin RAFFIN
8adb8f9931 Change default dist to gaussian 2019-09-22 12:56:27 +02:00
Antonin RAFFIN
ddaafcbc36 Refactor: add distributions 2019-09-22 12:52:49 +02:00