449 commits

Author SHA1 Message Date
Antonin Raffin
d850a35311 Update tests 2020-02-03 15:57:37 +01:00
Antonin Raffin
c2318149dd Update changelog and version 2020-02-03 15:50:40 +01:00
Antonin Raffin
546fbca20c Copy by default 2020-02-03 15:42:14 +01:00
Antonin Raffin
f7af08bea4 Refactor buffers to use numpy 2020-02-03 15:40:34 +01:00
Antonin Raffin
e3c5b1621e Sync Monitor with Stable Baselines 2020-02-03 15:07:35 +01:00
Antonin Raffin
f0dba88275 Fix statement that had no effects 2020-02-03 10:51:29 +01:00
Raffin, Antonin
0143518a99 Merge pull request #47 from Antonin-Raffin/feat/callbacks
Add callback support
2020-01-31 14:09:25 +01:00
Antonin Raffin
6710f1576c Fix eval log path 2020-01-31 13:48:25 +01:00
Antonin Raffin
ec657cc34e Fix tests and change log_path behavior for EvalCallback 2020-01-31 13:42:04 +01:00
Antonin Raffin
5d4e73544c Fix reset_num_timesteps 2020-01-31 13:16:28 +01:00
Antonin Raffin
6d59bfd4a0 Merge branch 'master' into feat/callbacks 2020-01-31 13:09:55 +01:00
Dormann, Noah
1f0dd60b97 Fix saving on GPU - Loading on CPU (#45)
* removed policy from save, changed th.loads to map to device

* found hack: catch pickle exception and trying th.load with mapping instead, otherwise raise exception with more information -> loading cuda on cpu raises exception -> leads to th.load with map being called

* deleted todo

* updated changelog

* start of saving refactor

* first working c

* all tests pass, save refactored

* - backwards compatibility not always
- make pytest all passing
- make typing all passing

* Fixes and simplify the save method

* Remove unused param

* Fix backward compat

* Fix docstring
2020-01-31 13:06:55 +01:00
Antonin Raffin
6ae8421615 Update docstring 2020-01-28 10:28:44 +01:00
Antonin Raffin
5c94a225ef Minor edit to the doc 2020-01-28 10:24:02 +01:00
Antonin Raffin
98037352f5 Update changelog 2020-01-27 15:57:34 +01:00
Antonin Raffin
a628354721 Refactor evaluation 2020-01-27 15:53:27 +01:00
Antonin Raffin
d514cd9126 Add templates 2020-01-27 14:36:11 +01:00
Antonin Raffin
b66003cfb3 Add callback support 2020-01-27 14:32:31 +01:00
Raffin, Antonin
cc3b023533 Merge pull request #44 from Antonin-Raffin/typing
Add typing and update requirement to python 3.6
2020-01-27 11:57:48 +01:00
Antonin Raffin
fb57a6b80c Update docstring 2020-01-23 11:20:12 +01:00
Antonin Raffin
7265d9e352 Fix multiline f-string 2020-01-23 10:56:53 +01:00
Antonin Raffin
ff0eddfb17 Partially type base class 2020-01-22 17:51:27 +01:00
Antonin Raffin
0328a39d1b Update changelog 2020-01-22 17:25:08 +01:00
Antonin Raffin
9345b85cfc Update changelog and README 2020-01-22 17:23:42 +01:00
Antonin Raffin
44fce7c02a Fix typing errors and typos 2020-01-22 17:17:12 +01:00
Antonin Raffin
88f07bafb6 Convert format to f-strings 2020-01-22 16:39:25 +01:00
Antonin Raffin
37f9f13684 Revert all changes for python 2
+ Add makefile and pytype
2020-01-22 16:18:27 +01:00
Raffin, Antonin
8152b34aaa Merge pull request #41 from Antonin-Raffin/docs/build
Build documentation
2020-01-20 16:21:10 +01:00
Antonin Raffin
9e250b6818 Build doc 2020-01-20 16:19:35 +01:00
Antonin Raffin
b8df12afe2 Release v0.1.0 2020-01-20 13:01:14 +01:00
Raffin, Antonin
358b27e9c9 Merge pull request #6 from Antonin-Raffin/feat/sde-features
Feature Extract for SDE
2020-01-20 13:00:18 +01:00
Antonin Raffin
0bed698ec5 Raise error for abstract methods 2020-01-20 12:57:40 +01:00
Antonin Raffin
e5c6601726 Update VecNormalize (pickling) and improve tests 2020-01-20 11:58:16 +01:00
Antonin Raffin
89db65b1fb Improve logger testing + add readers 2020-01-20 11:58:00 +01:00
Antonin Raffin
c542009641 Clean up code + bug fixes 2020-01-20 11:17:55 +01:00
Antonin Raffin
ea20721632 Add TODO 2020-01-15 15:58:45 +01:00
Antonin Raffin
03e853997a Add squash_output and expln as policy param for ppo and a2c 2020-01-15 13:21:20 +01:00
Antonin Raffin
60d5f4463d Add use_expln option for td3 2020-01-08 17:04:28 +01:00
Antonin Raffin
d3a718b94e Add extra dependency 2020-01-08 11:26:57 +01:00
Antonin Raffin
299ca007b5 Add comment about warmup phase 2020-01-07 17:36:26 +01:00
Antonin Raffin
8831eff163 Unify evaluation 2020-01-07 14:00:03 +01:00
Antonin RAFFIN
aa7b91333e Add seeding for subproc vecenv 2019-12-30 12:01:37 +01:00
Antonin RAFFIN
4a79f7e5a7 Print std reward for evaluation 2019-12-24 13:12:04 +01:00
Antonin RAFFIN
57c890f3e9 LeakyClip not working yet 2019-12-22 14:38:30 +01:00
Antonin RAFFIN
3a7508ac16 Fix double clip 2019-12-22 13:56:30 +01:00
Antonin Raffin
f6c475a44b Add use_expln as a policy argument 2019-12-20 18:10:24 +01:00
Antonin Raffin
7f34108ed6 Fix exp_ln computation 2019-12-20 18:02:01 +01:00
Antonin Raffin
9b3b34c9c4 Sample batch_size noise matrices for SAC 2019-12-20 11:28:44 +01:00
Antonin Raffin
161c608f9c Re-sample noise matrix for PPO 2019-12-20 11:28:20 +01:00
Antonin Raffin
e894f1f11b Add leakyclip 2019-12-19 18:20:02 +01:00