Raffin, Antonin
cbb0843201
Merge pull request #51 from Antonin-Raffin/fix/entropy-squashed
Fix entropy loss for squashed Gaussian and VecEnv seeding
2020-02-11 17:46:56 +01:00
Antonin Raffin
240833ffef
Add type aliases for buffer samples
2020-02-11 17:33:22 +01:00
Antonin Raffin
2ce31c1e21
Fix entropy loss for squashed Gaussian and VecEnv seeding
2020-02-11 17:22:03 +01:00
Raffin, Antonin
02a080f647
Merge pull request #50 from Antonin-Raffin/refactor/off-policy
Add Off Policy base class
2020-02-11 16:48:34 +01:00
Antonin Raffin
2afcf395b9
Update tests
2020-02-11 16:42:25 +01:00
Antonin Raffin
b7dcc8d58e
Add extend method
2020-02-11 16:40:44 +01:00
Antonin Raffin
8eb82c86e3
Save last mean reward
2020-02-11 13:22:44 +01:00
Antonin Raffin
75a86881b3
Add save/load for replay buffer
2020-02-05 13:10:02 +01:00
Antonin Raffin
31a862c3a9
Log success rate
2020-02-04 13:24:09 +01:00
Antonin Raffin
8acac6b0f4
Update docstring
2020-02-03 18:31:13 +01:00
Antonin Raffin
16121cf2b8
Create OffPolicyRLModel
2020-02-03 18:18:41 +01:00
Raffin, Antonin
9d52a7d7d6
Merge pull request #49 from Antonin-Raffin/refactor/buffers
Refactor buffers
2020-02-03 16:03:24 +01:00
Antonin Raffin
d850a35311
Update tests
2020-02-03 15:57:37 +01:00
Antonin Raffin
c2318149dd
Update changelog and version
2020-02-03 15:50:40 +01:00
Antonin Raffin
546fbca20c
Copy by default
2020-02-03 15:42:14 +01:00
Antonin Raffin
f7af08bea4
Refactor buffers to use numpy
2020-02-03 15:40:34 +01:00
Antonin Raffin
e3c5b1621e
Sync Monitor with Stable Baselines
2020-02-03 15:07:35 +01:00
Antonin Raffin
f0dba88275
Fix statement that had no effects
2020-02-03 10:51:29 +01:00
Raffin, Antonin
0143518a99
Merge pull request #47 from Antonin-Raffin/feat/callbacks
Add callback support
2020-01-31 14:09:25 +01:00
Antonin Raffin
6710f1576c
Fix eval log path
2020-01-31 13:48:25 +01:00
Antonin Raffin
ec657cc34e
Fix tests and change log_path behavior for EvalCallback
2020-01-31 13:42:04 +01:00
Antonin Raffin
5d4e73544c
Fix reset_num_timesteps
2020-01-31 13:16:28 +01:00
Antonin Raffin
6d59bfd4a0
Merge branch 'master' into feat/callbacks
2020-01-31 13:09:55 +01:00
Dormann, Noah
1f0dd60b97
Fix saving on GPU - Loading on CPU (#45)
* removed policy from save, changed th.loads to map to device
* found hack: catch the pickle exception and try th.load with mapping instead, otherwise raise an exception with more information -> loading cuda on cpu raises an exception -> leads to th.load with map being called
* deleted todo
* updated changelog
* start of saving refactor
* first working c
* all tests pass, save refactored
* - backwards compatibility not always
- make pytest all passing
- make typing all passing
* Fixes and simplify the save method
* Remove unused param
* Fix backward compat
* Fix docstring
2020-01-31 13:06:55 +01:00
Antonin Raffin
6ae8421615
Update docstring
2020-01-28 10:28:44 +01:00
Antonin Raffin
5c94a225ef
Minor edit to the doc
2020-01-28 10:24:02 +01:00
Antonin Raffin
98037352f5
Update changelog
2020-01-27 15:57:34 +01:00
Antonin Raffin
a628354721
Refactor evaluation
2020-01-27 15:53:27 +01:00
Antonin Raffin
d514cd9126
Add templates
2020-01-27 14:36:11 +01:00
Antonin Raffin
b66003cfb3
Add callback support
2020-01-27 14:32:31 +01:00
Raffin, Antonin
cc3b023533
Merge pull request #44 from Antonin-Raffin/typing
Add typing and update requirement to python 3.6
2020-01-27 11:57:48 +01:00
Antonin Raffin
fb57a6b80c
Update docstring
2020-01-23 11:20:12 +01:00
Antonin Raffin
7265d9e352
Fix multiline f-string
2020-01-23 10:56:53 +01:00
Antonin Raffin
ff0eddfb17
Partially type base class
2020-01-22 17:51:27 +01:00
Antonin Raffin
0328a39d1b
Update changelog
2020-01-22 17:25:08 +01:00
Antonin Raffin
9345b85cfc
Update changelog and README
2020-01-22 17:23:42 +01:00
Antonin Raffin
44fce7c02a
Fix typing errors and typos
2020-01-22 17:17:12 +01:00
Antonin Raffin
88f07bafb6
Convert format to f-strings
2020-01-22 16:39:25 +01:00
Antonin Raffin
37f9f13684
Revert all changes for python 2
+ Add makefile and pytype
2020-01-22 16:18:27 +01:00
Raffin, Antonin
8152b34aaa
Merge pull request #41 from Antonin-Raffin/docs/build
Build documentation
2020-01-20 16:21:10 +01:00
Antonin Raffin
9e250b6818
Build doc
2020-01-20 16:19:35 +01:00
Antonin Raffin
b8df12afe2
Release v0.1.0
2020-01-20 13:01:14 +01:00
Raffin, Antonin
358b27e9c9
Merge pull request #6 from Antonin-Raffin/feat/sde-features
Feature Extract for SDE
2020-01-20 13:00:18 +01:00
Antonin Raffin
0bed698ec5
Raise error for abstract methods
2020-01-20 12:57:40 +01:00
Antonin Raffin
e5c6601726
Update VecNormalize (pickling) and improve tests
2020-01-20 11:58:16 +01:00
Antonin Raffin
89db65b1fb
Improve logger testing + add readers
2020-01-20 11:58:00 +01:00
Antonin Raffin
c542009641
Clean up code + bug fixes
2020-01-20 11:17:55 +01:00
Antonin Raffin
ea20721632
Add TODO
2020-01-15 15:58:45 +01:00
Antonin Raffin
03e853997a
Add squash_output and expln as policy param for ppo and a2c
2020-01-15 13:21:20 +01:00
Antonin Raffin
60d5f4463d
Add use_expln option for td3
2020-01-08 17:04:28 +01:00