Antonin Raffin
a2b1bf06d3
Add squash_output attribute to policy
2020-02-14 11:12:07 +01:00
Antonin Raffin
aa8b4eb22a
Reformat and type the distributions
2020-02-13 13:46:22 +01:00
Antonin Raffin
f1a4fa2d3f
Improve predict method
2020-02-12 15:25:05 +01:00
Antonin Raffin
7bafdb3a67
Add get_vec_normalize_env()
2020-02-12 11:34:29 +01:00
Antonin Raffin
75a86881b3
Add save/load for replay buffer
2020-02-05 13:10:02 +01:00
Antonin Raffin
31a862c3a9
Log success rate
2020-02-04 13:24:09 +01:00
Antonin Raffin
8acac6b0f4
Update docstring
2020-02-03 18:31:13 +01:00
Antonin Raffin
16121cf2b8
Create OffPolicyRLModel
2020-02-03 18:18:41 +01:00
Antonin Raffin
6710f1576c
Fix eval log path
2020-01-31 13:48:25 +01:00
Antonin Raffin
5d4e73544c
Fix reset_num_timesteps
2020-01-31 13:16:28 +01:00
Antonin Raffin
6d59bfd4a0
Merge branch 'master' into feat/callbacks
2020-01-31 13:09:55 +01:00
Dormann, Noah
1f0dd60b97
Fix saving on GPU - Loading on CPU ( #45 )
...
* removed policy from save, changed th.loads to map to device
* found hack: catch pickle exception and trying th.load with mapping instead, otherwise raise exception with more information -> loading cuda on cpu raises exception -> leads to th.load with map being called
* deleted todo
* updated changelog
* start of saving refactor
* first working c
* all tests pass, save refactored
* - backwards compatibilty not always
- make pytest all passing
- make typing all passing
* Fixes and simplify the save method
* Remove unused param
* Fix backward compat
* Fix docstring
2020-01-31 13:06:55 +01:00
Antonin Raffin
6ae8421615
Update docstring
2020-01-28 10:28:44 +01:00
Antonin Raffin
5c94a225ef
Minor edit to the doc
2020-01-28 10:24:02 +01:00
Antonin Raffin
a628354721
Refactor evaluation
2020-01-27 15:53:27 +01:00
Antonin Raffin
b66003cfb3
Add callback support
2020-01-27 14:32:31 +01:00
Antonin Raffin
7265d9e352
Fix multiline f-string
2020-01-23 10:56:53 +01:00
Antonin Raffin
ff0eddfb17
Partially type base class
2020-01-22 17:51:27 +01:00
Antonin Raffin
44fce7c02a
Fix typing errors and typos
2020-01-22 17:17:12 +01:00
Antonin Raffin
88f07bafb6
Convert format to f-strings
2020-01-22 16:39:25 +01:00
Antonin Raffin
37f9f13684
Revert all changes for python 2
...
+ Add makefile and pytype
2020-01-22 16:18:27 +01:00
Antonin Raffin
9e250b6818
Build doc
2020-01-20 16:19:35 +01:00
Antonin Raffin
0bed698ec5
Raise error for abstract methods
2020-01-20 12:57:40 +01:00
Antonin Raffin
c542009641
Clean up code + bug fixes
2020-01-20 11:17:55 +01:00
Antonin Raffin
ea20721632
Add TODO
2020-01-15 15:58:45 +01:00
Antonin Raffin
299ca007b5
Add comment about warmup phase
2020-01-07 17:36:26 +01:00
Antonin Raffin
8831eff163
Unify evaluation
2020-01-07 14:00:03 +01:00
Antonin Raffin
57708a628c
Add value function for SDE + TD3
2019-12-17 15:01:08 +01:00
Antonin Raffin
1d6f9bf100
Add sample freq for SDE
2019-12-17 11:47:21 +01:00
Antonin Raffin
0117cc37f4
Merge branch 'master' into feat/sde-features
2019-12-05 16:33:41 +01:00
Noah Dormann
aa67147796
clarified bytesIO use for load
2019-12-05 15:45:05 +01:00
Raffin, Antonin
bac9d4efed
Update torchy_baselines/common/base_class.py
2019-12-05 14:53:14 +01:00
Raffin, Antonin
695cdc63a4
Update torchy_baselines/common/base_class.py
2019-12-05 14:52:59 +01:00
Raffin, Antonin
424a554567
Update docstring
2019-12-05 14:50:11 +01:00
Raffin, Antonin
464dd773e6
Update comment
2019-12-05 14:46:02 +01:00
Raffin, Antonin
03ecb17ef6
Update error message
2019-12-05 14:41:39 +01:00
Noah Dormann
88d4f44d55
added set_env test and set_env wrapping
2019-12-05 13:59:07 +01:00
Noah Dormann
cf1d7118a5
replaced file with file_path
2019-12-05 13:44:02 +01:00
Noah Dormann
8062ed6036
fixed load, to check if environment ist correctly
2019-12-05 13:36:19 +01:00
Noah Dormann
4b1bab7f85
implemented set_env method
2019-12-05 09:11:30 +01:00
Noah Dormann
8460bfe397
added some comments to _load_from_file
2019-12-05 08:56:04 +01:00
Noah Dormann
6560ae9952
using other_file instead of other_files
2019-12-05 08:52:26 +01:00
Noah Dormann
7c8d375bcb
added get_parameter_list function
2019-12-05 08:50:11 +01:00
Noah Dormann
c3b0398d56
Changed load so it still works when env not saved
...
improved save function
2019-12-05 08:40:28 +01:00
Dormann, Noah
bea2799691
comment refactoring
...
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
2019-12-05 08:10:39 +01:00
Dormann, Noah
a756f40223
comment refactoring
...
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
2019-12-05 08:10:25 +01:00
Dormann, Noah
fdb544e775
comment refactoring
...
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
2019-12-05 08:10:14 +01:00
Dormann, Noah
aa66d2f82e
comment refactoring
...
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
2019-12-05 08:09:51 +01:00
Dormann, Noah
85d3743224
added standart exclude parameters
...
Co-Authored-By: Raffin, Antonin <Antonin.Raffin@dlr.de>
2019-12-05 08:09:26 +01:00
Noah Dormann
6928879f5a
Refactored doc-strings
2019-11-28 16:30:13 +01:00