Anssi
2cd6a4f93b
Match performance with stable-baselines (discrete case) ( #110 )
...
* Fix storing correct episode dones
* Fix number of filters in NatureCNN network
* Add TF-like RMSprop for matching performance with sb2
* Remove stuff that was accidentally included
* Reformat
* Clarify variable naming
* Update changelog
* Add comment on RMSprop implementations to A2C
* Add test for RMSpropTFLike
Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2020-08-03 22:22:51 +02:00
Tarik Kelestemur
b1322ff5d6
Fix cmd_util.py imports ( #24 )
...
* fix cmd_util.py imports
* Update changelog.rst
Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2020-05-19 10:19:16 +02:00
Roland Gavrilescu
91adefdb4b
Support for MultiBinary / MultiDiscrete spaces ( #13 )
...
* multicategorical dist and test
* fixed List annotation
* bernoulli dist and test
* added distributions to preprocessing (needs testing)
* fixed and tested distributions
* added changelog and fixed ppo policy
* minor fix
* dist fixes, added test_spaces
* clean up
* modified changelog
* additional fixes
* minor changelog mod
* hot encoding fix, flake8 clean up
* lint tests
* preprocessing fix
* fixed bernoulli bug
* removed commented prints
* Update changelog.rst
* included suggested modifications
* linting fix
* increased space dim
* Update doc and tests
Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2020-05-18 14:42:13 +02:00
Antonin RAFFIN
d17f29c8ad
Add base doc
2020-05-07 10:10:51 +02:00
Antonin RAFFIN
d542732c8d
Rename to stable-baselines3
2020-05-05 15:02:35 +02:00
Antonin Raffin
9e250b6818
Build doc
2020-01-20 16:19:35 +01:00