Quentin Gallouédec
f3abda5cbc
Fix Self return type ( #1167 )
...
* Fix Self annotation
* Update changelog
* Define type var on top
* ClassSelf to SelfClass
* annotate self
* Revert Running meanstd change
* Revert vecnormalize change (static method rejected)
Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2022-11-22 13:42:39 +01:00
Quentin Gallouédec
c4f54fcf04
Handling multi-dimensional action spaces ( #971 )
...
* Handle non 1D action shape
* Revert changes of observation (out of the scope of this PR)
* Apply changes to DictReplayBuffer
* Update tests
* Rollout buffer n-D actions space handling
* Remove error when non 1D action space
* ActorCriticPolicy return action with the proper shape
* remove useless reshape
* Update changelog
* Add tests
Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2022-08-06 14:19:20 +02:00
Quentin Gallouédec
fda3d4d748
Fix returned type in predict ( #964 )
...
* `arr[0]` to `arr.squeeze(0)`
* `squeeze(axis=0)` to `squeeze(0)`
* Type testing
* Add type test for unvectorized observation
* `squeeze(0)` to `squeeze(axis=0)`
* Treatment of the laziness symptoms
* Update changelog
* Udate changelog
Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2022-07-18 11:22:19 +02:00
Antonin RAFFIN
c1f1c3d3d7
Release v1.6.0 ( #958 )
...
* Release v1.6.0 + update doc + add copy button
* Update read the doc conda env
* Update year
* Fix bug in kl divergence check
* Rephrase requirement for envpool and isaac gym
2022-07-12 22:50:23 +02:00
Antonin RAFFIN
a6f5049a99
Upgrade code to Python 3.7+ syntax using pyupgrade ( #887 )
...
* Upgrade code to Python 3.7+ syntax
* Update changelog
2022-04-25 13:01:38 +03:00
Adam Gleave
f488d0772a
Autoformat code with black (new version complains about new things) ( #757 )
...
* Blacken code
* Fix GitLab CI: switch to Docker container with new black version
2022-02-04 02:56:06 +02:00
Rohan Tangri
df6f9de8f4
KL Divergence Helper Function ( #431 )
...
* add kl divergence wrapper
* add test
* update changelog
* black lint
* remove unused import
* Fix ent coef loading for SAC (#429 )
* Fix ent coef loading for SAC
* Better fix and add comment
* add 'distribution' to base Distribution class
* add sample test
* revert to plain pytorch implementation
* black reformat
* Update docs/misc/changelog.rst
Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
* Doc update (custom policy + fix her example) (#436 )
* isort and black reformat
* float -> bool tensor
* add sanity test
* more concise kl code
* remove outdated comment
* all -> allclose assertion
Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
* Fix PyTorch warning
* Update gSDE entropy test
* Update entropy test
Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
Co-authored-by: Antonin Raffin <antonin.raffin@dlr.de>
2021-05-20 19:01:07 +02:00
Antonin RAFFIN
c62e9259db
Add custom objects support + bug fix ( #336 )
...
* Add support for custom objects
* Add python 3.8 to the CI
* Bump version
* PyType fixes
* [ci skip] Fix typo
* Add note about slow-down + fix typos
* Minor edits to the doc
* Bug fix for DQN
* Update test
* Add test for custom objects
2021-03-06 15:17:43 +02:00
Antonin RAFFIN
0fc0dd1b21
Fix off policy features extractor ( #198 )
...
* Faster tests
* Fix feature extractor bug + add check
* Add missing check
* Allow TD3 features extractor to be separate
* Add share features extractor option for SAC
* Bug fixes
* Apply suggestions from code review
Co-authored-by: Adam Gleave <adam@gleave.me>
Co-authored-by: Adam Gleave <adam@gleave.me>
2020-10-27 14:24:59 +01:00
Antonin RAFFIN
a1e055695c
Improve typing coverage ( #175 )
...
* Improve typing coverage
* Even more types
* Fixes
* Update changelog
* Unified docstrings
* Improve error messages for unsupported spaces
2020-10-07 10:51:49 +02:00
Antonin RAFFIN
55912576ed
Cleanup docstring types ( #169 )
...
* Cleanup docstring types
* Update style
* Test with js hack
* Revert "Test with js hack"
This reverts commit d091f438e8851ab8d01b66628e06a104f5e5ec69.
* Fix types
* Fix typo
* Update CONTRIBUTING example
2020-10-02 20:05:55 +03:00
Antonin RAFFIN
2c924f52f5
Update docs (custom policy, type hints) ( #167 )
...
* Change import
* Update custom policy doc
* Re-enable sphinx_autodoc_typehints
* Update docker image
* Attempt to fix read the doc build error
* Add sphinx_autodoc_typehints to read the doc env
* Fix pip version
* Add full custom policy example
* Fix
2020-09-29 20:41:14 +03:00
Antonin RAFFIN
23afedb254
Auto-formatting with black and isort ( #97 )
...
* Add auto formatting with black and isort
* Reformat code
* Ignore typing errors
* Add note about line length
* Add minimum version for isort
* Add commit-checks
* Update docker image
* Fixed lost import (during last merge)
* Fix opencv dependency
2020-07-16 16:12:16 +02:00
Adam Gleave
91bbc28c0f
Address minor issues after clarification by @araffin
2020-07-07 18:39:55 -07:00
Adam Gleave
cc7a58bc5f
Bugfix
2020-07-02 20:28:48 -07:00
Adam Gleave
7ba48dce48
Review distributions
2020-07-02 19:18:51 -07:00
Antonin RAFFIN
494ebfd20a
Hotfix PPO + gSDE ( #53 )
...
* Fix variable being passed with gradients
* Update changelog
* Bump version
* Fixes #54
2020-06-10 18:58:35 +02:00
Roland Gavrilescu
91adefdb4b
Support for MultiBinary / MultiDiscrete spaces ( #13 )
...
* multicategorical dist and test
* fixed List annotation
* bernoulli dist and test
* added distributions to preprocessing (needs testing)
* fixed and tested distributions
* added changelog and fixed ppo policy
* minor fix
* dist fixes, added test_spaces
* clean up
* modified changelog
* additional fixes
* minor changelog mod
* hot encoding fix, flake8 clean up
* lint tests
* preprocessing fix
* fixed bernoulli bug
* removed commented prints
* Update changelog.rst
* included suggested modifications
* linting fix
* increased space dim
* Update doc and tests
Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2020-05-18 14:42:13 +02:00
Antonin RAFFIN
15ff6d47ee
Documentation update and style fixes ( #21 )
...
* Update doc: add gSDE
* Fix codestyle
* Remove travis script
* Add lint check to gitlab
2020-05-15 13:54:06 +02:00
Antonin RAFFIN
a4a4afcc3d
Fix typos
2020-05-08 12:28:41 +02:00
Antonin RAFFIN
d542732c8d
Rename to stable-baselines3
2020-05-05 15:02:35 +02:00