stable-baselines3/stable_baselines3/ppo
Cody Wild b1aee71772
Improve error messages when PPO effective batch size is 1 and when last mini-batch is truncated (#270)
* Add warning about total_env_steps not dividing neatly into batch size

* Stylistic cleanup

* Black reformatting

* Add clearer documentation and update changelog

* Update changelog.rst

* Use specific RolloutBuffer terminology

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>

* Change to minibatch language

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>

* Cleaning up language describing rollout buffer requirements

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>

* Switch to using env.num_envs

* Working tests

* Black and isort still fighting each other

* codestyle finally happy

* Basic test exists, possibly in the wrong file

* Update phrasing

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2021-01-11 17:03:32 +01:00
__init__.py Auto-formatting with black and isort (#97) 2020-07-16 16:12:16 +02:00
policies.py Auto-formatting with black and isort (#97) 2020-07-16 16:12:16 +02:00
ppo.py Improve error messages when PPO effective batch size is 1 and when last mini-batch is truncated (#270) 2021-01-11 17:03:32 +01:00
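
The latest commit on ppo.py describes two checks: an error when the effective batch size is 1, and a warning when the rollout buffer does not divide evenly into mini-batches. A minimal sketch of that kind of check follows. The helper name check_rollout_buffer, its signature, and the message wording are assumptions made for illustration, not the code in ppo.py. The buffer size is taken to be n_steps * n_envs, following the commit's "Switch to using env.num_envs" note.

```python
import warnings


def check_rollout_buffer(n_steps: int, n_envs: int, batch_size: int) -> int:
    """Hypothetical helper: validate rollout buffer and mini-batch sizes.

    Returns the size of the truncated last mini-batch (0 if there is none).
    """
    if batch_size < 1:
        raise ValueError(f"batch_size must be at least 1, got {batch_size}")

    # Assumption: one rollout stores n_steps transitions per environment.
    buffer_size = n_steps * n_envs
    if buffer_size <= 1:
        raise ValueError(
            "n_steps * n_envs must be greater than 1, "
            f"got n_steps={n_steps} and n_envs={n_envs}"
        )

    # Transitions left over after filling as many full mini-batches as possible.
    remainder = buffer_size % batch_size
    if remainder > 0:
        full_batches = buffer_size // batch_size
        warnings.warn(
            f"Mini-batch size {batch_size} does not evenly divide the rollout "
            f"buffer size {buffer_size}: after {full_batches} full mini-batches "
            f"there will be a truncated mini-batch of size {remainder}."
        )
    return remainder
```

For example, check_rollout_buffer(2048, 1, 64) returns 0 without warning, while check_rollout_buffer(5, 2, 4) warns and returns 2 because a buffer of 10 transitions leaves 2 after two mini-batches of 4.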