stable-baselines3/stable_baselines3/common
Quentin Gallouédec c4f54fcf04
Handling multi-dimensional action spaces (#971)
* Handle non 1D action shape

* Revert changes of observation (out of the scope of this PR)

* Apply changes  to DictReplayBuffer

* Update tests

* Rollout buffer n-D actions space handling

* Remove error when non 1D action space

* ActorCriticPolicy return action with the proper shape

* remove useless reshape

* Update changelog

* Add tests

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2022-08-06 14:19:20 +02:00
..
envs Upgrade code to Python 3.7+ syntax using pyupgrade (#887) 2022-04-25 13:01:38 +03:00
sb2_compat Upgrade code to Python 3.7+ syntax using pyupgrade (#887) 2022-04-25 13:01:38 +03:00
vec_env Fix synchronization bug with EvalCallback (#907) 2022-05-08 21:54:34 +03:00
__init__.py Update docs (custom policy, type hints) (#167) 2020-09-29 20:41:14 +03:00
atari_wrappers.py Upgrade code to Python 3.7+ syntax using pyupgrade (#887) 2022-04-25 13:01:38 +03:00
base_class.py Use higher resolution time_ns() and avoid division by zero (#979) 2022-07-25 23:02:53 +02:00
buffers.py Handling multi-dimensional action spaces (#971) 2022-08-06 14:19:20 +02:00
callbacks.py Fix exception cause in base_class.py (#940) 2022-06-21 20:58:02 +01:00
distributions.py Handling multi-dimensional action spaces (#971) 2022-08-06 14:19:20 +02:00
env_checker.py Fix exception cause in base_class.py (#940) 2022-06-21 20:58:02 +01:00
env_util.py Added wrapper_kwargs argument to make_vec_env (#448) 2021-05-23 11:33:34 +02:00
evaluation.py Fix evaluation script for recurrent policies (#678) 2021-11-30 13:49:06 +01:00
logger.py Add doc to use mlflow logger (#889) 2022-05-08 15:28:31 +02:00
monitor.py Upgrade code to Python 3.7+ syntax using pyupgrade (#887) 2022-04-25 13:01:38 +03:00
noise.py Fix exception cause in base_class.py (#940) 2022-06-21 20:58:02 +01:00
off_policy_algorithm.py Use higher resolution time_ns() and avoid division by zero (#979) 2022-07-25 23:02:53 +02:00
on_policy_algorithm.py Use higher resolution time_ns() and avoid division by zero (#979) 2022-07-25 23:02:53 +02:00
policies.py Handling multi-dimensional action spaces (#971) 2022-08-06 14:19:20 +02:00
preprocessing.py Documentation update (#450) 2021-05-23 13:13:11 +02:00
results_plotter.py Fix default arguments + add bugbear (#363) 2021-03-25 11:35:21 +02:00
running_mean_std.py Upgrade code to Python 3.7+ syntax using pyupgrade (#887) 2022-04-25 13:01:38 +03:00
save_util.py Fix exception cause in base_class.py (#940) 2022-06-21 20:58:02 +01:00
torch_layers.py Replace "nature" with "Nature" (magazine) to reduce confusion (#965) 2022-07-15 22:48:27 +02:00
type_aliases.py Multiprocessing support for off policy algorithms (#439) 2021-12-01 22:30:09 +01:00
utils.py Escape tensorboard log name (#857) 2022-04-11 21:49:18 +02:00