stable-baselines3

mirror of https://github.com/saymrwulf/stable-baselines3.git synced 2026-07-12 17:58:00 +00:00

History

Quentin Gallouédec c4f54fcf04 Handling multi-dimensional action spaces (#971 ) * Handle non 1D action shape * Revert changes of observation (out of the scope of this PR) * Apply changes to DictReplayBuffer * Update tests * Rollout buffer n-D actions space handling * Remove error when non 1D action space * ActorCriticPolicy return action with the proper shape * remove useless reshape * Update changelog * Add tests Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>		2022-08-06 14:19:20 +02:00
..
envs	Upgrade code to Python 3.7+ syntax using `pyupgrade` (#887 )	2022-04-25 13:01:38 +03:00
sb2_compat	Upgrade code to Python 3.7+ syntax using `pyupgrade` (#887 )	2022-04-25 13:01:38 +03:00
vec_env	Fix synchronization bug with EvalCallback (#907 )	2022-05-08 21:54:34 +03:00
__init__.py	Update docs (custom policy, type hints) (#167 )	2020-09-29 20:41:14 +03:00
atari_wrappers.py	Upgrade code to Python 3.7+ syntax using `pyupgrade` (#887 )	2022-04-25 13:01:38 +03:00
base_class.py	Use higher resolution time_ns() and avoid division by zero (#979 )	2022-07-25 23:02:53 +02:00
buffers.py	Handling multi-dimensional action spaces (#971 )	2022-08-06 14:19:20 +02:00
callbacks.py	Fix exception cause in base_class.py (#940 )	2022-06-21 20:58:02 +01:00
distributions.py	Handling multi-dimensional action spaces (#971 )	2022-08-06 14:19:20 +02:00
env_checker.py	Fix exception cause in base_class.py (#940 )	2022-06-21 20:58:02 +01:00
env_util.py	Added wrapper_kwargs argument to make_vec_env (#448 )	2021-05-23 11:33:34 +02:00
evaluation.py	Fix evaluation script for recurrent policies (#678 )	2021-11-30 13:49:06 +01:00
logger.py	Add doc to use mlflow logger (#889 )	2022-05-08 15:28:31 +02:00
monitor.py	Upgrade code to Python 3.7+ syntax using `pyupgrade` (#887 )	2022-04-25 13:01:38 +03:00
noise.py	Fix exception cause in base_class.py (#940 )	2022-06-21 20:58:02 +01:00
off_policy_algorithm.py	Use higher resolution time_ns() and avoid division by zero (#979 )	2022-07-25 23:02:53 +02:00
on_policy_algorithm.py	Use higher resolution time_ns() and avoid division by zero (#979 )	2022-07-25 23:02:53 +02:00
policies.py	Handling multi-dimensional action spaces (#971 )	2022-08-06 14:19:20 +02:00
preprocessing.py	Documentation update (#450 )	2021-05-23 13:13:11 +02:00
results_plotter.py	Fix default arguments + add bugbear (#363 )	2021-03-25 11:35:21 +02:00
running_mean_std.py	Upgrade code to Python 3.7+ syntax using `pyupgrade` (#887 )	2022-04-25 13:01:38 +03:00
save_util.py	Fix exception cause in base_class.py (#940 )	2022-06-21 20:58:02 +01:00
torch_layers.py	Replace "nature" with "Nature" (magazine) to reduce confusion (#965 )	2022-07-15 22:48:27 +02:00
type_aliases.py	Multiprocessing support for off policy algorithms (#439 )	2021-12-01 22:30:09 +01:00
utils.py	Escape tensorboard log name (#857 )	2022-04-11 21:49:18 +02:00