stable-baselines3

mirror of https://github.com/saymrwulf/stable-baselines3.git synced 2026-07-08 17:17:34 +00:00

History

Antonin RAFFIN 944dfdafe4 Update doc: SB3-Contrib (#267 ) * Fix big when saving/loading q-net alone * Rename variables to match SB3-contrib * Update docker image * Set min version for tensorboard * Add SB3-Contrib to doc * Update DQN * Apply suggestions from code review Co-authored-by: Adam Gleave <adam@gleave.me> * Update wording Co-authored-by: Adam Gleave <adam@gleave.me>		2020-12-21 16:17:24 +01:00
..
__init__.py	Init: TD3	2019-09-05 17:29:41 +02:00
test_callbacks.py	Add eval success rate logging (#255 )	2020-12-08 15:49:07 +01:00
test_cnn.py	Use Monitor episode reward/length for `evaluate_policy` (#220 )	2020-11-16 11:52:28 +01:00
test_custom_policy.py	Add custom arch for off-policy actor/critic networks (#182 )	2020-10-13 12:01:33 +02:00
test_deterministic.py	Auto-formatting with black and isort (#97 )	2020-07-16 16:12:16 +02:00
test_distributions.py	Update black version + update docker image (#151 )	2020-08-27 23:02:59 +02:00
test_env_checker.py	add check to ensure action space is non-dict non-tuple for env_checker nan check (#192 )	2020-10-19 00:23:51 +03:00
test_envs.py	Auto-formatting with black and isort (#97 )	2020-07-16 16:12:16 +02:00
test_her.py	Fix bug with full HerReplayBuffer (#236 )	2020-11-20 13:23:03 +01:00
test_identity.py	Use Monitor episode reward/length for `evaluate_policy` (#220 )	2020-11-16 11:52:28 +01:00
test_logger.py	Add support to log videos via tensorboard (#196 )	2020-10-22 11:33:58 +02:00
test_monitor.py	Auto-formatting with black and isort (#97 )	2020-07-16 16:12:16 +02:00
test_predict.py	Fix DQN predict shape for single Gym env (#222 )	2020-11-17 00:43:26 +02:00
test_run.py	Fix off policy features extractor (#198 )	2020-10-27 14:24:59 +01:00
test_save_load.py	Update doc: SB3-Contrib (#267 )	2020-12-21 16:17:24 +01:00
test_sde.py	Implement HER (#120 )	2020-10-22 11:56:43 +02:00
test_spaces.py	Add supported action spaces checks (#254 )	2020-12-06 14:05:10 +02:00
test_tensorboard.py	Auto-formatting with black and isort (#97 )	2020-07-16 16:12:16 +02:00
test_utils.py	Automatically wrap with a Monitor when possible (#237 )	2020-11-20 18:08:00 +02:00
test_vec_check_nan.py	Auto-formatting with black and isort (#97 )	2020-07-16 16:12:16 +02:00
test_vec_envs.py	Use Monitor episode reward/length for `evaluate_policy` (#220 )	2020-11-16 11:52:28 +01:00
test_vec_normalize.py	Use Monitor episode reward/length for `evaluate_policy` (#220 )	2020-11-16 11:52:28 +01:00