stable-baselines3

mirror of https://github.com/saymrwulf/stable-baselines3.git synced 2026-07-13 18:08:39 +00:00

History

Anssi 18d10dbf42 Use Monitor episode reward/length for `evaluate_policy` (#220 ) * Update evaluate_policy to use monitor data if available * Update documentation * Cleaning up * Remove unnecessary typing trickery * Update doc * Rename is_wrapped to clarify it is for vecenvs * Add is_wrapped for regular envs * Add is_wrapped call for subprocvecenv and update code for circular imports * Move new functions back to env_util and fix imports * Update changelog * Clarify evaluate_policy docs * Add tests for wrapped modifying episode lengths * Fix tests * Update changelog * Minor edits * Add warn switch to evaluate_policy and update tests Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>		2020-11-16 11:52:28 +01:00
..
__init__.py	Init: TD3	2019-09-05 17:29:41 +02:00
test_callbacks.py	Use Monitor episode reward/length for `evaluate_policy` (#220 )	2020-11-16 11:52:28 +01:00
test_cnn.py	Use Monitor episode reward/length for `evaluate_policy` (#220 )	2020-11-16 11:52:28 +01:00
test_custom_policy.py	Add custom arch for off-policy actor/critic networks (#182 )	2020-10-13 12:01:33 +02:00
test_deterministic.py	Auto-formatting with black and isort (#97 )	2020-07-16 16:12:16 +02:00
test_distributions.py	Update black version + update docker image (#151 )	2020-08-27 23:02:59 +02:00
test_env_checker.py	add check to ensure action space is non-dict non-tuple for env_checker nan check (#192 )	2020-10-19 00:23:51 +03:00
test_envs.py	Auto-formatting with black and isort (#97 )	2020-07-16 16:12:16 +02:00
test_her.py	Implement HER (#120 )	2020-10-22 11:56:43 +02:00
test_identity.py	Use Monitor episode reward/length for `evaluate_policy` (#220 )	2020-11-16 11:52:28 +01:00
test_logger.py	Add support to log videos via tensorboard (#196 )	2020-10-22 11:33:58 +02:00
test_monitor.py	Auto-formatting with black and isort (#97 )	2020-07-16 16:12:16 +02:00
test_predict.py	Allow to set a device when loading a model (#154 )	2020-09-20 19:13:18 +02:00
test_run.py	Fix off policy features extractor (#198 )	2020-10-27 14:24:59 +01:00
test_save_load.py	Fix env loading (#203 )	2020-10-27 23:12:52 +02:00
test_sde.py	Implement HER (#120 )	2020-10-22 11:56:43 +02:00
test_spaces.py	Use Monitor episode reward/length for `evaluate_policy` (#220 )	2020-11-16 11:52:28 +01:00
test_tensorboard.py	Auto-formatting with black and isort (#97 )	2020-07-16 16:12:16 +02:00
test_utils.py	Use Monitor episode reward/length for `evaluate_policy` (#220 )	2020-11-16 11:52:28 +01:00
test_vec_check_nan.py	Auto-formatting with black and isort (#97 )	2020-07-16 16:12:16 +02:00
test_vec_envs.py	Use Monitor episode reward/length for `evaluate_policy` (#220 )	2020-11-16 11:52:28 +01:00
test_vec_normalize.py	Use Monitor episode reward/length for `evaluate_policy` (#220 )	2020-11-16 11:52:28 +01:00