stable-baselines3/tests
Antonin RAFFIN 944dfdafe4
Update doc: SB3-Contrib (#267)
* Fix big when saving/loading q-net alone

* Rename variables to match SB3-contrib

* Update docker image

* Set min version for tensorboard

* Add SB3-Contrib to doc

* Update DQN

* Apply suggestions from code review

Co-authored-by: Adam Gleave <adam@gleave.me>

* Update wording

Co-authored-by: Adam Gleave <adam@gleave.me>
2020-12-21 16:17:24 +01:00
..
__init__.py Init: TD3 2019-09-05 17:29:41 +02:00
test_callbacks.py Add eval success rate logging (#255) 2020-12-08 15:49:07 +01:00
test_cnn.py Use Monitor episode reward/length for evaluate_policy (#220) 2020-11-16 11:52:28 +01:00
test_custom_policy.py Add custom arch for off-policy actor/critic networks (#182) 2020-10-13 12:01:33 +02:00
test_deterministic.py Auto-formatting with black and isort (#97) 2020-07-16 16:12:16 +02:00
test_distributions.py Update black version + update docker image (#151) 2020-08-27 23:02:59 +02:00
test_env_checker.py add check to ensure action space is non-dict non-tuple for env_checker nan check (#192) 2020-10-19 00:23:51 +03:00
test_envs.py Auto-formatting with black and isort (#97) 2020-07-16 16:12:16 +02:00
test_her.py Fix bug with full HerReplayBuffer (#236) 2020-11-20 13:23:03 +01:00
test_identity.py Use Monitor episode reward/length for evaluate_policy (#220) 2020-11-16 11:52:28 +01:00
test_logger.py Add support to log videos via tensorboard (#196) 2020-10-22 11:33:58 +02:00
test_monitor.py Auto-formatting with black and isort (#97) 2020-07-16 16:12:16 +02:00
test_predict.py Fix DQN predict shape for single Gym env (#222) 2020-11-17 00:43:26 +02:00
test_run.py Fix off policy features extractor (#198) 2020-10-27 14:24:59 +01:00
test_save_load.py Update doc: SB3-Contrib (#267) 2020-12-21 16:17:24 +01:00
test_sde.py Implement HER (#120) 2020-10-22 11:56:43 +02:00
test_spaces.py Add supported action spaces checks (#254) 2020-12-06 14:05:10 +02:00
test_tensorboard.py Auto-formatting with black and isort (#97) 2020-07-16 16:12:16 +02:00
test_utils.py Automatically wrap with a Monitor when possible (#237) 2020-11-20 18:08:00 +02:00
test_vec_check_nan.py Auto-formatting with black and isort (#97) 2020-07-16 16:12:16 +02:00
test_vec_envs.py Use Monitor episode reward/length for evaluate_policy (#220) 2020-11-16 11:52:28 +01:00
test_vec_normalize.py Use Monitor episode reward/length for evaluate_policy (#220) 2020-11-16 11:52:28 +01:00