stable-baselines3/tests
Rohan Tangri df6f9de8f4
KL Divergence Helper Function (#431)
* add kl divergence wrapper

* add test

* update changelog

* black lint

* remove unused import

* Fix ent coef loading for SAC (#429)

* Fix ent coef loading for SAC

* Better fix and add comment

* add 'distribution' to base Distribution class

* add sample test

* revert to plain pytorch implementation

* black reformat

* Update docs/misc/changelog.rst

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>

* Doc update (custom policy + fix her example) (#436)

* isort and black reformat

* float -> bool tensor

* add sanity test

* more concise kl code

* remove outdated comment

* all -> allclose assertion

* Fix PyTorch warning

* Update gSDE entropy test

* Update entropy test

Co-authored-by: Antonin Raffin <antonin.raffin@dlr.de>
2021-05-20 19:01:07 +02:00
File | Last commit | Date
__init__.py | Init: TD3 | 2019-09-05 17:29:41 +02:00
test_callbacks.py | Dictionary Observations (#243) | 2021-05-11 12:29:30 +02:00
test_cnn.py | Dictionary Observations (#243) | 2021-05-11 12:29:30 +02:00
test_custom_policy.py | Fix discrete obs support (#296) | 2021-01-21 02:42:33 +02:00
test_deterministic.py | Auto-formatting with black and isort (#97) | 2020-07-16 16:12:16 +02:00
test_dict_env.py | Dictionary Observations (#243) | 2021-05-11 12:29:30 +02:00
test_distributions.py | KL Divergence Helper Function (#431) | 2021-05-20 19:01:07 +02:00
test_env_checker.py | add check to ensure action space is non-dict non-tuple for env_checker nan check (#192) | 2020-10-19 00:23:51 +03:00
test_envs.py | Dictionary Observations (#243) | 2021-05-11 12:29:30 +02:00
test_gae.py | Add test for GAE + rename RolloutBuffer.dones for clarification (#375) | 2021-04-16 15:52:55 +02:00
test_her.py | Dictionary Observations (#243) | 2021-05-11 12:29:30 +02:00
test_identity.py | Dictionary Observations (#243) | 2021-05-11 12:29:30 +02:00
test_logger.py | Add Support for Text Records to Logger, Add Hint on How To Access SummaryWriter in Docs. (#303) | 2021-02-01 11:56:33 +01:00
test_monitor.py | Auto-formatting with black and isort (#97) | 2020-07-16 16:12:16 +02:00
test_predict.py | Fix DQN predict shape for single Gym env (#222) | 2020-11-17 00:43:26 +02:00
test_run.py | Fix train_freq at load time (#332) | 2021-02-27 19:53:13 +01:00
test_save_load.py | Fix ent coef loading for SAC (#429) | 2021-05-12 12:21:54 +03:00
test_sde.py | Implement HER (#120) | 2020-10-22 11:56:43 +02:00
test_spaces.py | Fix discrete obs support (#296) | 2021-01-21 02:42:33 +02:00
test_tensorboard.py | Auto-formatting with black and isort (#97) | 2020-07-16 16:12:16 +02:00
test_utils.py | Beta is over =)! V1.0rc0 (#334) | 2021-03-01 13:35:21 +01:00
test_vec_check_nan.py | Auto-formatting with black and isort (#97) | 2020-07-16 16:12:16 +02:00
test_vec_envs.py | Use Monitor episode reward/length for evaluate_policy (#220) | 2020-11-16 11:52:28 +01:00
test_vec_extract_dict_obs.py | Support for VecMonitor for gym3-style environments (#311) | 2021-04-13 18:09:31 +02:00
test_vec_monitor.py | Support for VecMonitor for gym3-style environments (#311) | 2021-04-13 18:09:31 +02:00
test_vec_normalize.py | Dictionary Observations (#243) | 2021-05-11 12:29:30 +02:00