stable-baselines3/stable_baselines3/common
Rohan Tangri df6f9de8f4
KL Divergence Helper Function (#431)
* add kl divergence wrapper

* add test

* update changelog

* black lint

* remove unused import

* Fix ent coef loading for SAC (#429)

* Fix ent coef loading for SAC

* Better fix and add comment

* add 'distribution' to base Distribution class

* add sample test

* revert to plain pytorch implementation

* black reformat

* Update docs/misc/changelog.rst

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>

* Doc update (custom policy + fix her example) (#436)

* isort and black reformat

* float -> bool tensor

* add sanity test

* more concise kl code

* remove outdated comment

* all -> allclose assertion

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>

* Fix PyTorch warning

* Update gSDE entropy test

* Update entropy test

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
Co-authored-by: Antonin Raffin <antonin.raffin@dlr.de>
2021-05-20 19:01:07 +02:00
..
envs Dictionary Observations (#243) 2021-05-11 12:29:30 +02:00
sb2_compat Add callable signatures to type annotations. (#215) 2020-11-15 17:50:28 +01:00
vec_env Dictionary Observations (#243) 2021-05-11 12:29:30 +02:00
__init__.py Update docs (custom policy, type hints) (#167) 2020-09-29 20:41:14 +03:00
atari_wrappers.py Add SUMO-RL as example project in the docs (#257) 2020-12-13 17:15:45 +01:00
base_class.py Fix ent coef loading for SAC (#429) 2021-05-12 12:21:54 +03:00
buffers.py Dictionary Observations (#243) 2021-05-11 12:29:30 +02:00
callbacks.py Fix numpy warning and update migration guide (#307) 2021-02-01 11:24:44 +01:00
distributions.py KL Divergence Helper Function (#431) 2021-05-20 19:01:07 +02:00
env_checker.py Dictionary Observations (#243) 2021-05-11 12:29:30 +02:00
env_util.py Doc fixes and add monitor_kwargs parameter (#230) 2020-11-20 10:28:54 +01:00
evaluation.py Support for VecMonitor for gym3-style environments (#311) 2021-04-13 18:09:31 +02:00
logger.py Add Support for Text Records to Logger, Add Hint on How To Access SummaryWriter in Docs. (#303) 2021-02-01 11:56:33 +01:00
monitor.py Support for VecMonitor for gym3-style environments (#311) 2021-04-13 18:09:31 +02:00
noise.py Improve typing coverage (#175) 2020-10-07 10:51:49 +02:00
off_policy_algorithm.py Dictionary Observations (#243) 2021-05-11 12:29:30 +02:00
on_policy_algorithm.py Dictionary Observations (#243) 2021-05-11 12:29:30 +02:00
policies.py Dictionary Observations (#243) 2021-05-11 12:29:30 +02:00
preprocessing.py Dictionary Observations (#243) 2021-05-11 12:29:30 +02:00
results_plotter.py Fix default arguments + add bugbear (#363) 2021-03-25 11:35:21 +02:00
running_mean_std.py Cleanup docstring types (#169) 2020-10-02 20:05:55 +03:00
save_util.py Add custom objects support + bug fix (#336) 2021-03-06 15:17:43 +02:00
torch_layers.py Dictionary Observations (#243) 2021-05-11 12:29:30 +02:00
type_aliases.py Dictionary Observations (#243) 2021-05-11 12:29:30 +02:00
utils.py Dictionary Observations (#243) 2021-05-11 12:29:30 +02:00