| .. |
|
envs
|
Add method predict_values for ActorCriticPolicy (#569)
|
2021-09-15 14:03:04 +02:00 |
|
sb2_compat
|
|
|
|
vec_env
|
VecNormalize: allow non-continuous observations when norm_obs is False (#575)
|
2021-09-18 12:11:01 +02:00 |
|
__init__.py
|
|
|
|
atari_wrappers.py
|
|
|
|
base_class.py
|
Fix set_env to keep the number of timesteps (#615)
|
2021-10-23 16:36:40 +02:00 |
|
buffers.py
|
Corrected DictReplayBuffer observation dtype #484 (#486)
|
2021-06-22 13:41:26 +02:00 |
|
callbacks.py
|
Use consistent logging keys (#605)
|
2021-10-12 13:17:30 +02:00 |
|
distributions.py
|
KL Divergence Helper Function (#431)
|
2021-05-20 19:01:07 +02:00 |
|
env_checker.py
|
Dictionary Observations (#243)
|
2021-05-11 12:29:30 +02:00 |
|
env_util.py
|
Added wrapper_kwargs argument to make_vec_env (#448)
|
2021-05-23 11:33:34 +02:00 |
|
evaluation.py
|
Added support for vector envs in evaluation (#447)
|
2021-05-28 12:40:29 +02:00 |
|
logger.py
|
Fix logger setup (#469)
|
2021-06-14 15:17:48 +02:00 |
|
monitor.py
|
Fix type annotations (#522)
|
2021-07-29 13:02:09 +02:00 |
|
noise.py
|
|
|
|
off_policy_algorithm.py
|
System info helper (#613)
|
2021-10-18 10:43:56 +02:00 |
|
on_policy_algorithm.py
|
Avoid putting target networks into training mode (#553)
|
2021-08-30 17:42:41 +02:00 |
|
policies.py
|
Remove sde_net_arch + Simplify policy (#584)
|
2021-09-28 22:32:54 +03:00 |
|
preprocessing.py
|
Documentation update (#450)
|
2021-05-23 13:13:11 +02:00 |
|
results_plotter.py
|
|
|
|
running_mean_std.py
|
|
|
|
save_util.py
|
System info helper (#613)
|
2021-10-18 10:43:56 +02:00 |
|
torch_layers.py
|
Remove sde_net_arch + Simplify policy (#584)
|
2021-09-28 22:32:54 +03:00 |
|
type_aliases.py
|
Dictionary Observations (#243)
|
2021-05-11 12:29:30 +02:00 |
|
utils.py
|
System info helper (#613)
|
2021-10-18 10:43:56 +02:00 |