| .. |
|
sb2_compat
|
Add callable signatures to type annotations. (#215)
|
2020-11-15 17:50:28 +01:00 |
|
vec_env
|
Use Monitor episode reward/length for evaluate_policy (#220)
|
2020-11-16 11:52:28 +01:00 |
|
__init__.py
|
Update docs (custom policy, type hints) (#167)
|
2020-09-29 20:41:14 +03:00 |
|
atari_wrappers.py
|
Implement HER (#120)
|
2020-10-22 11:56:43 +02:00 |
|
base_class.py
|
Automatically wrap with a Monitor when possible (#237)
|
2020-11-20 18:08:00 +02:00 |
|
bit_flipping_env.py
|
Implement HER (#120)
|
2020-10-22 11:56:43 +02:00 |
|
buffers.py
|
Small docstring improvements related to the notion of Rollout (#206)
|
2020-11-02 11:45:08 +01:00 |
|
callbacks.py
|
Doc fixes and add monitor_kwargs parameter (#230)
|
2020-11-20 10:28:54 +01:00 |
|
cmd_util.py
|
Fix off policy features extractor (#198)
|
2020-10-27 14:24:59 +01:00 |
|
distributions.py
|
Fix off policy features extractor (#198)
|
2020-10-27 14:24:59 +01:00 |
|
env_checker.py
|
add check to ensure action space is non-dict non-tuple for env_checker nan check (#192)
|
2020-10-19 00:23:51 +03:00 |
|
env_util.py
|
Doc fixes and add monitor_kwargs parameter (#230)
|
2020-11-20 10:28:54 +01:00 |
|
evaluation.py
|
Use Monitor episode reward/length for evaluate_policy (#220)
|
2020-11-16 11:52:28 +01:00 |
|
identity_env.py
|
Avoid transposing channel-first envs (#213)
|
2020-11-03 12:34:09 +01:00 |
|
logger.py
|
Add support to log videos via tensorboard (#196)
|
2020-10-22 11:33:58 +02:00 |
|
monitor.py
|
Improve typing coverage (#175)
|
2020-10-07 10:51:49 +02:00 |
|
noise.py
|
Improve typing coverage (#175)
|
2020-10-07 10:51:49 +02:00 |
|
off_policy_algorithm.py
|
Doc fixes and add monitor_kwargs parameter (#230)
|
2020-11-20 10:28:54 +01:00 |
|
on_policy_algorithm.py
|
Doc fixes and add monitor_kwargs parameter (#230)
|
2020-11-20 10:28:54 +01:00 |
|
policies.py
|
Doc fixes and add monitor_kwargs parameter (#230)
|
2020-11-20 10:28:54 +01:00 |
|
preprocessing.py
|
Avoid transposing channel-first envs (#213)
|
2020-11-03 12:34:09 +01:00 |
|
results_plotter.py
|
Cleanup docstring types (#169)
|
2020-10-02 20:05:55 +03:00 |
|
running_mean_std.py
|
Cleanup docstring types (#169)
|
2020-10-02 20:05:55 +03:00 |
|
save_util.py
|
Fix for saving big replay buffer, use pickle protocol>=4 (#239)
|
2020-11-24 16:13:00 +02:00 |
|
torch_layers.py
|
Avoid transposing channel-first envs (#213)
|
2020-11-03 12:34:09 +01:00 |
|
type_aliases.py
|
Doc fixes and add monitor_kwargs parameter (#230)
|
2020-11-20 10:28:54 +01:00 |
|
utils.py
|
Doc fixes and add monitor_kwargs parameter (#230)
|
2020-11-20 10:28:54 +01:00 |