mirror of
https://github.com/saymrwulf/stable-baselines3.git
synced 2026-05-30 23:18:47 +00:00
* Added test cases where off policy algorithms fail with float64 actionspace * casting observations and actions to `np.float32` to unify behaviour between `ReplayBuffer` and `RolloutBuffer`. Fixing issue #1145 * reformatted using black * making test more restrictive by checking models action is float64 * added changelog entry * undo cast of observations as `preprocessing.preprocess_obs()` casts them to float32 anyways. * - Casting to float32 only, if action.dtype is float64 - Added cast to `DictReplayBuffer` as well * Added tests for multiple variations of continuous action types and observation spaces * applied reformatting by `make commit-checks` * Added typing and comment referring to description in merge request * Apply linter for single element slice * Rename helper and refactor tests * Update changelog and docstring --------- Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org> |
||
|---|---|---|
| .. | ||
| envs | ||
| sb2_compat | ||
| vec_env | ||
| __init__.py | ||
| atari_wrappers.py | ||
| base_class.py | ||
| buffers.py | ||
| callbacks.py | ||
| distributions.py | ||
| env_checker.py | ||
| env_util.py | ||
| evaluation.py | ||
| logger.py | ||
| monitor.py | ||
| noise.py | ||
| off_policy_algorithm.py | ||
| on_policy_algorithm.py | ||
| policies.py | ||
| preprocessing.py | ||
| results_plotter.py | ||
| running_mean_std.py | ||
| save_util.py | ||
| torch_layers.py | ||
| type_aliases.py | ||
| utils.py | ||