Mirror of https://github.com/saymrwulf/stable-baselines3.git (synced 2026-05-18 21:30:19 +00:00)
* make sure DQN policy is always in correct mode - train or eval
* make set_training_mode an abstract method of the base policy - safer
* update docstring of _build method to note that the target network is put into eval mode
* use set_training_mode to put the dqn target network into eval mode
* use set_training_mode to set the training model of the q-network
* move set_training_mode abstract method from BasePolicy to BaseModel
* set train and eval mode for TD3
* make sure critic is always in correct mode during train
* set train and eval mode for SAC
* add comment re batch norm and dropout
* set train and eval mode for A2C and PPO
* add tests for collect rollouts with batch norm
* fix formatting
* update change log
* update version
* remove Optional typing for batch size - causing type check to fail
* Fix scipy dependency for toy text envs
* implement set_training_mode method in BaseModel
* move all tests of train/eval mode to test_train_eval_mode
* call learn with learning_starts = total_timesteps to test that collect_rollouts does not update batch norm
* remove extra calls to set_training_mode in train method of TD3 and SAC
* Allow gradient_steps=0
* Refactor tests
* Add comment + use aliases
* Typos

Co-authored-by: Antonin Raffin <antonin.raffin@ensta.org>

| Name | | |
|---|---|---|
| __init__.py | ||
| test_callbacks.py | ||
| test_cnn.py | ||
| test_custom_policy.py | ||
| test_deterministic.py | ||
| test_dict_env.py | ||
| test_distributions.py | ||
| test_env_checker.py | ||
| test_envs.py | ||
| test_gae.py | ||
| test_her.py | ||
| test_identity.py | ||
| test_logger.py | ||
| test_monitor.py | ||
| test_predict.py | ||
| test_run.py | ||
| test_save_load.py | ||
| test_sde.py | ||
| test_spaces.py | ||
| test_tensorboard.py | ||
| test_train_eval_mode.py | ||
| test_utils.py | ||
| test_vec_check_nan.py | ||
| test_vec_envs.py | ||
| test_vec_extract_dict_obs.py | ||
| test_vec_monitor.py | ||
| test_vec_normalize.py | ||
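
The commit message above notes that rollouts should be collected without updating batch norm statistics, and that the mode is switched through `set_training_mode`. The following is a minimal pure-Python sketch of why that matters. It assumes a toy layer: `ToyBatchNorm` is hypothetical and is not part of stable-baselines3 or PyTorch, and only the method name `set_training_mode` is borrowed from the commit message.

```python
from typing import List


class ToyBatchNorm:
    """Toy stand-in for a batch-norm layer (hypothetical, for illustration only)."""

    def __init__(self, momentum: float = 0.1) -> None:
        self.momentum = momentum
        self.running_mean = 0.0
        self.training = True

    def set_training_mode(self, mode: bool) -> None:
        # True: training mode (statistics are updated); False: eval mode.
        self.training = mode

    def forward(self, batch: List[float]) -> List[float]:
        if self.training:
            # Training mode: center with the batch mean and update the running mean.
            batch_mean = sum(batch) / len(batch)
            self.running_mean = (
                (1 - self.momentum) * self.running_mean + self.momentum * batch_mean
            )
            mean = batch_mean
        else:
            # Eval mode: use the stored running mean and leave it untouched.
            mean = self.running_mean
        return [x - mean for x in batch]


layer = ToyBatchNorm()

# Rollout collection: eval mode, so observations do not shift the statistics.
layer.set_training_mode(False)
layer.forward([1.0, 2.0, 3.0])
mean_after_rollout = layer.running_mean

# Gradient update: training mode, so the running mean moves toward the batch mean.
layer.set_training_mode(True)
layer.forward([1.0, 2.0, 3.0])
mean_after_update = layer.running_mean
```

In this sketch the running mean changes only during the call made in training mode, which is the property the batch norm tests listed in the commit message are described as checking for `collect_rollouts`.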