Commit graph

894 commits

Author SHA1 Message Date
0354548bd0 Implementing approach from a new paper read last night (stable-baselines3) 2014-07-28 04:09:00 +02:00
07a2f70377 Quick fix, referencing a known issue from the official repo (stable-baselines3) 2014-07-25 07:14:00 +02:00
0c3dc2ee4d Refactor for clarity, might break a few tests though (stable-baselines3) 2014-07-25 23:22:00 +02:00
2436e0d3ee Testing bigger LLM config, referencing 'Attention Is All You Need' (stable-baselines3) 2014-07-25 02:19:00 +02:00
86355791ab Experimenting with FPGA constraints (source: Trimberger 'Three Ages of FPGAs') (stable-baselines3) 2014-07-14 20:12:00 +02:00
68772098de Experimenting with FPGA constraints (source: Trimberger 'Three Ages of FPGAs') (stable-baselines3) 2014-07-14 06:47:00 +02:00
2ad731d5d7 Trying out boneh-franklin approach for IBE (ref. 2003 paper) (stable-baselines3) 2014-04-01 05:31:00 +02:00
e1e96bf4e5 Trying out boneh-franklin approach for IBE (ref. 2003 paper) (stable-baselines3) 2014-04-01 08:54:00 +02:00
7b30044e06 Quick fix, referencing a known issue from the official repo (stable-baselines3) 2014-02-19 00:44:00 +01:00
c6ff923898 Testing bigger LLM config, referencing 'Attention Is All You Need' (stable-baselines3) 2014-02-19 00:22:00 +01:00
651702772e Testing bigger LLM config, referencing 'Attention Is All You Need' (stable-baselines3) 2014-02-19 17:05:00 +01:00
aeeb674a1c Experimenting with FPGA constraints (source: Trimberger 'Three Ages of FPGAs') (stable-baselines3) 2014-02-19 05:01:00 +01:00
111ac7786e Minor doc updates: linking to article on quantization (stable-baselines3) 2014-02-16 19:06:00 +01:00
Paul Stahlhofen
c5c29a32d9
Clarify the use of Gym wrappers with make_vec_env (#2079)
* Added a note to the Vectorized Environments documentation showing that sub-environments can be wrapped with `make_vec_env` (see #2075)

* Add example

---------

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2025-02-07 12:04:48 +01:00
Antonin RAFFIN
b8b2d30a83
Add has_attr for VecEnv (#2077)
* Add `has_attr` for `VecEnv`

* Add special case for gymnasium<1.0

* Update changelog.rst

* Update black version
2025-02-03 10:43:56 +01:00
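The `has_attr` addition above can be pictured with a small pure-Python sketch. `ToyEnv` and `ToyVecEnv` are hypothetical stand-ins, not Stable-Baselines3 classes, and building the check on top of `get_attr` with a try/except is an assumption made for illustration, not a statement of how the library implements it.

```python
class ToyEnv:
    """Hypothetical sub-environment carrying one custom attribute."""

    def __init__(self, difficulty):
        self.difficulty = difficulty


class ToyVecEnv:
    """Hypothetical stand-in for a vectorized environment (not the SB3 class)."""

    def __init__(self, envs):
        self.envs = list(envs)

    def get_attr(self, attr_name):
        # Raises AttributeError if any sub-environment lacks the attribute.
        return [getattr(env, attr_name) for env in self.envs]

    def has_attr(self, attr_name):
        # Assumed approach: probe with get_attr and turn the failure into False.
        try:
            self.get_attr(attr_name)
            return True
        except AttributeError:
            return False


vec_env = ToyVecEnv([ToyEnv(1), ToyEnv(2)])
print(vec_env.has_attr("difficulty"))
print(vec_env.has_attr("missing"))
```

A boolean check like this lets calling code branch on whether an attribute exists without wrapping every `get_attr` call in its own exception handling.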
Antonin RAFFIN
ee8a77defb
Release v2.5.0 (#2074)
* Update Dockerfile

* Release v2.5.0

* Fix docker GPU parent

* Fix changelog
2025-01-27 12:50:17 +01:00
Antonin RAFFIN
f8ea2995cb
Doc update: custom envs, IsaacLab, Brax and dm_control (#2072)
* Add note about start!=0 for Discrete spaces

* Update doc for IsaacLab and dm_control

* Fix test due to rounding error
2025-01-26 11:42:57 +01:00
Yufeng Gao
d055a2e2af
fix docs atari example by import ale_py (#2071)
* fix docs atari example by import ale_py

* Update changelog

---------

Co-authored-by: Antonin Raffin <antonin.raffin@dlr.de>
2025-01-21 10:53:42 +01:00
San
b7c64a1aa4
Simplify GPU detection in docker launch script (#2064)
Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2025-01-07 14:58:28 +01:00
Antonin RAFFIN
2b529e57a0
Fix tests and warnings when running locally with a GPU (#2069)
* Fix test when GPU is available

* Sort file list for consistent results

* Ignore A2C warnings too
2025-01-07 14:19:05 +01:00
Antonin RAFFIN
dba0baa491
Fix mypy error (#2067)
* Fix mypy error

* Ignore new errors
2025-01-07 11:57:54 +01:00
Antonin RAFFIN
57e8b97df5
Fix video recorder and add test (#2063)
* Fix video recorder and add test

* Update github CI

* Install ffmpeg

* Revert "Update github CI"

This reverts commit 07791e97fccae4f003b2909428b23f59557d7034.

* Skip VecVideoRecorder test on github
2024-12-21 08:24:25 +01:00
Marc Duclusaud
0fd0db0b7b
FRASA project indent fix (#2060)
Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2024-12-17 17:18:06 +01:00
cgaspard3333
93778c3de4
Add FootstepNet Envs to doc project page (#2058)
* Add FootstepNet Envs to doc projects page

* Update Changelog + typo

* Update FootstepNet description

---------

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2024-12-17 16:47:19 +01:00
Marc Duclusaud
f432a6fcdc
Adding FRASA to the projects page (#2059)
* Adding FRASA to the projects page

* Updating changelog.rst

* Ignore mypy errors for np arrays (python 3.11+)

---------

Co-authored-by: Antonin Raffin <antonin.raffin@dlr.de>
2024-12-17 14:53:07 +01:00
kplers
9caa168686
Add policy documentation links to policy_kwargs parameter (#2050)
* docs: Add policy documentation links to policy_kwargs parameter

* Fix missing references, update changelog

---------

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2024-12-02 14:40:05 +01:00
Antonin RAFFIN
897d01d225
Update PyBullet example (#2049) 2024-11-29 14:58:09 +01:00
Antonin RAFFIN
98366922b4
Fix linter warnings (order __all__) (#2048) 2024-11-29 13:55:18 +01:00
James MacGlashan
8a3e3ccb4e
Add Decisions and Dragons site to resources (#2044)
* add dnd site to resources

.

add info

* add to changelog
2024-11-22 23:02:13 +01:00
Antonin RAFFIN
daaebd0a52
Drop python 3.8 and add python 3.12 support (#2041)
* Drop python 3.8 support, add python 3.12 support

* Upgrade to python 3.9 syntax

* Fixes for Numpy v2

* Fix doc warning
2024-11-18 15:40:36 +01:00
Antonin RAFFIN
020ee42f4d
Release 2.4.0 (#2040) 2024-11-18 11:03:03 +01:00
Antonin RAFFIN
e4f4f123e3
Add note about SAC ent coeff optimization (#2037)
* Allow new sphinx version

* Add note about SAC ent coeff and add DQN tutorial link
2024-11-08 11:01:04 +01:00
Mark Towers
8f0b488bc5
Update Gymnasium to v1.0.0 (#1837)
* Update Gymnasium to v1.0.0a1

* Comment out `gymnasium.wrappers.monitor` (todo update to VideoRecord)

* Fix ruff warnings

* Register Atari envs

* Update `getattr` to `Env.get_wrapper_attr`

* Reorder imports

* Fix `seed` order

* Fix collecting `max_steps`

* Copy and paste video recorder to avoid rewriting the vec video recorder wrapper

* Use `typing.List` rather than list

* Fix env attribute forwarding

* Separate out env attribute collection from its utilisation

* Update for Gymnasium alpha 2

* Remove assert for OrderedDict

* Update setup.py

* Add type: ignore

* Test with Gymnasium main

* Remove `gymnasium.logger.debug/info`

* Fix github CI yaml

* Run gym 0.29.1 on python 3.10

* Update lower bounds

* Integrate video recorder

* Remove ordered dict

* Update changelog

---------

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2024-11-04 12:03:12 +01:00
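The switch from `getattr` to `Env.get_wrapper_attr` in the commit above concerns looking up an attribute through a chain of wrappers. The sketch below illustrates the idea in pure Python. `BaseEnv`, `Wrapper`, and `TimeLimit` here are hypothetical toy classes, not Gymnasium's, and the lookup order (the wrapper first, then the wrapped environment) is an assumption made for illustration.

```python
class BaseEnv:
    """Hypothetical innermost environment."""

    max_steps = 200

    def get_wrapper_attr(self, name):
        # Innermost level: plain attribute lookup, AttributeError if missing.
        return getattr(self, name)


class Wrapper:
    """Hypothetical wrapper that does NOT forward attributes implicitly."""

    def __init__(self, env):
        self.env = env

    def get_wrapper_attr(self, name):
        # Check this wrapper first, then delegate to the wrapped environment.
        if name in self.__dict__ or hasattr(type(self), name):
            return getattr(self, name)
        return self.env.get_wrapper_attr(name)


class TimeLimit(Wrapper):
    """Hypothetical wrapper adding its own attribute."""

    def __init__(self, env, limit):
        super().__init__(env)
        self.limit = limit


env = TimeLimit(Wrapper(BaseEnv()), limit=50)
print(env.get_wrapper_attr("max_steps"))  # found on the innermost env
print(env.get_wrapper_attr("limit"))      # found on the outer wrapper
print(hasattr(env, "max_steps"))          # plain lookup does not see through wrappers
```

In this toy setup a plain `getattr` on the outer wrapper cannot see `max_steps`, which is why an explicit chain-walking lookup is needed once wrappers stop forwarding attributes.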
Antonin RAFFIN
dd3d0acf15
Update readme and clarify planned features (#2030)
* Update readme and clarify planned features

* Fix rtd python version

* Fix pip version for rtd

* Update rtd ubuntu and mambaforge

* Add upper bound for gymnasium

* [ci skip] Update readme
2024-10-29 12:23:13 +01:00
Antonin RAFFIN
3d59b5c86b
Use uv on GitHub CI for faster download and update changelog (#2026)
* Use uv on GitHub CI for faster download and update changelog

* Fix new mypy issues
2024-10-24 15:20:05 +02:00
Devin White
56c153f048
Add warning when using PPO on GPU and update doc (#2017)
* Update documentation

Added a note to the PPO documentation, with sample code, that the CPU should primarily be used unless the policy is a CNN. Added a warning for both PPO and A2C that the CPU should be used when the user runs on a GPU without a CNN (see issue #1245).

* Add warning to base class and add test

---------

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2024-10-07 11:24:47 +02:00
Antonin RAFFIN
512eea923a
Warn users when using multi-dim MultiDiscrete obs space (#2003)
* Update env checker to warn users when using multi-dim MultiDiscrete obs space

* Update changelog
2024-09-13 13:15:23 +02:00
Antonin RAFFIN
9a3b28bb9f
[ci skip] Update README.md, fix image display 2024-08-23 08:58:43 +02:00
Antonin RAFFIN
4a7631b71d
Fix test device for buffers (#1993)
* Prevent test_device from being a noop

* Update changelog

---------

Co-authored-by: Adrià Garriga-Alonso <adria@far.ai>
2024-08-18 12:33:22 +02:00
Jan-Hendrik Ewers
4a1137ba3a
Add np.ndarray as a recognized type for TB histograms. (#1635)
* Add np.ndarray as a recognized type for TB histograms.

Torch histograms allow th.Tensor, np.ndarray, and caffe2 formatted strings. This commit expands the TensorBoardOutputFormat's capabilities to log the first two types.

* Update changelog to reflect bug fix

* fix: try/catch for if either np or torch aren't at the required versions. See https://github.com/DLR-RM/stable-baselines3/pull/1635 for more details

* fix: Add comment describing the test for when add_histogram should not have been called

* Cleanup

---------

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2024-08-02 11:55:27 +02:00
Chris Schindlbeck
6ad6fa55b6
Fix various typos (#1981) 2024-07-29 10:44:23 +02:00
Antonin RAFFIN
bd3c0c6530
Fix loading of optimizer with older DQN models (#1978) 2024-07-26 14:57:55 +02:00
Antonin RAFFIN
000544cc1f
Add support for pre and post linear modules in create_mlp (#1975)
* Add support for pre and post linear modules in `create_mlp`

* Disable mypy for python 3.8

* Reformat toml file

* Update docstring

Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>

* Add some comments

---------

Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>
2024-07-22 13:42:33 +02:00
Quentin Gallouédec
1a69fc8314
Update examples.rst (#1969) 2024-07-15 23:57:24 +02:00
Corentin
d8148deeaa
Updated DQN optimizer input to only include q_network parameters as input (#1963)
* Updated DQN optimizer input to only include q_network parameters

* Update version

---------

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2024-07-05 19:07:55 +02:00
Sahit Chintalapudi
0eebde7ca1
Fix typo in examples.rst (#1962)
The variable `env` is not defined. The gym env we want to change is `vec_env`
2024-07-05 15:00:48 +02:00
Dominik Baron
24ebf1a1df
Remove unnecessary SDE resampling in PPO update (#1933)
* Remove unnecessary SDE resampling in PPO update

* Update changelog.rst

* Update version

* Update PyTorch version on CI

* Update ruff

* Limit NumPy version

* Reformat

---------

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2024-06-29 20:07:32 +02:00
will-maclean
4efee92fba
Set CallbackList children's parent correctly (#1939)
* Fixing #1791

* Update test and version

* Add test for callback after eval

* Fix mypy error

* Remove tqdm warnings

---------

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2024-06-07 14:07:28 +02:00
Joe Ksiazek
0b06d8ab20
Fix error when loading a model that has net_arch manually set to None (#1937)
* Fix loading a model with net_arch=None

* Remove redundant get

* Dummy commit

* Add to contributors

* Update test and version

---------

Co-authored-by: Antonin Raffin <antonin.raffin@ensta.org>
2024-06-05 17:27:40 +02:00
Ole Petersen
6c00565778
Fix memory leak in base_class.py (#1908)
* Fix memory leak in base_class.py

Loading the data return value is unnecessary because it is unused, and loading it causes a memory leak through the ep_info_buffer variable. I found this while loading a PPO learner from storage on a multi-GPU system: the ep_info_buffer is loaded to the memory location it occupied when it was saved to disk, rather than to the target loading location, and is then not cleaned up.

* Update changelog.rst

* Update changelog

---------

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
2024-05-15 15:59:32 +02:00