Mirror of https://github.com/saymrwulf/stable-baselines3.git (synced 2026-05-31 23:28:05 +00:00)
Update custom_policy.rst (#1183)
* Update custom_policy.rst
* Update changelog

Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>
Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
Co-authored-by: Antonin Raffin <antonin.raffin@dlr.de>
parent 002850f8ac
commit f7d7ed3fa7
2 changed files with 4 additions and 3 deletions
@@ -333,11 +333,11 @@ If your task requires even more granular control over the policy/value architect
         :return: (th.Tensor, th.Tensor) latent_policy, latent_value of the specified network.
             If all layers are shared, then ``latent_policy == latent_value``
         """
-        return self.policy_net(features), self.value_net(features)
+        return self.forward_actor(features), self.forward_critic(features)

     def forward_actor(self, features: th.Tensor) -> th.Tensor:
         return self.policy_net(features)

     def forward_critic(self, features: th.Tensor) -> th.Tensor:
         return self.value_net(features)
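For context, the change in this hunk can be sketched as a small standalone PyTorch module in which `forward` delegates to `forward_actor` and `forward_critic` rather than calling the sub-networks directly. This is a minimal sketch, not the documentation's full example: the class name `CustomNetwork`, the constructor arguments, and the layer sizes are assumptions made for illustration, and only the three method bodies are taken from the diff.

```python
from typing import Tuple

import torch as th
from torch import nn


class CustomNetwork(nn.Module):
    """Illustrative actor/critic network; the name and layer sizes are assumptions."""

    def __init__(self, feature_dim: int, pi_dim: int = 64, vf_dim: int = 64):
        super().__init__()
        # Separate (non-shared) heads for the policy and the value function.
        self.policy_net = nn.Sequential(nn.Linear(feature_dim, pi_dim), nn.ReLU())
        self.value_net = nn.Sequential(nn.Linear(feature_dim, vf_dim), nn.ReLU())

    def forward(self, features: th.Tensor) -> Tuple[th.Tensor, th.Tensor]:
        # Delegate to the per-head methods, as in the updated line of the diff.
        return self.forward_actor(features), self.forward_critic(features)

    def forward_actor(self, features: th.Tensor) -> th.Tensor:
        return self.policy_net(features)

    def forward_critic(self, features: th.Tensor) -> th.Tensor:
        return self.value_net(features)


# Hypothetical input: a batch of 4 feature vectors of size 8.
features = th.randn(4, 8)
net = CustomNetwork(feature_dim=8)
latent_pi, latent_vf = net(features)
```

One consequence of the delegation is that a subclass overriding only `forward_actor` or `forward_critic` changes what `forward` returns as well, so the two code paths cannot drift apart.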
@@ -50,6 +50,7 @@ Documentation:
 ^^^^^^^^^^^^^^
 - Updated Hugging Face Integration page (@simoninithomas)
 - Changed ``env`` to ``vec_env`` when environment is vectorized
+- Update custom policy documentation (@athatheo)

 Release 1.6.2 (2022-10-10)
 --------------------------