mirror of
https://github.com/saymrwulf/timeseries.git
synced 2026-05-14 20:58:05 +00:00
Update Readme.md
This commit is contained in:
parent
82b9c8b0e5
commit
a9273db3d9
1 changed files with 7 additions and 0 deletions
|
|
@ -1 +1,8 @@
|
|||
# Readme
|
||||
|
||||
## RL_countdown_r1zero
|
||||
|
||||
Reinforcement Learning of a countdown function . Target: R1-Zero Base Model
|
||||
|
||||
Hint: the file ./TinyZero/scripts/train_tiny_zero.sh --> data.train_batch_size=256 \
|
||||
data.val_batch_size=1312 \ ! ADJUST ! Commandline handover of override doe not work .
|
||||
|
|
|
|||
Loading…
Reference in a new issue