mirror of
https://github.com/saymrwulf/timeseries.git
synced 2026-05-14 20:58:05 +00:00
RL_countdown_r1zero
Reinforcement learning of a countdown function. Target: R1-Zero base model.
Hint: in the file ./TinyZero/scripts/train_tiny_zero.sh, the settings data.train_batch_size=256 and data.val_batch_size=1312 must be ADJUSTED directly in the script. Passing these overrides on the command line does not work.
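Because the values have to be changed inside the script itself, a small helper can do the edit in place. The following is only a sketch: the function name set_batch_sizes is hypothetical, and it assumes the script contains the two keys in the plain key=number form quoted in the hint.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of TinyZero): rewrite both batch-size
# settings inside a training script.
# Usage: set_batch_sizes <script> <train_batch_size> <val_batch_size>
set_batch_sizes() {
  local script="$1" train="$2" val="$3"
  local tmp
  tmp="$(mktemp)" || return 1
  # Replace only the number after each key; the rest of the line
  # (for example a trailing " \") is left untouched.
  sed -E \
    -e "s/(data\.train_batch_size=)[0-9]+/\1${train}/" \
    -e "s/(data\.val_batch_size=)[0-9]+/\1${val}/" \
    "$script" > "$tmp" || { rm -f "$tmp"; return 1; }
  # Write back with cat so the script keeps its permissions (executable bit).
  cat "$tmp" > "$script"
  rm -f "$tmp"
}
```

It could be called as `set_batch_sizes ./TinyZero/scripts/train_tiny_zero.sh 128 640`, where 128 and 640 are placeholder values only; pick sizes that fit your hardware and dataset.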