Update Readme.md

This commit is contained in:
saymrwulf 2025-02-10 17:34:27 +01:00 committed by GitHub
parent 82b9c8b0e5
commit a9273db3d9
No known key found for this signature in database
GPG key ID: B5690EEEBB952194

View file

@ -1 +1,8 @@
# Readme
## RL_countdown_r1zero
Reinforcement Learning of a countdown function . Target: R1-Zero Base Model
Hint: the file ./TinyZero/scripts/train_tiny_zero.sh --> data.train_batch_size=256 \
data.val_batch_size=1312 \ ! ADJUST ! Commandline handover of override doe not work .