From a9273db3d919a1488586fcf5ed820d5ad061f6f4 Mon Sep 17 00:00:00 2001
From: saymrwulf
Date: Mon, 10 Feb 2025 17:34:27 +0100
Subject: [PATCH] Update Readme.md

---
 Readme.md | 7 +++++++
 1 file changed, 7 insertions(+)

diff --git a/Readme.md b/Readme.md
index f395431..37bc9b1 100644
--- a/Readme.md
+++ b/Readme.md
@@ -1 +1,8 @@
 # Readme
+
+## RL_countdown_r1zero
+
+Reinforcement learning of a countdown function. Target: R1-Zero base model.
+
+Hint: in the file ./TinyZero/scripts/train_tiny_zero.sh, ADJUST the lines data.train_batch_size=256 \
+data.val_batch_size=1312 \ in the script itself. Passing these overrides on the command line does not work.