notebooks
notebooks copied to clipboard
GRPO results not match the dataset
I use the Gemma 3 (1B), pick some question from the openai/gsm8k, the result is pretty bad compare Gemma 3 (1B) without fine-tune.
I run it with llama-cli -m ./gemma-3-finetune.Q8_0.gguf, is there something I do wrong ?
I also try change max_step=300, save_step=250. max_seq_length=4096
Hello are you still having your problem? By the way you can join our discord at discord.gg/unsloth where many people can help you there :D
Thanks for the tips, I will try again and report back late.