grpotrainer topic
List
grpotrainer repositories
vlm-grpo
78
Stars
7
Forks
78
Watchers
An implementation of GRPO for Unsloth's VLMs training
tiny-r1
21
Stars
3
Forks
21
Watchers
Recreating the minimal training methods of DeepSeek-R1 for small langauge models.
simpleR1
30
Stars
2
Forks
30
Watchers
simpleR1: A Simple Framework for Training R1-like Models