grpotrainer topic

List grpotrainer repositories

vlm-grpo

78 stars, 7 forks, 78 watchers

An implementation of GRPO for training VLMs with Unsloth

tiny-r1

21 stars, 3 forks, 21 watchers

Recreating the minimal training methods of DeepSeek-R1 for small language models.

simpleR1

30 stars, 2 forks, 30 watchers

simpleR1: A Simple Framework for Training R1-like Models