Hert4

Results 5 issues of Hert4

Can we set ```𝛼 ≥ r``` in GPRO ?

Empty label while training with conversation notebook

### 📚 The doc issue Is there any tutor for integrating the vision model with the language model? ### Suggest a potential alternative/fix _No response_

Can you help me evaluate these models? Thanks beyoru/Qwen3-4B-I-1509 beyoru/Qwen3-4B-I-1209

live above :3

feature request