jingyi0000

2 repositories owned by jingyi0000

VLM_survey

Stars: 2.4k, Forks: 214, Watchers:

Collection of AWESOME vision-language models for vision tasks

R1-VL

Stars: 445, Forks: 0, Watchers: 445

R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization