lynshwoo2022
lynshwoo2022
thanks your great work, is there any .md training、testing instruction?
 when will model interaction part be released?

hi author, thanks for your great work. my question is as the title said. looking forward to your response. best regards
is there a script that can test on single video inference?
In table 13 of your paper ”Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling” shows that the **result of InternVL2_5-2B on GSM8K(4-shot)** is about **55**,...
### Describe the feature 如题 ### Will you implement it? - [ ] I would like to implement this feature and create a PR!