Xuan Dong
Xuan Dong
Thank you for your contribution! Hello, I'm trying to reproduce the evaluation scores for generative performance in the VideoChatGPT evaluation of model EVA-G & LLaVA1.5-VideoChatGPT-Instruct 7B. I have downloaded your...
Thank you for your contribution. Under the huggingface `lmms-lab/LLaVA-OneVision-Data` repo, I find that there are only single-image data, and in your `scripts/train/README.md`, you say that the video incorporates **Youcook2 (32267),...
Hi, First of all, thanks for your great work on Omniparser V2! After reviewing the code in demo.ipynb, I understand that the workflow of Omniparser V2 involves: 1. Using an...