朱橹

Results 4 issues of 朱橹

In the original finetune.sh script, it was divided into model_name_or_path and pretrain_mm_mlp_adapter, representing the paths to the language model and the projector, respectively. However, in LanguageBind/Video-LLaVA-7B, the weights of all...

In VIDEO INSTRUCTION TUNING WITH SYNTHETIC DATA , the performance of LLaVA-Video-7B on MLVU is 70.8.However i can only get 67.0. I do not know what went wrong and could...