朱橹
Results
4
issues of
朱橹
In the original finetune.sh script, it was divided into model_name_or_path and pretrain_mm_mlp_adapter, representing the paths to the language model and the projector, respectively. However, in LanguageBind/Video-LLaVA-7B, the weights of all...
In VIDEO INSTRUCTION TUNING WITH SYNTHETIC DATA , the performance of LLaVA-Video-7B on MLVU is 70.8.However i can only get 67.0. I do not know what went wrong and could...