Video-LLaVA
Video-LLaVA copied to clipboard
Support multiple rounds of video conversations?
Great work! As video conversations in the instruction dataset have only one round in this version, if I want to train and test multiple rounds of video conversions, what should I do? Thanks!
Simply just need to organize the multi-round conversation data in the format of llava_image_tune_.json
. llava_image_tune_.json
has examples of multi-round conversations in it, even though it is images.
For the dataset source you can use VideoChat.
Does this repo support inference and evaluation of multiple rounds of video conversations currently? Which file should I refer?
Does this repo support inference and evaluation of multiple rounds of video conversations currently? Which file should I refer?
You can refer to this. But I'm not sure the second output of the model is useful.
Simply just need to organize the multi-round conversation data in the format of
llava_image_tune_.json
.llava_image_tune_.json
has examples of multi-round conversations in it, even though it is images.For the dataset source you can use VideoChat.
where can I get llava_image_tune_.json? this file is not contained in datasets