MoE-LLaVA
MoE-LLaVA copied to clipboard
is support video ?
is support video train?
Sure. Our code supports multi-image training, multi-video training, and even image-video training together.
@LinB203 is there any video predict code in the repo to test it on a mp4 file? Or a preprocessing script showing how to sample the frames from a video?
This repo support training video but do not release checkpoint about video. So you may need to train a new model to support video. For video predict code or how to adapt video encoder into MoE-LLaVA, you can refer to Video-LLaVA.