LLaVA [Question] Predictions in video by stitching consecutive frames into a single image

[Question] Predictions in video by stitching consecutive frames into a single image

Open shashi-netra opened this issue 1 year ago • 1 comments

Question

I am looking to use LLaVa for predictions in video by stitching a sequence of consecutive frames into a single image and then asking LLava for a prediction. Has anyone used this approach before and found any success? if so, any tips on how you approached it.

Dec 12 '23 12:12 shashi-netra

Hi, now I also need to predict videos. Do you have a better solution? My current approach is to draw frames to predict

Apr 29 '24 02:04 mhkz

LLaVA LLaVA copied to clipboard

[Question] Predictions in video by stitching consecutive frames into a single image

Question

LLaVA
LLaVA copied to clipboard