LAVIS icon indicating copy to clipboard operation
LAVIS copied to clipboard

Video Question Answering examples

Open behroozazarkhalili opened this issue 1 year ago • 8 comments

Hi there, Could you please provide an example as how we should run Video Question Answering task using LAVIS?

Any examples about other video-related tasks well be very appreciated.

behroozazarkhalili avatar Nov 16 '22 13:11 behroozazarkhalili

Hi @behroozazarkhalili,

I will work on this, aiming to get back in 1-2 weeks.

Thanks.

dxli94 avatar Nov 17 '22 15:11 dxli94

Hi @behroozazarkhalili,

I will work on this, aiming to get back in 1-2 weeks.

Thanks.

Thank you for your time and consideration.

behroozazarkhalili avatar Nov 19 '22 19:11 behroozazarkhalili

@dxli94 Hi. Any update about this?

behroozazarkhalili avatar Dec 10 '22 17:12 behroozazarkhalili

@mgomes @mgomes @amerine @dxli94 Hi everyone. Could you please kindly do a favor and answer this vital question?

behroozazarkhalili avatar Dec 31 '22 15:12 behroozazarkhalili

Hi. Will some examples be provided? Thanks.

tobyperrett avatar Feb 04 '23 15:02 tobyperrett

Haven't got time to work on this yet.

With the unified interface in LAVIS to load model and preprocess samples, a working example will look similar to others in vqa etc.

PR welcomed.

dxli94 avatar Feb 06 '23 03:02 dxli94

It looks like the ALPRO model does not have a predict_answers method. Would this would be needed to be implemented in the ALPRO class to do video QA as in the example for vqa using BLIP? Or am I missing something?

tobyperrett avatar Feb 06 '23 13:02 tobyperrett

May I ask how we can use LAVIS to run video Q&A tasks as an example? Is there any latest progress

qq846511277 avatar Apr 25 '23 05:04 qq846511277

Hi, I am also keen to know how to use InstructBLIP for video input, does anyone willing to share? I truly appreciate your valuable time.

ee2110 avatar Jul 09 '23 16:07 ee2110