Video-LLaMA What is the input sample of the forward function in videollama

What is the input sample of the forward function in videollama

Open llx-08 opened this issue 1 year ago • 1 comments

Hi, I'm wondering what is the input sample of the forward function in videollama.py.

It seems like an dict() which contains image, text_input as its keys, but I can't find any usage as example. Besides, I check the inference process in demo_audiovideo.py, it's different with the forward process. Can you provide some example to use the forward function in videollama? Thank you very much!

Mar 08 '24 04:03 llx-08

I am also finding this solution.!

Apr 28 '24 04:04 EQ3000

Video-LLaMA Video-LLaMA copied to clipboard

What is the input sample of the forward function in videollama

Video-LLaMA
Video-LLaMA copied to clipboard