MovieChat icon indicating copy to clipboard operation
MovieChat copied to clipboard

About ablation study on memory mechanism

Open liziming5353 opened this issue 10 months ago • 14 comments

How is the model without the MM module implemented in the ablation experiment? Is it directly applying the merge algorithm to the entire video?

liziming5353 avatar Apr 25 '24 11:04 liziming5353

We just select several frames, feed them into LLM decoder without merge algorithm

Espere-1119-Song avatar Apr 25 '24 12:04 Espere-1119-Song

Got it. What's the difference between video_path and fragment_video_path? In my understanding, video_path is the path to the video to be processed. But in upload_video_without_audio function in chat_model.py, fragment_video_path is used as a parameter of load_video function.

liziming5353 avatar Apr 25 '24 12:04 liziming5353

fragment_video_path stores the video clips read by the sliding window

Espere-1119-Song avatar Apr 25 '24 12:04 Espere-1119-Song

So need I prepare the video clips in advance or it will be generated automatically?

liziming5353 avatar Apr 25 '24 12:04 liziming5353

no needs, it will be generated automatically

Espere-1119-Song avatar Apr 25 '24 12:04 Espere-1119-Song

Where does it generated? I don't find it. The first time fragment_video_path is used seems to be as a parameter of load_video in upload_video_without_audio function.

liziming5353 avatar Apr 25 '24 12:04 liziming5353

image

liziming5353 avatar Apr 25 '24 12:04 liziming5353

you can run it and print the path to see:)

Espere-1119-Song avatar Apr 25 '24 12:04 Espere-1119-Song

I have run it. I set the fragment_video_path to "~/video_frames_moviechat" which is an empty folder. An error occurred: image

liziming5353 avatar Apr 25 '24 12:04 liziming5353

because fragment_video_path needs to be a mp4 file, not a dictionary:)

Espere-1119-Song avatar Apr 25 '24 12:04 Espere-1119-Song

So fragment_video_path and video_path are the same video?

liziming5353 avatar Apr 25 '24 12:04 liziming5353

no, fragment_video_path is a tmp mp4 file

Espere-1119-Song avatar Apr 25 '24 12:04 Espere-1119-Song

But I only have one video to be processed and you said that fragment_video_path will be generated. So I am confused... Could you give me a sample?

liziming5353 avatar Apr 25 '24 12:04 liziming5353

It seems a bug in pypi code. In github code, the capture_video function write the tmp video file, and return the path. But in pypi code, the capture_video function does not write the tmp video file but still return the path. So the error above occurred.

liziming5353 avatar Apr 25 '24 16:04 liziming5353