MiniCPM-V

Using v2.6 model with video on Mac

Open chigkim opened this issue 1 year ago • 1 comments

First of all, thank you for the amazing work! I'm wondering if there's a way to use video input on Mac. I can load the v2.6 model in llama.cpp, but it only accepts image input. When I try to load the 2.6 model with transformers.AutoModel, it requires flash attention. Unfortunately, flash attention is not available on Mac. Is there a workaround for this? Thanks!

chigkim avatar Aug 27 '24 17:08 chigkim

@chigkim You need this video understanding patch: https://github.com/ggerganov/llama.cpp/pull/9165.patch (it ran fine on the Mac).

saket424 avatar Aug 27 '24 22:08 saket424

Thanks so much! The PR worked!

chigkim avatar Aug 28 '24 14:08 chigkim

How about streaming?

franz101 avatar Jan 28 '25 21:01 franz101