LLaVA-NeXT
Slow-fast features are not used in the current code
Hi, I notice that you have commented out encode_multimodals (https://github.com/LLaVA-VL/LLaVA-NeXT/blob/09e5840d5589ad2d6a8656c0a60f21ae134b3309/llava/model/llava_arch.py#L291C32-L291C55). If I understand correctly, using slow-fast features would require using self.encode_multimodals and not self.encode_images. Could you clarify this?
I also ran into this problem: self.encode_multimodals is never called. Could you clarify this?
They replied in another thread. They don't use slow-fast pooling on the 7B model, only on the 72B.