SoM-LLaVA

[COLM-2024] List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs

2 SoM-LLaVA issues

Thank you very much for your awesome work. Would you mind providing the annotated image download links?

Hello, thanks for sharing this work; it is very inspiring. Could you share the attention extraction and visualization script used to create Figure 2 in the paper?