SoM-LLaVA
SoM-LLaVA copied to clipboard
[COLM-2024] List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs
Results
2
SoM-LLaVA issues
Sort by
recently updated
recently updated
newest added
Thank you very much for your awesome work. Would you mind providing the annotated image download links?
Hello, thanks for sharing the work, it is very inspiring. I wonder if you can share the attention extraction and visualization script used for creating Figure 2 in the paper?