multimodal-large-language-models topic
Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
LLaVA-Plus-Codebase
LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills
Awesome-Multimodal-LLM-Autonomous-Driving
[WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving
awesome-multimodal-in-medical-imaging
A collection of resources on applications of multi-modal learning in medical imaging.
MovieChat
[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding
KoPA
[Paper][ACM MM 2024] Making Large Language Models Perform Better in Knowledge Graph Completion
RPG-DiffusionMaster
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)
Awesome-Multimodal-LLM
Reading list for Multimodal Large Language Models
Awesome-Multimodal-LLM
Research Trends in LLM-guided Multimodal Learning.
MobileAgent
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family