multimodal-large-language-models topic

List multimodal-large-language-models repositories

Awesome-Multimodal-Large-Language-Models

11.9k
Stars
765
Forks
208
Watchers

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

LLaVA-Plus-Codebase

696
Stars
52
Forks
Watchers

LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills

[WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving

awesome-multimodal-in-medical-imaging

486
Stars
51
Forks
Watchers

A collection of resources on applications of multi-modal learning in medical imaging.

MovieChat

500
Stars
41
Forks
Watchers

[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding

KoPA

132
Stars
8
Forks
Watchers

[Paper][ACM MM 2024] Making Large Language Models Perform Better in Knowledge Graph Completion

RPG-DiffusionMaster

1.7k
Stars
92
Forks
17
Watchers

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)

Awesome-Multimodal-LLM

347
Stars
16
Forks
Watchers

Research Trends in LLM-guided Multimodal Learning.

MobileAgent

2.9k
Stars
265
Forks
32
Watchers

Mobile-Agent: The Powerful Mobile Device Operation Assistant Family