large-vision-language-models topic

List large-vision-language-models repositories

Awesome-Medical-Large-Language-Models

204
Stars
21
Forks
Watchers

Curated papers on Large Language Models in Healthcare and Medical domain

Awesome-Chart-Understanding

161
Stars
15
Forks
Watchers

A curated list of recent and past chart understanding work based on our survey paper: From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models.

Awesome-LLMs-meet-Multimodal-Generation

322
Stars
17
Forks
Watchers

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).

ShareGPT4Video

1.2k
Stars
44
Forks
Watchers

[NeurIPS 2024 D&B Track] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Awesome-LVLM-Hallucination

18
Stars
1
Forks
Watchers

up-to-date curated list of state-of-the-art Large vision language models hallucinations research work, papers & resources

ShareGPT4V

124
Stars
4
Forks
Watchers

[ECCV 2024] ShareGPT4V: Improving Large Multi-modal Models with Better Captions

Video-MME

370
Stars
11
Forks
Watchers

✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

moh

23
Stars
1
Forks
Watchers

Official Repository of Multi-Object Hallucination in Vision-Language Models

apiprompting

21
Stars
1
Forks
Watchers

[ECCV 2024] API: Attention Prompting on Image for Large Vision-Language Models