mllm topic

List mllm repositories

Woodpecker

561
Stars
28
Forks
Watchers

✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models. The first work to correct hallucinations in MLLMs.

GenerateU

99
Stars
5
Forks
Watchers

[CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection

MIKO

16
Stars
0
Forks
Watchers

MIKO: Multimodal Intention Knowledge Distillation from Large Language Models for Social-Media Commonsense Discover

Bunny

704
Stars
55
Forks
Watchers

A family of lightweight multimodal models.

ComfyUI_VLM_nodes

246
Stars
16
Forks
Watchers

Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation

mPLUG-DocOwl

1.0k
Stars
58
Forks
Watchers

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Groma

404
Stars
50
Forks
Watchers

Grounded Multimodal Large Language Model with Localized Visual Tokenization

VisualWebBench

34
Stars
0
Forks
Watchers

Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"

Youku-mPLUG

262
Stars
11
Forks
Watchers

Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks

mPLUG-HalOwl

59
Stars
1
Forks
Watchers

mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating