multimodal-large-language-models topic

List multimodal-large-language-models repositories

VisualWebBench

Stars: 41, Forks: 1

Evaluation framework for the paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"

Youku-mPLUG

Stars: 281, Forks: 11

Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks

EasyDetect

Stars: 49, Forks: 3

An Easy-to-use Hallucination Detection Framework for LLMs.

MineLand

Stars: 48, Forks: 7

Simulating Large-Scale Multi-Agent Interactions with Limited Multimodal Senses and Physical Needs

seemore

Stars: 151, Forks: 14

From-scratch implementation of a vision language model in pure PyTorch

mPLUG-HalOwl

Stars: 75, Forks: 2

mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigation

MPP-LLaVA

Stars: 357, Forks: 20

Personal Project: MPP-Qwen14B & MPP-Qwen-Next (Multimodal Pipeline Parallel based on Qwen-LM). Supports [video/image/multi-image] {sft/conversations}. Don't let poverty limit your imagination! Train...

Awesome-Medical-Large-Language-Models

Stars: 204, Forks: 21

Curated papers on Large Language Models in the healthcare and medical domain

polite-flamingo

Stars: 63, Forks: 3

🦩 Visual Instruction Tuning with Polite Flamingo - training multi-modal LLMs to be both clever and polite! (AAAI-24 Oral)

VideoTGB

Stars: 22, Forks: 1

[EMNLP 2024] A Video Chat Agent with Temporal Prior