multimodal-large-language-models topic

List multimodal-large-language-models repositories

Awesome_Matching_Pretraining_Transfering

434
Stars
49
Forks
434
Watchers

The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Insigh...

MM-NIAH

75
Stars
4
Forks
Watchers

This is the official implementation of the paper "Needle In A Multimodal Haystack"

EasyDetect

36
Stars
2
Forks
36
Watchers

[ACL 2024] An Easy-to-use Hallucination Detection Framework for LLMs.

DynMoE

42
Stars
8
Forks
Watchers

[Preprint] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models

Video-of-Thought

35
Stars
2
Forks
Watchers

Codes for ICML 2024 paper: "Video-of-Thought: Step-by-Step Video Reasoning from Perception to Cognition"

EVE

361
Stars
12
Forks
361
Watchers

EVE Series: Encoder-Free Vision-Language Models from BAAI

cambrian

1.7k
Stars
112
Forks
Watchers

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

CompBench

30
Stars
1
Forks
Watchers

CompBench evaluates the comparative reasoning of multimodal large language models (MLLMs) with 40K image pairs and questions across 8 dimensions of relative comparison: visual attribute, existence, st...

Awesome-LVLM-Hallucination

211
Stars
8
Forks
211
Watchers

up-to-date curated list of state-of-the-art Large vision language models hallucinations research work, papers & resources