multimodal-large-language-models topic
Awesome_Matching_Pretraining_Transfering
The paper list of large multi-modality models (perception, generation, unification), parameter-efficient finetuning, vision-language pretraining, and conventional image-text matching for preliminary insigh...
MM-NIAH
The official implementation of the paper "Needle In A Multimodal Haystack".
EasyDetect
[ACL 2024] An Easy-to-use Hallucination Detection Framework for LLMs.
DynMoE
[Preprint] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models
EmpathyEar
A multimodal empathetic chatbot.
Video-of-Thought
Code for the ICML 2024 paper "Video-of-Thought: Step-by-Step Video Reasoning from Perception to Cognition".
EVE
EVE Series: Encoder-Free Vision-Language Models from BAAI
cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
CompBench
CompBench evaluates the comparative reasoning of multimodal large language models (MLLMs) with 40K image pairs and questions across 8 dimensions of relative comparison: visual attribute, existence, st...
Awesome-LVLM-Hallucination
An up-to-date curated list of state-of-the-art research on hallucinations in large vision-language models: papers and resources.