multimodal-large-language-models topic

List multimodal-large-language-models repositories

Chinese-CLIP-opencv-onnxrun

49
Stars
10
Forks
Watchers

使用OpenCV+onnxruntime部署中文clip做以文搜图,给出一句话来描述想要的图片,就能从图库中搜出来符合要求的图片。包含C++和Python两个版本的程序

Gemini-Commonsense-Evaluation

35
Stars
2
Forks
Watchers

Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"

MM-InstructEval

24
Stars
1
Forks
Watchers

This repository contains code to evaluate various multimodal large language models using different instructions across multiple multimodal content comprehension tasks.

Woodpecker

599
Stars
29
Forks
Watchers

✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models. The first work to correct hallucinations in MLLMs.

DrugLAMP

33
Stars
0
Forks
Watchers

A PyTorch-based system for highly accurate drug-target interaction predictions utilizing multi-modal large language models to discern structural affinities in drug-target pairs.

ComfyUI-Hangover-Moondream

41
Stars
6
Forks
Watchers

Moondream is a lightweight multimodal large language model

Bunny

886
Stars
66
Forks
Watchers

A family of lightweight multimodal models.

MLM_Filter

40
Stars
1
Forks
Watchers

Official implementation of our paper "Finetuned Multimodal Language Models are High-Quality Image-Text Data Filters".

mPLUG-DocOwl

1.5k
Stars
99
Forks
Watchers

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding