large-multimodal-models topic

Repositories tagged large-multimodal-models

Awesome_Matching_Pretraining_Transfering

Stars: 357, Forks: 47

A paper list covering large multi-modality models, parameter-efficient finetuning, vision-language pretraining, and conventional image-text matching, intended for preliminary insight.

BenchLMM

Stars: 80, Forks: 5

BenchLMM: Benchmarking Cross-style Visual Capability of Large Multimodal Models

MMMU

Stars: 273, Forks: 19

This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI".

LLaVA-Plus-Codebase

Stars: 641, Forks: 49

LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills

awesome-multimodal-in-medical-imaging

Stars: 348, Forks: 35

A collection of resources on applications of multi-modal learning in medical imaging.

OpenAdapt

Stars: 656, Forks: 80

AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models

multi_token

Stars: 150, Forks: 6

Embed arbitrary modalities (images, audio, documents, etc.) into large language models.

TinyLLaVA_Factory

Stars: 287, Forks: 25

A Framework of Small-scale Large Multimodal Models

MMStar

Stars: 110, Forks: 1

This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models?".