large-multimodal-models topic

List large-multimodal-models repositories

Awesome_Matching_Pretraining_Transfering

397
Stars
47
Forks
Watchers

The Paper List of Large Multi-Modality Model, Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Insight.

BenchLMM

81
Stars
6
Forks
Watchers

[ECCV 2024] BenchLMM: Benchmarking Cross-style Visual Capability of Large Multimodal Models

MMMU

332
Stars
21
Forks
Watchers

This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"

LLaVA-Plus-Codebase

696
Stars
52
Forks
Watchers

LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills

awesome-multimodal-in-medical-imaging

486
Stars
51
Forks
Watchers

A collection of resources on applications of multi-modal learning in medical imaging.

OpenAdapt

963
Stars
133
Forks
Watchers

Open Source Generative Process Automation (i.e. Generative RPA). AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models

multi_token

175
Stars
12
Forks
Watchers

Embed arbitrary modalities (images, audio, documents, etc) into large language models.

TinyLLaVA_Factory

604
Stars
54
Forks
Watchers

A Framework of Small-scale Large Multimodal Models

MMStar

144
Stars
5
Forks
Watchers

[NeurIPS 2024] This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models"