large-multimodal-models topic

List large-multimodal-models repositories

VisualWebBench

34
Stars
0
Forks
Watchers

Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"

TextCoT

20
Stars
1
Forks
Watchers

The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.

MileBench

16
Stars
1
Forks
Watchers

This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context"

Open-LLaVA-NeXT

18
Stars
1
Forks
Watchers

An open-source implementation of LLaVA-NeXT.