lvlm topic

List lvlm repositories

ms-swift

3.6k
Stars
310
Forks
Watchers

Use PEFT or full-parameter training to fine-tune 350+ LLMs or 90+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Visio...

LightCompress

625
Stars
62
Forks
625
Watchers

[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLM, VLM, and video generation models.

MMStar

199
Stars
5
Forks
199
Watchers

[NeurIPS 2024] This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models?"

Awesome-LLMs-meet-Multimodal-Generation

518
Stars
30
Forks
518
Watchers

🔥🔥🔥 A curated list of papers on LLM-based multimodal generation (image, video, 3D, and audio).

Eagle

898
Stars
47
Forks
898
Watchers

Eagle: Frontier Vision-Language Models with Data-Centric Strategies

Awesome-LVLM-Hallucination

211
Stars
8
Forks
211
Watchers

An up-to-date curated list of state-of-the-art research, papers, and resources on hallucination in large vision-language models.

AISurveyPapers

19
Stars
1
Forks
19
Watchers

Large Visual Language Model (LVLM), Large Language Model (LLM), Multimodal Large Language Model (MLLM), Alignment, Agent, AI System, Survey

VoRA

344
Stars
29
Forks
344
Watchers

[Fully open] [Encoder-free MLLM] Vision as LoRA

OpenThinkIMG

324
Stars
6
Forks
324
Watchers

OpenThinkIMG is an end-to-end open-source framework that empowers LVLMs to think with images.

FrameFusion

66
Stars
1
Forks
66
Watchers

[ICCV'25] The official code for the paper "Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models"