lvlm topic

List lvlm repositories

ms-swift

11.9k
Stars
1.1k
Forks
11.9k
Watchers

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi...

LightCompress

649
Stars
64
Forks
649
Watchers

[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLMs, VLMs, and video generative models.

MMStar

199
Stars
5
Forks
199
Watchers

[NeurIPS 2024] This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models"

Awesome-LLMs-meet-Multimodal-Generation

518
Stars
30
Forks
518
Watchers

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).

Eagle

898
Stars
47
Forks
898
Watchers

Eagle: Frontier Vision-Language Models with Data-Centric Strategies

Awesome-LVLM-Hallucination

237
Stars
9
Forks
237
Watchers

up-to-date curated list of state-of-the-art Large vision language models hallucinations research work, papers & resources

AISurveyPapers

19
Stars
1
Forks
19
Watchers

Large Visual Language Model(LVLM), Large Language Model(LLM), Multimodal Large Language Model(MLLM), Alignment, Agent, AI System, Survey

VoRA

344
Stars
29
Forks
344
Watchers

[Fully open] [Encoder-free MLLM] Vision as LoRA

OpenThinkIMG

324
Stars
6
Forks
324
Watchers

OpenThinkIMG is an end-to-end open-source framework that empowers LVLMs to think with images.

FrameFusion

66
Stars
1
Forks
66
Watchers

[ICCV'25] The official code of paper "Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models"