video-large-language-models topic

List video-large-language-models repositories

Awesome_Long_Form_Video_Understanding

326
Stars
14
Forks
326
Watchers

Awesome papers & datasets specifically focused on long-term videos.

MPP-LLaVA

501
Stars
26
Forks
501
Watchers

Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Train...

VTG-LLM

115
Stars
3
Forks
115
Watchers

[AAAI 2025] VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding

TRACE

136
Stars
3
Forks
136
Watchers

[ICLR 2025] TRACE: Temporal Grounding Video LLM via Casual Event Modeling

Video-RAG-master

336
Stars
32
Forks
336
Watchers

✨✨[NeurIPS 2025] This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension"

QuoTA

73
Stars
2
Forks
73
Watchers

✨✨[AAAI 2026] This is the official implementation of our paper "QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension"

HoliTom

57
Stars
1
Forks
57
Watchers

[NeurIPS'25] HoliTom: Holistic Token Merging for Fast Video Large Language Models

VidCom2

38
Stars
1
Forks
38
Watchers

[EMNLP 2025 Main] Video Compression Commander: Plug-and-Play Inference Acceleration for Video Large Language Models

Consistency-of-Video-LLM

15
Stars
0
Forks
15
Watchers

[CVPR 2025] Official Repository of the paper "On the Consistency of Video Large Language Models in Temporal Comprehension"