multi-modal-large-language-model topic

List multi-modal-large-language-model repositories

Awesome-CVPR2024-ECCV2024-AIGC

437
Stars
12
Forks
Watchers

A Collection of Papers and Codes for CVPR2024/ECCV2024 AIGC

VTG-LLM

115
Stars
3
Forks
115
Watchers

[AAAI 2025] VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding

Video-RAG-master

336
Stars
32
Forks
336
Watchers

✨✨[NeurIPS 2025] This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension"