multi-modal-large-language-model topic
List
multi-modal-large-language-model repositories
Awesome-CVPR2024-ECCV2024-AIGC
437
Stars
12
Forks
Watchers
A Collection of Papers and Codes for CVPR2024/ECCV2024 AIGC
VTG-LLM
115
Stars
3
Forks
115
Watchers
[AAAI 2025] VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding
Video-RAG-master
336
Stars
32
Forks
336
Watchers
✨✨[NeurIPS 2025] This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension"