Awesome-Multimodal-Large-Language-Models
Add VideoLLM-online
Thanks for your great work! If possible, could you consider adding:
VideoLLM-online: Online Large Language Model for Streaming Video (CVPR 2024)
We have released the code, data, checkpoints, and demo at: https://showlab.github.io/videollm-online/
Thanks for sharing. Your work has been incorporated into the repo. Please also consider citing:
@article{yin2024survey,
title={A survey on multimodal large language models},
author={Yin, Shukang and Fu, Chaoyou and Zhao, Sirui and Li, Ke and Sun, Xing and Xu, Tong and Chen, Enhong},
journal={National Science Review},
pages={nwae403},
year={2024},
publisher={Oxford University Press}
}
@article{fu2024mme,
title={MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs},
author={Fu, Chaoyou and Zhang, Yi-Fan and Yin, Shukang and Li, Bo and Fang, Xinyu and Zhao, Sirui and Duan, Haodong and Sun, Xing and Liu, Ziwei and Wang, Liang and others},
journal={arXiv preprint arXiv:2411.15296},
year={2024}
}