multi-modal-chatgpt topic

List multi-modal-chatgpt repositories

Video-LLaMA

2.5k
Stars
229
Forks
15
Watchers

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

NExT-GPT

3.0k
Stars
305
Forks
Watchers

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model