multi-modal-chatgpt topic

List multi-modal-chatgpt repositories

Video-LLaMA

2.7k
Stars
242
Forks
15
Watchers

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

NExT-GPT

3.2k
Stars
319
Forks
Watchers

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model