Awesome-Multimodal-Large-Language-Models
Awesome-Multimodal-Large-Language-Models copied to clipboard
Add LITA: Language Instructed Temporal-Localization Assistant
A nice job focusing on temporal localization when generating captions or answer questions about a video.