multimodal-dataset topic

List multimodal-dataset repositories

conceptual-12m

327
Stars
16
Forks
Watchers

Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.

modd

46
Stars
5
Forks
Watchers

Dataset and Evaluation Scripts for Obstacle Detection via Semantic Segmentation in a Marine Environment