cross-modal topics

Analyze the unstructured data with Towhee, such as reverse image search, reverse video search, audio classification, question and answer systems, molecular search, etc.

towhee-io

audio-classification

cross-modal

embeddings

image-classification

VLTVG

88

Stars

7

Forks

Watchers

Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022

yangli18

cross-modal

vision-language

visual-grounding

visual-linguistic

aaai17-cdq

35

Stars

24

Forks

Watchers

The implementation of AAAI-17 paper "Collective Deep Quantization of Efficient Cross-modal Retrieval"

caoyue10

cross-modal

deep-learning

quantization

similarity-search

Xmodal-Ctx

60

Stars

10

Forks

Watchers

Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning

GT-RIPL

clip

cross-modal

image-captioning

vision-and-language