cross-modal topic

List cross-modal repositories

XFlow

25
Stars
3
Forks
Watchers

Generalized cross-modal NNs; new audiovisual benchmark (IEEE TNNLS 2019)

examples

390
Stars
104
Forks
Watchers

Analyze the unstructured data with Towhee, such as reverse image search, reverse video search, audio classification, question and answer systems, molecular search, etc.

VLTVG

88
Stars
7
Forks
Watchers

Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022

aaai17-cdq

35
Stars
24
Forks
Watchers

The implementation of AAAI-17 paper "Collective Deep Quantization of Efficient Cross-modal Retrieval"

Xmodal-Ctx

60
Stars
10
Forks
Watchers

Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning

RIM

199
Stars
12
Forks
Watchers

[CVPR 2023] Referring Image Matting

Text2Pos-CVPR2022

37
Stars
7
Forks
Watchers

Code, dataset and models for our CVPR 2022 publication "Text2Pos"

ZeroVL

41
Stars
5
Forks
Watchers

[ECCV2022] Contrastive Vision-Language Pre-training with Limited Resources

SOLC

170
Stars
25
Forks
Watchers

Remote Sensing Sar-Optical Land-use Classfication Pytorch Pytorch高分辨率遥感语义分割/地物分割/地物分类

multimodal-maestro

1.0k
Stars
71
Forks
Watchers

Effective prompting for Large Multimodal Models like GPT-4 Vision, LLaVA or CogVLM. 🔥