cross-modal topic
awesome-audio-visual
A curated list of different papers and datasets in various areas of audio-visual processing
SCAN
PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)
knowledge-graphs
A collection of research on knowledge graphs
erlexec
Represent, send, store and search multimodal data
docarray
Represent, send, store and search multimodal data
discoart
🪩 Create Disco Diffusion artworks in one line
Weakly-Supervised-3D-Object-Detection
Weakly Supervised 3D Object Detection from Point Clouds (VS3D), ACM MM 2020
objects-that-sound
Unofficial implementation of Google DeepMind's paper `Objects that Sound`
MoTIS
[NAACL 2022] Mobile Text-to-Image search powered by multimodal semantic representation models (e.g., OpenAI's CLIP)
DSRAN
Code for journal paper "Learning Dual Semantic Relations with Graph Attention for Image-Text Matching", TCSVT, 2020.
ml-CCA
Implementation of Fast ml-CCA from the ICCV-2015 work "Multi-Label Cross-Modal Retrieval"