cross-modality topic
Sound-of-Pixels
Codebase for ECCV 2018 "The Sound of Pixels"
CrossLeak
Code for the WWW'20 paper "Nowhere to Hide: Cross-modal Identity Leakage between Biometrics and Devices"
clip-as-service
🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
jina
☁️ Build multimodal AI applications with cloud-native stack
movienet-tools
Tools for movie and video research
sem-pcyc
PyTorch implementation of the paper "Semantically Tied Paired Cycle Consistency for Zero-Shot Sketch-based Image Retrieval", CVPR 2019.
Awesome-cross-modality-person-re-identification
Awesome Cross-modality Person Re-identification
Visible-Thermal-Person-Re-Identification
Demo code for visible-thermal (cross-modality) person re-identification
co-separation
Co-Separating Sounds of Visual Objects (ICCV 2019)
VideoNavQA
An alternative EQA paradigm and informative benchmark + models (BMVC 2019, ViGIL 2019 spotlight)