cross-modal-retrieval topic
Image-Text-Embedding
TOMM2020 Dual-Path Convolutional Image-Text Embedding :feet: https://arxiv.org/abs/1711.05535
Text2Pos-CVPR2022
Code, dataset and models for our CVPR 2022 publication "Text2Pos"
STT
A multi-task model which does image captioning, sentence paraphrasing and cross-modal retrieval.
eccv-caption
Extended COCO Validation (ECCV) Caption dataset (ECCV 2022)
pcme
Official Pytorch implementation of "Probabilistic Cross-Modal Embedding" (CVPR 2021)
on-the-fly-FGSBIR
[CVPR 2020, Oral] "Sketch Less for More: On-the-Fly Fine-Grained Sketch Based Image Retrieval”, IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2020. .
tidy
Offline semantic Text-to-Image and Image-to-Image search on Android powered by quantized state-of-the-art vision-language pretrained CLIP model and ONNX Runtime inference engine
VNEL
Dataset and code for EMNLP 2022 "Visual Named Entity Linking: A New Dataset and A Baseline"
GNN4CMR
PyTorch implementation of the AAAI-21 paper "Dual Adversarial Label-aware Graph Neural Networks for Cross-modal Retrieval" and the TPAMI-22 paper "Integrating Multi-Label Contrastive Learning with Dua...
UCCH
Unsupervised Contrastive Cross-modal Hashing (IEEE TPAMI 2023, PyTorch Code)