visual-semantic topic
meshed-memory-transformer
Meshed-Memory Transformer for Image Captioning. CVPR 2020
SCAN
PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)
show-control-and-tell
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019
Image-Text-Embedding
TOMM2020 Dual-Path Convolutional Image-Text Embedding :feet: https://arxiv.org/abs/1711.05535
lostX
(RSS 2018) LoST - Visual Place Recognition using Visual Semantics for Opposite Viewpoints across Day and Night
vse_infty
Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021
speaksee
PyTorch library for Visual-Semantic tasks