image-captioning topic
sightseq
Computer vision tools for fairseq, containing PyTorch implementations of text recognition and object detection
Attention-Beam-Image-Captioning
Image captioning using a beam search heuristic on top of an encoder-decoder architecture
MAGIC
Language Models Can See: Plugging Visual Controls in Text Generation
bottom-up-attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
coco-caption
Adds the SPICE metric to the coco-caption evaluation server code
SPICE
Semantic Propositional Image Caption Evaluation
Up-Down-Captioner
Automatic image captioning model based on Caffe, using features from bottom-up attention
AdaptiveAttention
Implementation of "Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning"
fairseq-image-captioning
Transformer-based image captioning extension for pytorch/fairseq
virtex
[CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations