caption-generation topic
D3Net
[ECCV2022] D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding
Scan2Cap
[CVPR 2021] Scan2Cap: Context-aware Dense Captioning in RGB-D Scans
meshed-memory-transformer
Meshed-Memory Transformer for Image Captioning. CVPR 2020
show-control-and-tell
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019
Image-Caption-Generator
A neural network to generate captions for an image using CNN and RNN with BEAM Search.
ARNet
CVPR 2018 - Regularizing RNNs for Caption Generation by Reconstructing The Past with The Present
deep-learning-image-caption-generator
Deep CNN-LSTM for Generating Image Descriptions :smiling_imp:
Image-Captioning
Implemented 3 different architectures to tackle the Image Caption problem, i.e, Merged Encoder-Decoder - Bahdanau Attention - Transformers
speaksee
PyTorch library for Visual-Semantic tasks
SpaCap3D
[IJCAI 2022] Spatiality-guided Transformer for 3D Dense Captioning on Point Clouds (official pytorch implementation)