captioning topic
PaperNotes
My notes on some Deep Learning papers
mmf
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
VSUA-Captioning
Code for "Aligning Linguistic Words and Visual Semantic Units for Image Captioning", ACM MM 2019
fully-convolutional-point-network
Fully-Convolutional Point Networks for Large-Scale Point Clouds
iPerceive
Applying Common-Sense Reasoning to Multi-Modal Dense Video Captioning and Video Question Answering | Python3 | PyTorch | CNNs | Causality | Reasoning | LSTMs | Transformers | Multi-Head Self Attention...
MedicalReportGeneration
A Base Tensorflow Project for Medical Report Generation
Tennis
A Tennis dataset and models for event detection & commentary generation
MTL-AQA
What and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment [CVPR 2019]
R3Transformer
Official python implementation of R3-Transformer
awesome-diverse-captioning
Some papers about *diverse* image (a few videos) captioning