activitynet-captions topic
densecap
Dense video captioning in PyTorch
BMT
Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)
MDVC
PyTorch implementation of Multi-modal Dense Video Captioning (CVPR 2020 Workshops)
recurrent-transformer
[ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
Awesome-Temporally-Language-Grounding
A curated list of “Temporally Language Grounding” and related area
PDVC
End-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021)
Temporally-language-grounding
A Pytorch implemention for some state-of-the-art models for" Temporally Language Grounding in Untrimmed Videos"
dense-video-captioning-pytorch
Second-place solution to dense video captioning task in ActivityNet Challenge (CVPR 2020 workshop)
video_captioning_datasets
Summary about Video-to-Text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review*
video_features_extractor
Python implementation of extraction of several visual features representations from videos