captioning-videos topic
cyclical-visual-captioning
PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision
awesome-grounding
awesome grounding: A curated list of research papers in visual grounding
vognet-pytorch
[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)
iPerceive
Applying Common-Sense Reasoning to Multi-Modal Dense Video Captioning and Video Question Answering | Python3 | PyTorch | CNNs | Causality | Reasoning | LSTMs | Transformers | Multi-Head Self Attention...
cst_captioning
PyTorch Implementation of Consensus-based Sequence Training for Video Captioning
mvad-names-dataset
M-VAD Names Dataset. Multimedia Tools and Applications (2019)
MSR-VTT-Challenge
Video to Language Challenge (MSR-VTT Challenge 2016)
Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
VidSitu
[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)
Dataset-Curation-Tool
A tool for downloading from public image boards (which allow scraping) / preview your images & tags / edit your images & tags. Additional tabs for downloading other desired code repositories as well a...