msrvtt topics

video-captioning-models-in-Pytorch

68 Stars, 15 Forks

A PyTorch implementation of state-of-the-art video captioning models from 2015-2019 on the MSVD and MSRVTT datasets.

nasib-ullah

deep-learning

marn

msrvtt

msvd

UniVL

330 Stars, 54 Forks

An official implementation for "UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"

microsoft

alignment

caption

caption-task

coin

CLIP4Clip

792 Stars, 116 Forks

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

ArrowLuo

activitynet

clip

didemo

lsmdc

Semantics-AssistedVideoCaptioning

56 Stars, 17 Forks

Source code for "Semantics-Assisted Video Captioning Model Trained with Scheduled Sampling Strategy"

WingsBrokenAngel

msrvtt

msvd

python3

state-of-the-art

VidIL

112 Stars, 1 Fork

PyTorch code for "Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners"

MikeWangWZHL

blip

clip

gpt-3

msrvtt

MAC

23 Stars, 0 Forks

An end-to-end masked contrastive video-and-language pre-training framework

shufangxun

activitynet

clip

contrastive-learning

didemo

X-CLIP

114 Stars, 15 Forks

An official implementation for "X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval"

xuguohai

activitynet

didemo

lsmdc

msrvtt