video-language topic

List video-language repositories

UniVL

330
Stars
54
Forks
Watchers

An official implementation for " UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"

Multi-Modal-Transformer

211
Stars
29
Forks
Watchers

The repository collects many various multi-modal transformer architectures, including image transformer, video transformer, image-language transformer, video-language transformer and self-supervised l...

all-in-one

273
Stars
16
Forks
Watchers

[CVPR2023] All in One: Exploring Unified Video-Language Pre-training

ReferFormer

310
Stars
26
Forks
Watchers

[CVPR2022] Official Implementation of ReferFormer

ALPRO

184
Stars
18
Forks
Watchers

Align and Prompt: Video-and-Language Pre-training with Entity Prompts

EgoVLP

208
Stars
19
Forks
Watchers

[NeurIPS2022] Egocentric Video-Language Pretraining

Region_Learner

42
Stars
4
Forks
Watchers

The Pytorch implementation for "Video-Text Pre-training with Learned Regions"

VidIL

112
Stars
1
Forks
Watchers

Pytorch code for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners

Perceiver_VL

32
Stars
3
Forks
Watchers

PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)

VidSitu

56
Stars
8
Forks
Watchers

[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)