activitynet topic
CLIP4Clip
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
PaddleVideo
Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton-based action recognition models, practical applications for video taggi...
ACAR-Net
[CVPR 2021] Actor-Context-Actor Relation Network for Spatio-temporal Action Localization
activitynet-qa
A VideoQA dataset based on the videos from ActivityNet
A2CL-PT
Adversarial Background-Aware Loss for Weakly-supervised Temporal Activity Localization (ECCV 2020)
MAC
An end-to-end masked contrastive video-and-language pre-training framework
X-CLIP
An official implementation for "X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval"