video-dataset topic
SPHAR-Dataset
Surveillance Perspective Human Action Recognition Dataset: 7759 videos from 14 action classes, aggregated from multiple sources, all cropped spatio-temporally and filmed from a surveillance-camera lik...
Video-Dataset-Loading-Pytorch
Generic PyTorch dataset implementation to load and augment videos for deep learning training loops.
Awsome-Deep-Learning-for-Video-Analysis
Papers, code and datasets about deep learning and multi-modal learning for video analysis
Multimodal-short-video-dataset-and-baseline-classification-model
A dataset of 500,000 multimodal short videos, with baseline classification models (TensorFlow 2.0).
pytorch-VideoDataset
Tools for loading video datasets and applying transforms to video in PyTorch. Video files can be loaded directly, without preprocessing.
epic-kitchens-55-starter-kit-action-recognition
Starter kit for working with the EPIC-KITCHENS-55 dataset for action recognition or anticipation
video_captioning_datasets
Summary of video-to-text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review*
VideoLT
Official Code for VideoLT: Large-scale Long-tailed Video Recognition (ICCV 2021)
InternVideo
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
AVCAffe
[AAAI 2023] AVCAffe: A Large Scale Audio-Visual Dataset of Cognitive Load and Affect for Remote Work