video-transformer topic
VideoMAE
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Multi-Modal-Transformer
The repository collects many various multi-modal transformer architectures, including image transformer, video transformer, image-language transformer, video-language transformer and self-supervised l...
long-short-term-transformer
[NeurIPS 2021 Spotlight] Official implementation of Long Short-Term Transformer for Online Action Detection
video-transformers
Easiest way of fine-tuning HuggingFace video classification models
transfomers-silicon-research
Research and Materials on Hardware implementation of Transformer Model
VideoMAE-Action-Detection
[NeurIPS 2022 Spotlight] VideoMAE for Action Detection
vid-TLDR
Official implementation of CVPR 2024 paper "vid-TLDR: Training Free Token merging for Light-weight Video Transformer".