vision-transformers topic
DFSpot-Deepfake-Recognition
Determine whether a given video sequence has been manipulated or synthetically generated
SViTE
[NeurIPS'21] "Chasing Sparsity in Vision Transformers: An End-to-End Exploration" by Tianlong Chen, Yu Cheng, Zhe Gan, Lu Yuan, Lei Zhang, Zhangyang Wang
CrossViT-pytorch
Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
svt
Official repository for "Self-Supervised Video Transformer" (CVPR'22)
FocusOnDepth
A Monocular depth-estimation for in-the-wild AutoFocus application.
dino-vit-features
Official implementation for the paper "Deep ViT Features as Dense Visual Descriptors".
UNETR-Pose
3D Multi-person Pose Estimation in Multi-view Environment using 3D U-Net Transformer Networks
vits-robustness-torch
Code for the paper "A Light Recipe to Train Robust Vision Transformers" [SaTML 2023]
VTGAN
[ICCV'21] [Tensorflow] Semi-supervised Retinal Image Synthesis and Disease Prediction using Vision Transformers
deit-tf
Includes PyTorch -> Keras model porting code for DeiT models with fine-tuning and inference notebooks.