vit topic
PyTorch-Scratch-Vision-Transformer-ViT
Simple and easy to understand PyTorch implementation of Vision Transformer (ViT) from scratch, with detailed steps. Tested on common datasets like MNIST, CIFAR10, and more.
Persian-Image-Captioning
A Persian Image Captioning model based on Vision Encoder Decoder Models of the transformers🤗.
Code-Canvas
A hub for innovation through web development projects
ViTPose_pytorch
An unofficial implementation of ViTPose [Y. Xu et al., 2022]
EEG-Transformer
i. A practical application of Transformer (ViT) on 2-D physiological signal (EEG) classification tasks. Also could be tried with EMG, EOG, ECG, etc. ii. Including the attention of spatial dimension (c...
TransformerX
Flexible Python library providing building blocks (layers) for reproducible Transformers research (Tensorflow ✅, Pytorch 🔜, and Jax 🔜)
Vit-RGTS
Open source implementation of "Vision Transformers Need Registers"
TubeViT
An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"
Facial-Attribute-Recognition-from-face-images
FacialAttributesExtractor is a Python library for precise facial attribute extraction, offering comprehensive insights into various features using OpenCV and Deep Learning. Enhance your image processi...
RevCol
Official Code of Paper "Reversible Column Networks" "RevColv2"