vit topic
NaViT
My implementation of "Patch nā Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"
Deepfake-detection
Learning a Deep Dual-level Network for Robust DeepFake Detection
ECG-Representation-Learning
Self-supervised pre-training for ECG representation with inspiration from transformers & computer vision
vision-transformer
Vision Transformer explanation and implementation with PyTorch
VisionGPT2
Combining ViT and GPT-2 for image captioning. Trained on MS-COCO. The model was implemented mostly from scratch.
ViT_PyTorch
A PyTorch Implementation of ViT (Vision Transformer)
simple-aesthetics-predictor
CLIP-based aesthetics predictor inspired by the interface of š¤ huggingface transformers.
NeRF-MAE
[ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields
Awesome-DiT-Inference
šA curated list of Awesome Diffusion Inference Papers with Codes: Sampling, Cache, Quantization, Parallelism, etc.š
transformer-fusion
Official repository of the "Transformer Fusion with Optimal Transport" paper, published as a conference paper at ICLR 2024.