vit-pytorch
vit-pytorch copied to clipboard
This ViT implementation as generative network
First of all, I want to thank you for a very good job, it looks great!
I would like to use the ViT implementation presented in this repository to generate images in a supervised training process.
The ViT implementation presented in this repository provides a very clear API, however, from what I understand, the main role of these implementations is the task of classification.
Let's assume that I would like to use the presented ViT as a generator in the GAN network (without going into details) to generate 224x224 image based on another input image of the same size. Can I use the presentent API to generate the appropriate architecture?
Thank you in advance Best regards
@MrCorsair3 there was a lot of interest in introducing attention to GANs, before DDPMs swept the field away
would start with https://arxiv.org/abs/2107.04589