vit-pytorch icon indicating copy to clipboard operation
vit-pytorch copied to clipboard

This ViT implementation as generative network

Open MrCorsair3 opened this issue 1 year ago • 1 comments

First of all, I want to thank you for a very good job, it looks great!

I would like to use the ViT implementation presented in this repository to generate images in a supervised training process.

The ViT implementation presented in this repository provides a very clear API, however, from what I understand, the main role of these implementations is the task of classification.

Let's assume that I would like to use the presented ViT as a generator in the GAN network (without going into details) to generate 224x224 image based on another input image of the same size. Can I use the presentent API to generate the appropriate architecture?

Thank you in advance Best regards

MrCorsair3 avatar Aug 21 '23 19:08 MrCorsair3

@MrCorsair3 there was a lot of interest in introducing attention to GANs, before DDPMs swept the field away

would start with https://arxiv.org/abs/2107.04589

lucidrains avatar Sep 04 '23 14:09 lucidrains