Vision-Transformer
Vision-Transformer copied to clipboard
Pytorch implementation of ViT on CIFAR-10.
Hi there! Thanks for this repository The code doesn't run though... Two issues that I found: 1. The model name in the file is ViT, while in the training file...
I sincerely hope that the author can complete the code
After running line 134 in the ViT forward: patches = images.unfold(2, self.patch_height, self.patch_width).unfold(3, self.patch_height, self.patch_width) I get a tensor with sizes data:image/s3,"s3://crabby-images/b14ab/b14abc924f3a60843d079e94ad43720047955be3" alt="image" The next line is: patches = patches.permute(0, 2,...