
Vision Transformer (ViT) in PyTorch

Results: 24 PyTorch-Pretrained-ViT issues

Excuse me, I want to know what the parameters in transforms.Normalize(, ) should be when fine-tuning on the ImageNet-1k dataset.

In **transformer.py**, in class **MultiHeadedSelfAttention()**, we have the variable declarations `self.proj_q = nn.Linear(dim, dim)`, `self.proj_k = nn.Linear(dim, dim)`, `self.proj_v = nn.Linear(dim, dim)`, but wasn't it supposed to be Q, K and...
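A likely source of the confusion: each projection maps `dim -> dim` because the multi-head split happens *after* the projection, so three full-size Linears are equivalent to the per-head Q/K/V matrices stacked side by side. A minimal sketch (my own simplified version, not the repo's exact code):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MultiHeadedSelfAttention(nn.Module):
    """Sketch: full-size Q/K/V projections, reshaped into heads afterwards."""

    def __init__(self, dim, num_heads):
        super().__init__()
        assert dim % num_heads == 0
        self.num_heads = num_heads
        # Each Linear is dim -> dim; slicing the output into num_heads chunks
        # of size dim // num_heads gives the per-head projection matrices.
        self.proj_q = nn.Linear(dim, dim)
        self.proj_k = nn.Linear(dim, dim)
        self.proj_v = nn.Linear(dim, dim)

    def forward(self, x):
        b, n, d = x.shape
        h = self.num_heads
        # (b, n, d) -> (b, h, n, d // h) for each of q, k, v
        q, k, v = (p(x).view(b, n, h, d // h).transpose(1, 2)
                   for p in (self.proj_q, self.proj_k, self.proj_v))
        scores = q @ k.transpose(-2, -1) / (d // h) ** 0.5
        attn = F.softmax(scores, dim=-1)
        # merge heads back: (b, h, n, d // h) -> (b, n, d)
        return (attn @ v).transpose(1, 2).reshape(b, n, d)
```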

Hey! First of all, thanks for your contribution! I have looked at multiple ViT implementations and yours seems like the most straightforward, well-organized and simple to use. I'd like to...

Hi, I noticed a difference. Your code:
```
x = self.positional_embedding(x)  # b,gh*gw+1,d
x = self.transformer(x)  # b,gh*gw+1,d
```
The Vision Transformer from https://github.com/lucidrains/vit-pytorch/blob/main/vit_pytorch/vit.py:
```
x += self.pos_embedding[:, :(n + 1)]
x...
```
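The two styles above are typically equivalent: wrapping the addition in a module does the same thing as adding the parameter inline. A sketch of what such a module could look like (names and shapes here are illustrative, not copied from either repo):

```python
import torch
import torch.nn as nn

class PositionalEmbedding1D(nn.Module):
    """Sketch: learned 1-D positional embedding added to the token sequence.

    Calling this module is functionally the same as the inline
    `x += self.pos_embedding[:, :(n + 1)]` style, assuming seq_len covers
    the input length.
    """

    def __init__(self, seq_len, dim):
        super().__init__()
        # one learned vector per position, broadcast over the batch dimension
        self.pos_embedding = nn.Parameter(torch.zeros(1, seq_len, dim))

    def forward(self, x):
        n = x.size(1)
        return x + self.pos_embedding[:, :n]
```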

Will you release the L_16 pretrained model?

I think that this line is what is stopping a user from creating a custom model. I don't think the referenced variable is available in the scope that it is...

I can't find the evaluation performance in the readme. Do you have it written somewhere?

For some reason, the output of the B_16 model is all zeros. I tested B_32 and L_32, and they seem to work properly.

Hi, I was wondering if there would be a way to load the weights from google's saved checkpoint directly, instead of having to download them. I see that in the...

I was trying to load the whole model with pretrained=True and representation layer = True, but I get an error. Further inspection by looking at the keys of the state_dict...
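When a state_dict fails to load like this, it usually helps to diff the checkpoint keys against the model's keys before deciding how to proceed. A small helper one could use for that (my own sketch, not part of this repo):

```python
import torch

def report_state_dict_mismatch(model, state_dict):
    """Return (unexpected, missing): keys in the checkpoint but not the
    model, and keys the model expects but the checkpoint lacks."""
    model_keys = set(model.state_dict().keys())
    ckpt_keys = set(state_dict.keys())
    return sorted(ckpt_keys - model_keys), sorted(model_keys - ckpt_keys)

# If the mismatch is benign (e.g. an extra representation layer), loading with
# model.load_state_dict(state_dict, strict=False) skips the mismatched keys
# instead of raising an error.
```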