
vision-transformer problem report

Open ChenDaiwei-99 opened this issue 2 years ago • 4 comments

[three images; content not recoverable]

ChenDaiwei-99 avatar Aug 24 '23 02:08 ChenDaiwei-99

If you want to contribute some fixes, I'd be happy to merge them.

msaroufim avatar Sep 06 '23 18:09 msaroufim

Sure, will do it in a few days :)

ChenDaiwei-99 avatar Sep 06 '23 19:09 ChenDaiwei-99

I can confirm this example is badly broken. I added some code to compare individual labels to predictions and discovered that the forward pass of ViT always returns the same tensor, no matter the input. The tensor it returns is different each time I run it, even if I load the same weights from the save file and don't do any training. It's no wonder it can't do better than 2.3: always giving the same prediction should hit the correct label about as often as random guessing.
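For anyone who wants to reproduce this kind of check, here is a minimal sketch. It is not the code used in the comment above. It assumes the predicted class indices and the true labels have already been collected into plain Python lists (for example via `.tolist()` on the tensors); `summarize_predictions` and the sample data are hypothetical.

```python
from collections import Counter


def summarize_predictions(preds, labels):
    """Return (accuracy, number of distinct predicted classes)."""
    if len(preds) != len(labels) or not labels:
        raise ValueError("preds and labels must be non-empty and the same length")
    correct = sum(p == y for p, y in zip(preds, labels))
    distinct_classes = len(Counter(preds))
    return correct / len(labels), distinct_classes


# Hypothetical data: a balanced 10-class label set and a model that
# always predicts class 3, whatever the input is.
labels = [i % 10 for i in range(1000)]
preds = [3] * len(labels)

accuracy, distinct = summarize_predictions(preds, labels)
print(f"accuracy={accuracy:.2f}, distinct predicted classes={distinct}")
```

If the number of distinct predicted classes is 1, the model is ignoring its input, and on a balanced dataset its accuracy will sit at roughly one over the number of classes.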

colinosterman avatar Oct 28 '23 01:10 colinosterman

I added a printout of the accuracy to the code, but during training the accuracy does not improve and the loss does not converge, even after many epochs. Does this model really work? I think it has serious problems. I hope to get an answer; this is very important to me.
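One way to tell "not converging" apart from "stuck at chance" is to compare the logged loss with the cross-entropy of a uniform guess, which is ln(C) for C classes. The sketch below assumes the 2.3 mentioned earlier in the thread is a cross-entropy loss on a 10-class dataset; the thread does not state this. The helper names, the tolerance, and the loss values are all hypothetical.

```python
import math


def chance_level_loss(num_classes):
    """Cross-entropy of a uniform prediction over num_classes classes."""
    return math.log(num_classes)


def stuck_at_chance(loss_history, num_classes, tolerance=0.05):
    """True if every recorded loss stays within tolerance of the chance level."""
    baseline = chance_level_loss(num_classes)
    return bool(loss_history) and all(
        abs(loss - baseline) <= tolerance for loss in loss_history
    )


# Hypothetical per-epoch losses, not taken from an actual run.
losses = [2.31, 2.30, 2.30, 2.31]

print(round(chance_level_loss(10), 4))  # 2.3026
print(stuck_at_chance(losses, 10))      # True
```

A loss that never leaves this neighbourhood suggests the model's output carries no information about the input, which would be consistent with the constant-output behaviour reported above.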

fanqieguo avatar May 13 '24 07:05 fanqieguo