examples vision-transformer problem report

Open ChenDaiwei-99 opened this issue 2 years ago • 4 comments

Aug 24 '23 02:08 ChenDaiwei-99

If you want to contribute some fixes, I'd be happy to merge them.

Sep 06 '23 18:09 msaroufim

Sure, will do it in a few days :)

Sep 06 '23 19:09 ChenDaiwei-99

I can confirm this example is badly broken. I added some code to compare individual labels to predictions and discovered that the forward pass of ViT always returns the same tensor, no matter the input. The tensor it returns is different each time I run it, even if I load the same weights from the save file and don't do any training. It's no wonder it can't do better than 2.3: always giving the same prediction will hit the correct label only about as often as random guessing.

Oct 28 '23 01:10 colinosterman
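
For anyone who wants to reproduce the check described in the comment above, here is a minimal sketch. It is not the commenter's code: the helper names (`max_output_spread`, `is_constant_model`, `chance_level_cross_entropy`) and the two stand-in forward functions are hypothetical. It also assumes that the "2.3" refers to a cross-entropy loss on a 10-class dataset, which the thread does not state explicitly.

```python
import math
from typing import Callable, List, Sequence


def max_output_spread(
    forward: Callable[[Sequence[float]], Sequence[float]],
    inputs: Sequence[Sequence[float]],
) -> float:
    """Largest per-element difference between outputs over all inputs."""
    outputs: List[List[float]] = [list(forward(x)) for x in inputs]
    spread = 0.0
    # zip(*outputs) groups the i-th output element of every input together.
    for column in zip(*outputs):
        spread = max(spread, max(column) - min(column))
    return spread


def is_constant_model(forward, inputs, tol: float = 1e-6) -> bool:
    """True if the model returns (nearly) the same output for every input."""
    return max_output_spread(forward, inputs) <= tol


def chance_level_cross_entropy(num_classes: int) -> float:
    """Cross-entropy of a uniform prediction over num_classes classes."""
    return math.log(num_classes)


# Hypothetical stand-ins for a real model's forward pass.
def broken_forward(x):
    return [0.1, -0.3, 0.7]  # ignores its input entirely


def healthy_forward(x):
    return [sum(x), max(x), min(x)]  # depends on its input


inputs = [[0.0, 1.0, 2.0], [3.0, -1.0, 0.5], [9.0, 9.0, 9.0]]
print(is_constant_model(broken_forward, inputs))   # True
print(is_constant_model(healthy_forward, inputs))  # False
# Assumption: 10 classes. ln(10) is about 2.3026.
print(round(chance_level_cross_entropy(10), 4))    # 2.3026
```

To apply this to a real PyTorch model, one option is to pass something like `lambda x: model(x).flatten().tolist()` as `forward`, after calling `model.eval()` and inside `torch.no_grad()`, so that dropout and similar layers do not add noise to the comparison. A loss stuck at roughly `ln(num_classes)` is what a model that ignores its input would be expected to produce.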

I added a printout of the accuracy to the code, but during training the accuracy does not improve and the loss does not converge, even after many epochs. Does this model really work? I think it has serious problems. I hope to get an answer; this is very important to me.

May 13 '24 07:05 fanqieguo
