T2T-ViT icon indicating copy to clipboard operation
T2T-ViT copied to clipboard

ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet

Results 22 T2T-ViT issues
Sort by recently updated
recently updated
newest added

I see that the entire main.py file also does not change arg.num_gpu to a value greater than 1. Under what circumstances does this value change to a value greater than...

Hi, Thank you for this repo! It is really helpful. However, we fail to obtain the reported MACs for performer-based models (T2T-ViT-7/10/12). **Importantly, we found a strange phenomenon.** Both the...

thanks for your nice work! I have some issue following the repo and run the code for transfer learning. I get the pretrain model for cifar10 from the link and...

hi, would you mind releasing the training log for T2t-vit-t-14 training with 8 GPUs? I tried to rerun the script for training T2t-vit-t-14 with 8 GPUs. It gained 0.094 for...

I've tried several tools, and downloaded all the files just to see if maybe the first one was corrupt, but every file I try I get this error (from multiple...

https://github.com/yitu-opensource/T2T-ViT/blob/main/models/token_performer.py#L18 My code has turned on fp16, so the 1e-8 on this line to prevent division by 0 is not enough for my code... the loss of the network calculation...

Have you compaired the inference speed of T2T-ViT and resnet? At the same accuracy, which famaliy of models achieve higher FPS?

![image](https://user-images.githubusercontent.com/80735955/146641232-2be69e47-bc73-4b19-b24a-294ff0c6facf.png) When I want to use the pretrained model, I got a problem likes this. Could you tell me how to slove it? Thanks!

When I tried to train t2t-vit-14 with hyperparameter that was released, I had a NaN loss problem. After AMP was turned off, loss became stable. But, I want to know...