T2T-ViT
T2T-ViT copied to clipboard
ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
I see that the entire main.py file also does not change arg.num_gpu to a value greater than 1. Under what circumstances does this value change to a value greater than...
Hi, Thank you for this repo! It is really helpful. However, we fail to obtain the reported MACs for performer-based models (T2T-ViT-7/10/12). **Importantly, we found a strange phenomenon.** Both the...
thanks for your nice work! I have some issue following the repo and run the code for transfer learning. I get the pretrain model for cifar10 from the link and...
hi, would you mind releasing the training log for T2t-vit-t-14 training with 8 GPUs? I tried to rerun the script for training T2t-vit-t-14 with 8 GPUs. It gained 0.094 for...
I've tried several tools, and downloaded all the files just to see if maybe the first one was corrupt, but every file I try I get this error (from multiple...
https://github.com/yitu-opensource/T2T-ViT/blob/main/models/token_performer.py#L18 My code has turned on fp16, so the 1e-8 on this line to prevent division by 0 is not enough for my code... the loss of the network calculation...
Have you compaired the inference speed of T2T-ViT and resnet? At the same accuracy, which famaliy of models achieve higher FPS?
data:image/s3,"s3://crabby-images/5522c/5522c738030270809e4ff1c8b77a452ba53b4402" alt="image" When I want to use the pretrained model, I got a problem likes this. Could you tell me how to slove it? Thanks!
When I tried to train t2t-vit-14 with hyperparameter that was released, I had a NaN loss problem. After AMP was turned off, loss became stable. But, I want to know...