0xTaki.eth
> @KeremTurgutlu I ran into the same question when I wanted to train CLIP with 8 GPUs. I found that if my total batch size is 128, then PyTorch will split...
I tried running the CIFAR100 test following the `Zero-Shot Prediction` section of `README.md`, and I also can't reproduce the performance reported in the paper. Did you solve this problem?
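For anyone comparing numbers, here is a minimal sketch of how zero-shot top-1 accuracy can be computed once image and text features are available: normalize both, take the cosine similarity against each class's text feature, and pick the best match. The function names (`l2_normalize`, `zero_shot_predict`, `top1_accuracy`) are hypothetical, and the toy vectors are placeholders, not real CLIP outputs.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def zero_shot_predict(image_feature, text_features):
    """Return the index of the class whose text feature is most similar to the image."""
    img = l2_normalize(image_feature)
    sims = [
        sum(a * b for a, b in zip(img, l2_normalize(txt)))
        for txt in text_features
    ]
    return max(range(len(sims)), key=sims.__getitem__)


def top1_accuracy(image_features, labels, text_features):
    """Fraction of images whose predicted class index matches the label."""
    correct = sum(
        zero_shot_predict(feat, text_features) == label
        for feat, label in zip(image_features, labels)
    )
    return correct / len(labels)


# Toy placeholder data: two classes, three images (NOT real CLIP features).
text_features = [[1.0, 0.0], [0.0, 1.0]]
image_features = [[0.9, 0.1], [0.2, 0.8], [0.6, 0.4]]
labels = [0, 1, 1]

accuracy = top1_accuracy(image_features, labels, text_features)
print(f"top-1 accuracy: {accuracy:.3f}")
```

In a real run, the features would come from the model's image and text encoders, with one text feature per CIFAR100 class name.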
> This is mentioned in paper: "The base query list is all words occurring at least 100 times in the English version of Wikipedia. This is augmented with bi-grams...