0xTaki.eth

Results 3 comments of 0xTaki.eth

> @KeremTurgutlu I found the same question when I want to train CLIP with 8 GPUs. I found that if my total batch size is 128, then pytorch will split...

I tried run testing on CIFAR100 following by `README.md (Zero-Shot Prediction)` and also can't achieve the performance in paper. did you solve this problem?

> This is mentioned in paper: "The base query list is all words occurring at least 100 times in > the English version of Wikipedia. This is augmented with bi-grams...