vic
vic copied to clipboard
What is CLIP upper bound?How did you get the model?
Thank you for your work, I have some questions and hope you can answer them despite your busy schedule What is CLIP upper bound?How did you get the model?
We consider three main groups of baselines for our comparisons. The most straightforward baselines consist of using CLIP with large vocabularies, such as WordNet [41] (117k names) or the English Words (234k names [16]). As an upper bound, we also consider CLIP with the perfect vocabulary, i.e. the ground-truth names of the target dataset (CLIP upper bound). Due to lack of space, we only report results for CLIP with ViT-L [13].