AlphaCLIP
AlphaCLIP copied to clipboard
Captions in GRIT
Thank you for your work.
Considering the captions in the GRIT dataset consist solely of noun words like berries, person ... Did you use Templates to expand the captions, such as "a photo of a xxx"?