BLIP
How to train ITM from ITC
Hi, suppose we have an already pretrained ITC model, for example OpenAI CLIP or BLIP-ITC. How can we add image-text matching (ITM) parameters on top of the ITC model to train ITM? Has anyone tried this?