Zafir Stojanovski
Zafir Stojanovski
@amyeroberts Doesn't `idefics2` already handle this? https://github.com/huggingface/transformers/blob/cf7bed98325a0be9d195cb6b66c6a0bef9fccbc8/src/transformers/models/idefics2/modeling_idefics2.py#L139-L149 For example, the following sample script: ```py import torch import requests from PIL import Image from transformers import Idefics2Processor, Idefics2ForConditionalGeneration device = torch.device("cuda"...
Opened a PR (#30722) addressing this issue for the BLIP family of models (BLIP, BLIP2, InstructBLIP).
@amyeroberts good point! the tests now also have a check for the textual content.
@haileyschoelkopf I believe this issue should be closed now.