Are other downstream tasks available? For example, Visual Reasoning, which requires the model to predict whether a sentence describes a pair of images.

Open fansticOne opened this issue 1 year ago • 1 comment

fansticOne avatar Feb 06 '24 12:02 fansticOne

The Owl series supports multi-image input. You can build a downstream pipeline by passing a list of images and placing the same number of "<|image|>" placeholders in your prompt.
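A minimal sketch of the prompt side of that idea, for a pair-of-images task, might look like the following. It only builds the text prompt and checks that the placeholder count matches the image list; it does not call the model. The helper names (`build_pair_prompt`, `check_placeholders`), the question wording, and the file paths are assumptions for illustration and are not part of the mPLUG-Owl code base. The surrounding conversation template the model expects is not shown here and should be taken from the repository's own inference examples.

```python
IMAGE_TOKEN = "<|image|>"  # placeholder string quoted in the reply above


def build_pair_prompt(sentence: str, num_images: int = 2) -> str:
    """Hypothetical helper: one placeholder per image, then a yes/no question."""
    if num_images < 1:
        raise ValueError("need at least one image")
    placeholders = IMAGE_TOKEN * num_images
    # The question wording is an assumption, not a prescribed template.
    question = (
        f'Does the sentence "{sentence}" correctly describe these images? '
        "Answer yes or no."
    )
    return placeholders + question


def check_placeholders(prompt: str, images: list) -> None:
    """Raise if the number of placeholders differs from the number of images."""
    found = prompt.count(IMAGE_TOKEN)
    if found != len(images):
        raise ValueError(
            f"prompt has {found} placeholder(s) but {len(images)} image(s) were given"
        )


# Hypothetical image paths for one left/right pair.
images = ["left.jpg", "right.jpg"]
prompt = build_pair_prompt("Both images show a dog.", num_images=len(images))
check_placeholders(prompt, images)
```

Checking the count before calling the model catches the most likely mistake early: a prompt whose number of placeholders does not match the number of images passed alongside it.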

LukeForeverYoung avatar Feb 07 '24 07:02 LukeForeverYoung