mPLUG-Owl other downstream tasks available? Like Visual Reasoning, requires the model to predict whether a sentence describes a pair of images

other downstream tasks available? Like Visual Reasoning, requires the model to predict whether a sentence describes a pair of images

Open fansticOne opened this issue 1 year ago • 1 comments

Feb 06 '24 12:02 fansticOne

Owl series support multiple images inputs. You can develop the downstream pipeline by passing a list of images and place the same number of "<|image|>" in your prompt.

Feb 07 '24 07:02 LukeForeverYoung

mPLUG-Owl mPLUG-Owl copied to clipboard

other downstream tasks available? Like Visual Reasoning, requires the model to predict whether a sentence describes a pair of images

mPLUG-Owl
mPLUG-Owl copied to clipboard