mPLUG-Owl
Are other downstream tasks available? For example, visual reasoning, which requires the model to predict whether a sentence describes a pair of images.
The Owl series supports multiple image inputs. You can build the downstream pipeline by passing a list of images and placing the same number of "<|image|>" placeholders in your prompt.
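As a rough illustration of the placeholder rule, here is a minimal sketch in plain Python. It only builds and checks the prompt string and does not call the model. The helper names `build_multi_image_prompt` and `check_placeholder_count`, the question wording, and the file names are all hypothetical. The exact chat template your checkpoint expects may differ, so adapt the string to the template in the repo.

```python
IMAGE_TOKEN = "<|image|>"  # placeholder string named in the reply above


def build_multi_image_prompt(image_paths, statement):
    """Return a prompt with one image placeholder per input image.

    Hypothetical helper: the question wording is an assumption, not an
    official mPLUG-Owl template.
    """
    placeholders = "\n".join(IMAGE_TOKEN for _ in image_paths)
    question = (
        f'Does the sentence "{statement}" correctly describe the images above? '
        "Answer yes or no."
    )
    return f"{placeholders}\n{question}"


def check_placeholder_count(prompt, images):
    """Raise ValueError if the placeholder count differs from the image count."""
    found = prompt.count(IMAGE_TOKEN)
    if found != len(images):
        raise ValueError(
            f"prompt has {found} image placeholders but {len(images)} images were given"
        )


# Example: a pair of images for a visual reasoning query (paths are made up).
image_paths = ["left.jpg", "right.jpg"]
prompt = build_multi_image_prompt(image_paths, "Both images show a dog.")
check_placeholder_count(prompt, image_paths)
print(prompt)
```

Running the count check before you hand the images and prompt to the model's processor catches a mismatch early, which is otherwise easy to miss when prompts are assembled programmatically.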