DaVinci
DaVinci copied to clipboard
some questions on evaluating
How can the model evaluate on GLEU tasks?The tasks are text-pure, but in the paper it said “Similar to PLM, when prefix image is none, this task will degenerate into “text-to-image generation” task, forcing the model to generate an image with the input caption”, so how can the model complete text-pure tasks?