ViLT icon indicating copy to clipboard operation
ViLT copied to clipboard

inference image captioning

Open trucvip123 opened this issue 3 years ago • 2 comments

who can do a demo for my image-captioning of ViLT. pleaseee!@! I'm a newbie in NLP field <33

trucvip123 avatar Nov 18 '21 03:11 trucvip123

Hi @trucvip123,

Though ViLT has not undergone a captioning fine-tuning, you can emulate the captioning by passing text query as [MASK] [MASK] [MASK] ... [MASK] [MASK] ([MASK] * your desired length) to MLM demo.

dandelin avatar Nov 22 '21 14:11 dandelin

Thank you @dandelin

trucvip123 avatar Nov 24 '21 12:11 trucvip123