unilm
unilm copied to clipboard
Can kosmos-2 be finetuning with paires of Chinese text and image?
I am interested in Kosmos-2 and appreciate it. I want finetune kosmos-2 on my dataset which included images and Chinese texts, but I couldn't do it successfully. So I tried to call Kosmos2Processor on a text, I found it could not be decoded.