LLaVA
LLaVA copied to clipboard
[Question] TextVQA’s OCR
Question
In ./playground/data/eval/textvqa/llava_textvqa_val_v051_ocr.jsonl, the "text" part of each piece of data contains the Reference OCR token content. May I ask where this part of OCR is obtained from?
Hi! I also have the same question.