LLaVA
LLaVA copied to clipboard
llava_v1_5_mix665k dataset
Describe the issue
Hello Looking at the dataset list, which dataset does the prompts with an empty model belong to? For example:
"id": "wgByO4Y_0", "model": "",
Thanks
@taltlusty Where did you get this dataset from? Didn't find in playground/data.
Thanks @aneet-javis This is the published dataset for finetuning: https://huggingface.co/datasets/liuhaotian/LLaVA-Instruct-150K/blob/main/llava_v1_5_mix665k.json
Thanks @aneet-javis This is the published dataset for finetuning: https://huggingface.co/datasets/liuhaotian/LLaVA-Instruct-150K/blob/main/llava_v1_5_mix665k.json
It seems that I have browsed it under other issues before: Adding some plain text Q&A from llava_v1_5_mix665k to his custom dataset (image based Q&A) can improve his fine-tuning effect.
the same. I also found that some ocr_vqa data do not exist in the downloaded data..........
the same. some ocr_vqa data do not exits