LLaVA icon indicating copy to clipboard operation
LLaVA copied to clipboard

llava_v1_5_mix665k dataset

Open taltlusty opened this issue 1 year ago • 5 comments

Describe the issue

Hello Looking at the dataset list, which dataset does the prompts with an empty model belong to? For example:

"id": "wgByO4Y_0", "model": "",

Thanks

taltlusty avatar Nov 21 '23 08:11 taltlusty

@taltlusty Where did you get this dataset from? Didn't find in playground/data.

aneet-javis avatar Nov 22 '23 07:11 aneet-javis

Thanks @aneet-javis This is the published dataset for finetuning: https://huggingface.co/datasets/liuhaotian/LLaVA-Instruct-150K/blob/main/llava_v1_5_mix665k.json

taltlusty avatar Nov 22 '23 08:11 taltlusty

Thanks @aneet-javis This is the published dataset for finetuning: https://huggingface.co/datasets/liuhaotian/LLaVA-Instruct-150K/blob/main/llava_v1_5_mix665k.json

It seems that I have browsed it under other issues before: Adding some plain text Q&A from llava_v1_5_mix665k to his custom dataset (image based Q&A) can improve his fine-tuning effect.

CrazyBrick avatar Nov 30 '23 13:11 CrazyBrick

the same. I also found that some ocr_vqa data do not exist in the downloaded data..........

zengxingchen avatar Feb 28 '24 12:02 zengxingchen

the same. some ocr_vqa data do not exits

421zuoduan avatar Apr 24 '24 03:04 421zuoduan