LLaVA icon indicating copy to clipboard operation
LLaVA copied to clipboard

add ocr vqa images

Open Victorwz opened this issue 1 year ago • 10 comments

Most of downloading urls of images in ocr_vqa dataset are no longer available. Everyone has to rerun the downloading script to get a small portion of ocr_vqa images in the LLaVA-v1.5-665k instruction dataset. I zip all images from original release into a zip file. Everyone can easily download it and unzip to their path of ./ocr_vqa/images

Victorwz avatar Apr 26 '24 03:04 Victorwz

You are a legend

SamuelSchmidgall avatar Apr 27 '24 00:04 SamuelSchmidgall

Most of downloading urls of images in ocr_vqa dataset are no longer available. Everyone has to rerun the downloading script to get a small portion of ocr_vqa images in the LLaVA-v1.5-665k instruction dataset. I zip all images from original release from https://huggingface.co/datasets/howard-hou/OCR-VQA into a zip file. Everyone can easily download it and unzip to their path of ./ocr_vqa/images

It's parquet, not jpg, can not use to train directly

hellangleZ avatar May 01 '24 04:05 hellangleZ

Most of downloading urls of images in ocr_vqa dataset are no longer available. Everyone has to rerun the downloading script to get a small portion of ocr_vqa images in the LLaVA-v1.5-665k instruction dataset. I zip all images from original release from https://huggingface.co/datasets/howard-hou/OCR-VQA into a zip file. Everyone can easily download it and unzip to their path of ./ocr_vqa/images

It's parquet, not jpg, can not use to train directly

You misunderstand my pull request. Please check the changed readme file. The downloading link for the ocr_vqa images are https://huggingface.co/datasets/weizhiwang/llava_v15_instruction_images/resolve/main/ocr_vqa_images_llava_v15.zip?download=true. The mentioned link is the original release.

Victorwz avatar May 01 '24 04:05 Victorwz

Most of downloading urls of images in ocr_vqa dataset are no longer available. Everyone has to rerun the downloading script to get a small portion of ocr_vqa images in the LLaVA-v1.5-665k instruction dataset. I zip all images from original release from https://huggingface.co/datasets/howard-hou/OCR-VQA into a zip file. Everyone can easily download it and unzip to their path of ./ocr_vqa/images

It's parquet, not jpg, can not use to train directly

You misunderstand my pull request. Please check the changed readme file. The downloading link for the ocr_vqa images are https://huggingface.co/datasets/weizhiwang/llava_v15_instruction_images/resolve/main/ocr_vqa_images_llava_v15.zip?download=true. The mentioned link is the original release.

Dude, You are a Legend

hellangleZ avatar May 01 '24 07:05 hellangleZ

hero!

yanghu819 avatar May 05 '24 09:05 yanghu819

Life saver!

CuriousCat-7 avatar Jul 16 '24 07:07 CuriousCat-7

Super hero!

passing2961 avatar Jul 22 '24 05:07 passing2961

Most of downloading urls of images in ocr_vqa dataset are no longer available. Everyone has to rerun the downloading script to get a small portion of ocr_vqa images in the LLaVA-v1.5-665k instruction dataset. I zip all images from original release from https://huggingface.co/datasets/howard-hou/OCR-VQA into a zip file. Everyone can easily download it and unzip to their path of ./ocr_vqa/images

It's parquet, not jpg, can not use to train directly

You misunderstand my pull request. Please check the changed readme file. The downloading link for the ocr_vqa images are https://huggingface.co/datasets/weizhiwang/llava_v15_instruction_images/resolve/main/ocr_vqa_images_llava_v15.zip?download=true. The mentioned link is the original release.

@Victorwz 您好,请问这个数据现在没法下载了吗?

yhl41001 avatar Jul 29 '24 07:07 yhl41001

你点一下链接应该直接可以下载的,我刚点了一下没问题

Victorwz avatar Jul 29 '24 07:07 Victorwz

Thanks a lot!

redagavin avatar Jul 31 '24 20:07 redagavin