LLaVA add ocr vqa images

Most of downloading urls of images in ocr_vqa dataset are no longer available. Everyone has to rerun the downloading script to get a small portion of ocr_vqa images in the LLaVA-v1.5-665k instruction dataset. I zip all images from original release into a zip file. Everyone can easily download it and unzip to their path of ./ocr_vqa/images

Apr 26 '24 03:04 Victorwz

You are a legend

Apr 27 '24 00:04 SamuelSchmidgall

Most of downloading urls of images in ocr_vqa dataset are no longer available. Everyone has to rerun the downloading script to get a small portion of ocr_vqa images in the LLaVA-v1.5-665k instruction dataset. I zip all images from original release from https://huggingface.co/datasets/howard-hou/OCR-VQA into a zip file. Everyone can easily download it and unzip to their path of ./ocr_vqa/images

It's parquet, not jpg, can not use to train directly

May 01 '24 04:05 hellangleZ

Most of downloading urls of images in ocr_vqa dataset are no longer available. Everyone has to rerun the downloading script to get a small portion of ocr_vqa images in the LLaVA-v1.5-665k instruction dataset. I zip all images from original release from https://huggingface.co/datasets/howard-hou/OCR-VQA into a zip file. Everyone can easily download it and unzip to their path of ./ocr_vqa/images

It's parquet, not jpg, can not use to train directly

You misunderstand my pull request. Please check the changed readme file. The downloading link for the ocr_vqa images are https://huggingface.co/datasets/weizhiwang/llava_v15_instruction_images/resolve/main/ocr_vqa_images_llava_v15.zip?download=true. The mentioned link is the original release.

May 01 '24 04:05 Victorwz

Most of downloading urls of images in ocr_vqa dataset are no longer available. Everyone has to rerun the downloading script to get a small portion of ocr_vqa images in the LLaVA-v1.5-665k instruction dataset. I zip all images from original release from https://huggingface.co/datasets/howard-hou/OCR-VQA into a zip file. Everyone can easily download it and unzip to their path of ./ocr_vqa/images

It's parquet, not jpg, can not use to train directly

You misunderstand my pull request. Please check the changed readme file. The downloading link for the ocr_vqa images are https://huggingface.co/datasets/weizhiwang/llava_v15_instruction_images/resolve/main/ocr_vqa_images_llava_v15.zip?download=true. The mentioned link is the original release.

Dude, You are a Legend

May 01 '24 07:05 hellangleZ

hero!

May 05 '24 09:05 yanghu819

Life saver!

Jul 16 '24 07:07 CuriousCat-7

Super hero!

Jul 22 '24 05:07 passing2961

Most of downloading urls of images in ocr_vqa dataset are no longer available. Everyone has to rerun the downloading script to get a small portion of ocr_vqa images in the LLaVA-v1.5-665k instruction dataset. I zip all images from original release from https://huggingface.co/datasets/howard-hou/OCR-VQA into a zip file. Everyone can easily download it and unzip to their path of ./ocr_vqa/images

It's parquet, not jpg, can not use to train directly

You misunderstand my pull request. Please check the changed readme file. The downloading link for the ocr_vqa images are https://huggingface.co/datasets/weizhiwang/llava_v15_instruction_images/resolve/main/ocr_vqa_images_llava_v15.zip?download=true. The mentioned link is the original release.

@Victorwz 您好，请问这个数据现在没法下载了吗？

Jul 29 '24 07:07 yhl41001

你点一下链接应该直接可以下载的，我刚点了一下没问题

Jul 29 '24 07:07 Victorwz

Thanks a lot!

Jul 31 '24 20:07 redagavin

LLaVA LLaVA copied to clipboard

add ocr vqa images

LLaVA
LLaVA copied to clipboard