showstarpro

3 comments by showstarpro

Why does the jpeginfo tool report over 900 errors when I check the images from https://huggingface.co/datasets/ej2/llava-ocr-vqa? Also, most of their URLs point to GIFs.

![WeChat image_20241201163211](https://github.com/user-attachments/assets/19c92bd9-a5c8-4d46-9448-313f00cd2a75)
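One possible explanation for errors like these is that some files carry a `.jpg` extension but actually contain GIF data, which a JPEG checker would reject. The sketch below is only a rough way to test that guess: it reads the leading bytes of each `.jpg` file and reports those that do not start with the JPEG signature. The helper names `sniff_format` and `find_mislabeled` are hypothetical, and the directory path is assumed from the quoted comment in this thread, not confirmed for this dataset copy.

```python
from pathlib import Path

# Leading "magic" bytes of a few common image formats.
MAGIC = {
    b"\xff\xd8\xff": "jpeg",
    b"GIF87a": "gif",
    b"GIF89a": "gif",
    b"\x89PNG\r\n\x1a\n": "png",
}


def sniff_format(path):
    """Return the format implied by the file's first bytes, or 'unknown'."""
    with open(path, "rb") as f:
        head = f.read(8)
    for magic, name in MAGIC.items():
        if head.startswith(magic):
            return name
    return "unknown"


def find_mislabeled(image_dir):
    """List (filename, detected_format) for .jpg files that are not JPEG data."""
    mismatches = []
    for path in sorted(Path(image_dir).glob("*.jpg")):
        fmt = sniff_format(path)
        if fmt != "jpeg":
            mismatches.append((path.name, fmt))
    return mismatches


if __name__ == "__main__":
    # Assumed location of the images; adjust to your local layout.
    for name, fmt in find_mislabeled("playground/data/ocr_vqa/images"):
        print(name, fmt)
```

If most of the files that jpeginfo flags show up here as `gif`, the errors are probably a format mismatch and not corruption; files reported as `jpeg` that still fail would need a closer look.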

> Additionally, in this dataset, 1437717772.jpg seems to be corrupted and needs to be downloaded again:
>
> ```
> wget http://ecx.images-amazon.com/images/I/51YTH4k3fUL.jpg
> cp 51YTH4k3fUL.jpg playground/data/ocr_vqa/images/1437717772.jpg
> ```

Thank you...
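After replacing a file this way, a quick sanity check can help confirm the new copy is not truncated. The following is a minimal sketch, assuming a complete JPEG starts with the start-of-image marker `FF D8` and ends with the end-of-image marker `FF D9`. The function name `looks_like_complete_jpeg` is hypothetical, and the check is only a heuristic: some valid files carry extra bytes after the end marker and would be flagged here, and passing it does not prove the image decodes correctly.

```python
def looks_like_complete_jpeg(path):
    """Heuristic: file starts with the SOI marker and ends with the EOI marker."""
    with open(path, "rb") as f:
        data = f.read()
    # A JPEG needs at least the two 2-byte markers.
    if len(data) < 4:
        return False
    return data.startswith(b"\xff\xd8") and data.endswith(b"\xff\xd9")


if __name__ == "__main__":
    # Path taken from the quoted comment; adjust to your local layout.
    print(looks_like_complete_jpeg("playground/data/ocr_vqa/images/1437717772.jpg"))
```

A file that fails this check after a fresh download is likely truncated or not a JPEG at all.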

Thank you for sharing this excellent work. I am very interested in the dataset you mentioned. Would it be possible for the authors to provide access to this dataset, or...