Couldn't download PubTables-1M dataset

Open srcjun opened this issue 2 years ago • 7 comments

I'm stuck trying to download the PubTables-1M dataset. I followed the download instructions on this GitHub page, but the dataset is locked. (The situation is shown in the picture below.) Image1 Also, I couldn't log in to the page to download the dataset. (The error is shown in the picture below.) Image2 Please let me know how to solve this problem. Thanks.

srcjun avatar Nov 08 '22 08:11 srcjun

Me too. I also get "You do not have permission ..."

MaxKinny avatar Nov 11 '22 02:11 MaxKinny

I have the same problem. Does anyone have an example of what the labels in PubTables-1M-Image_Page_Detection_PASCAL_VOC.tar.gz (75.11 GB) look like? I would like to replicate this structure in my own dataset.

emigomez avatar Nov 11 '22 09:11 emigomez
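
For anyone with the same question about label structure: the archive name says the annotations are in PASCAL VOC format, which is normally one XML file per image with an `object` element for each bounding box. The sketch below parses a hand-written sample in that generic layout. The file name, image size, class name `table`, and coordinates are all hypothetical and were not taken from the actual archive, so check them against a real annotation file once the download works.

```python
import xml.etree.ElementTree as ET

# Hypothetical annotation, written by hand to show the generic PASCAL VOC layout.
# None of these values come from the PubTables-1M archive itself.
SAMPLE_XML = """<annotation>
  <filename>example_page.jpg</filename>
  <size><width>612</width><height>792</height><depth>3</depth></size>
  <object>
    <name>table</name>
    <bndbox>
      <xmin>50.5</xmin><ymin>100.0</ymin><xmax>560.25</xmax><ymax>400.0</ymax>
    </bndbox>
  </object>
</annotation>"""


def parse_voc(xml_text):
    """Return (image filename, [(class name, (xmin, ymin, xmax, ymax)), ...])."""
    root = ET.fromstring(xml_text)
    boxes = []
    for obj in root.iter("object"):
        bndbox = obj.find("bndbox")
        # Read the four corner coordinates in a fixed order.
        coords = tuple(
            float(bndbox.findtext(key)) for key in ("xmin", "ymin", "xmax", "ymax")
        )
        boxes.append((obj.findtext("name"), coords))
    return root.findtext("filename"), boxes
```

Writing your own dataset in the same layout means producing one such XML file per image, with one `object` entry per labeled box.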

Hey everyone, I am looking into this and reaching out to the team that manages the hosting for the data set. If there is no resolution soon I'll find another host for the data set and post an update. Please check back soon!

bsmock avatar Nov 14 '22 20:11 bsmock

I had the same issue. Has it been resolved? Tagging @bsmock for visibility.

abhayhk2001 avatar Nov 23 '22 05:11 abhayhk2001

Looks like the issue is not getting resolved, so I just uploaded the dataset here: https://huggingface.co/datasets/bsmock/pubtables-1m

Try this new link and let me know if you have any issues.

Cheers, Brandon

bsmock avatar Nov 23 '22 18:11 bsmock
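
For scripted downloads from the new link, here is a minimal sketch using only the Python standard library. It assumes the usual Hugging Face direct-download URL pattern (`/datasets/<repo>/resolve/<revision>/<file>`) and that the files live on the `main` branch. The helper names `resolve_url` and `download` are made up for illustration, and the archive names must be copied from the file list on the dataset page, since they are not confirmed in this thread.

```python
import shutil
import urllib.request
from pathlib import Path
from urllib.parse import quote

REPO_ID = "bsmock/pubtables-1m"  # dataset repo from the link above


def resolve_url(repo_id, filename, revision="main"):
    """Build the direct-download URL for one file in a Hugging Face dataset repo.

    Assumes the standard /resolve/<revision>/ URL layout and a 'main' branch.
    """
    return (
        f"https://huggingface.co/datasets/{repo_id}"
        f"/resolve/{revision}/{quote(filename)}"
    )


def download(repo_id, filename, dest_dir="."):
    """Stream one archive to disk in chunks instead of loading it into memory."""
    dest = Path(dest_dir) / filename
    dest.parent.mkdir(parents=True, exist_ok=True)
    with urllib.request.urlopen(resolve_url(repo_id, filename)) as resp:
        with open(dest, "wb") as out:
            shutil.copyfileobj(resp, out)
    return dest
```

Calling `download(REPO_ID, name)` with an archive name taken from the dataset page saves that file to the current directory. The archives are large, so a tool that can resume interrupted transfers may be a better fit on unreliable connections.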

Was anyone able to get the data set from the new link?

bsmock avatar Nov 28 '22 17:11 bsmock

Awesome! I can download the dataset from the new link. Is "PubTables-1M-Detection_Page_Words.tar.gz" not uploaded?

MaxKinny avatar Nov 29 '22 03:11 MaxKinny