fsdl-text-recognizer-project icon indicating copy to clipboard operation
fsdl-text-recognizer-project copied to clipboard

Dataset cant be accessed from S3 url

Open phospheneai opened this issue 5 years ago • 2 comments

all data set resides in cloud cant be retrieved , 403 forbidden error

phospheneai avatar Feb 26 '20 20:02 phospheneai

To solve the problem, Replace url at fsdl-text-recognizer-project/data/raw/emnist/metadata.toml with http://www.itl.nist.gov/iaui/vip/cs_links/EMNIST/matlab.zip

But It is EXTREMELY slow😂

yunh-kwak avatar Mar 06 '20 15:03 yunh-kwak

To avoid suffering those download speeds again, I backed up the byclass.h5 file generated under data/processed/mnist. You can download it from here, if it helps: https://mega.nz/#F!B6QFVQSI (decryption key 8ODxY82iUjLCbW3EUuiD0g)

Besides downloading the .mat file and creating the HDF5 file, the script also generates the JSON file lab2/text_recognizer/datasets/emnist_essentials.json, which I also uploaded to that folder.

sebastian-sosa avatar Mar 26 '20 22:03 sebastian-sosa