HandwrittenTextRecognition_MXNet icon indicating copy to clipboard operation
HandwrittenTextRecognition_MXNet copied to clipboard

an issues about your ocr data iteration

Open zzdang opened this issue 6 years ago • 5 comments

Hi,your project is cool ,but your OCR_LSTM_CTC's data iteration is very slow? Could you update it? Thank you very much

zzdang avatar Jul 03 '18 08:07 zzdang

Hi,

Would it be possible for you to include some clarifications? Which data iteration?

jonomon avatar Jul 03 '18 15:07 jonomon

Your "Data Loading" module isn't iterative, you load image data and labels first time. If the training dataset is big,the "images_data" in your "data loading" is hard to handle,the trainning will be very slow.......

zzdang avatar Jul 03 '18 16:07 zzdang

@zzdang, this is true, this was an acceptable trade off given the small size of the IAM dataset, and to get the ability to load pre-processed images quickly.

If your dataset is larger I would recommend using the ImageFolderDataset available in Gluon that would let you load each image only when necessary.

ThomasDelteil avatar Jul 03 '18 16:07 ThomasDelteil

@jonomon @ThomasDelteil @ThomasDelteil I am testing your project but I have an assertion error even though I put my email and password in the credentials.json . In the registration form, they ask for the email and not the username. I found this link in the project to have but it is not functional https://fki.tic.heia-fr.ch/DBs/iamDB/iLogin/index.php please can you help me

samar-smida avatar Apr 25 '22 10:04 samar-smida

You can download the dataset https://fki.tic.heia-fr.ch/databases/iam-handwriting-database.

jonomon avatar Apr 26 '22 04:04 jonomon