handwriting-ocr icon indicating copy to clipboard operation
handwriting-ocr copied to clipboard

words extraction is not happening

Open ashishbasantani opened this issue 5 years ago • 5 comments

I cannot find words-final folder, & I am unable to extract data.

ashishbasantani avatar Jan 11 '19 14:01 ashishbasantani

I see that I forgot to include further details on how to extract the data. I guess that you already downloaded the data. Placed them according to the instructions. Then you have to go into src/data/ folder and run following instructions:

python data_extractor.py -d all
python data_normalization.py -d all
python data_create_sets.py --csv -d all

You can all also specify exact datasets for processing by replacing all with their names. For example:

python data_extractor.py -d iam cvl

There might still be some issues, so let me know if you encounter some other problem.

Breta01 avatar Jan 11 '19 15:01 Breta01

"import enchant error" as enchant is not available for 64 bit OS so what can be other option to install enchant

ashishbasantani avatar Jan 11 '19 17:01 ashishbasantani

What did you try to install? You should run: pip install pyenchant.

I just found out, that it's no longer under maintenance. I will try to replace it.

Breta01 avatar Jan 11 '19 18:01 Breta01

Getting error while executing OCR.ipynb file with MODEL_LOC = '../models/word-clas/' + LANG + '/CTC/Classifier2' but when i executed with MODEL_LOC = '../models/char-clas/' + LANG + '/CharClassifier' it got executed without any error.

I made this change also "CHARACTER_MODEL = Model(MODEL_LOC,'word_prediction')" instead of CHARACTER_MODEL = Model(MODEL_LOC)

"TypeError: Cannot interpret feed_dict key as Tensor: The name 'x:0' refers to a Tensor which does not exist. The operation, 'x', does not exist in the graph."

ashishbasantani avatar Jan 16 '19 09:01 ashishbasantani

Yes, it is because CTC model requires little bit different inputs. OCR notebook right now works only with char classifier. Take a look into ocr_evaluator.ipynb at class WordCycler(). I hope you will be able to figure out neccesseray changes.

Once I have some time, I will extend the OCR notebook to support all models.

Breta01 avatar Jan 18 '19 20:01 Breta01