SUGG: Font Recognition and Font Similarity Learning

Open ghost opened this issue 7 years ago • 0 comments

As OCR development moves towards LSTM, new difficulties arise, such as ending up with separate lstm/traineddata models for different fonts.

Example situations:

  • I have created multiple lstm/traineddata models, each for a different font and style of writing, and want to choose the most suitable one.
  • Or I just want to identify the font in an image, along with similar fonts, without running a full OCR.
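
Until something like this exists, the first situation can be approximated by running every model on the image and keeping the one with the highest mean word confidence. This is only a rough sketch: it assumes the `tesseract` CLI is on the PATH and that its `tsv` output config is available, the helper names are made up for illustration, and confidences from different models are not guaranteed to be comparable, so treat the score as a heuristic.

```python
import csv
import io
import subprocess


def mean_confidence(tsv_text):
    """Average the `conf` column of Tesseract TSV output over recognized words."""
    reader = csv.DictReader(
        io.StringIO(tsv_text), delimiter="\t", quoting=csv.QUOTE_NONE
    )
    confs = []
    for row in reader:
        try:
            conf = float(row["conf"])
        except (KeyError, TypeError, ValueError):
            continue
        # Page, block and line rows carry conf == -1 and no text; skip them.
        if conf >= 0 and (row.get("text") or "").strip():
            confs.append(conf)
    return sum(confs) / len(confs) if confs else 0.0


def run_tesseract_tsv(image_path, model):
    """Run the tesseract CLI with one traineddata model and return its TSV text."""
    result = subprocess.run(
        ["tesseract", image_path, "stdout", "-l", model, "tsv"],
        capture_output=True, text=True, check=True,
    )
    return result.stdout


def pick_best_model(image_path, models, run=run_tesseract_tsv):
    """Return (model, score) for the model with the highest mean word confidence."""
    scores = {m: mean_confidence(run(image_path, m)) for m in models}
    best = max(scores, key=scores.get)
    return best, scores[best]
```

For example, `pick_best_model("sample.png", ["font_a", "font_b"])` would try two hypothetical traineddata models named `font_a` and `font_b`. The cost grows with the number of models, which is part of why a dedicated font classifier would be more practical.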

I suggest adding a feature for training deconvolutional networks to identify the font type in an image and to suggest similar font matches.

Demonstration: Adobe DeepFont is an example of such an implementation.

Related research: A Sequence Learning Approach for Multiple Script Identification

The result: I will know the name of the font in an image, along with similar fonts that match it.
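
The "similar fonts" part of that result could be a nearest-neighbour lookup over per-font feature vectors. A minimal sketch, assuming some trained network (which does not exist in Tesseract today) produces a fixed-length embedding for an image; the font names and vectors below are invented purely for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_fonts(query, font_embeddings, top_k=3):
    """Return the top_k (font_name, similarity) pairs, most similar first."""
    scored = [
        (name, cosine_similarity(query, vec))
        for name, vec in font_embeddings.items()
    ]
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored[:top_k]


# Hypothetical embeddings; a real system would compute these from font samples.
fonts = {
    "SerifA": [0.9, 0.1, 0.0],
    "SerifB": [0.8, 0.2, 0.1],
    "SansA": [0.1, 0.9, 0.2],
}
matches = rank_fonts([1.0, 0.1, 0.0], fonts, top_k=2)
```

The first entry in `matches` would be reported as the detected font and the remaining entries as the similar-font suggestions.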

Thank you all for your contributions: @theraysmith @tmbdev @mittagessen @zuphilip @zdenop @stweil @amitdo @Shreeshrii @kba

ghost, Jun 19 '17 03:06