pottan-ocr
Clean up dataset generation using the Augmentor Python module
We should investigate whether https://github.com/Sanster/text_renderer could be used.
text_renderer uses Pillow to render text. Pillow does not support complex scripts unless libraqm is available.
Also, with Pillow, a glyph that is missing from the font file is rendered as an empty square box.
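Both Pillow caveats can be checked before any images are generated. The following is only a rough sketch, not part of pottan-ocr: the helper names `raqm_available`, `font_codepoints` and `missing_codepoints` are made up for illustration, and reading the font's character map assumes the third-party fontTools package is installed.

```python
def raqm_available():
    """Return True if the installed Pillow was built with libraqm support."""
    try:
        from PIL import features  # Pillow is assumed here, it is not stdlib
        return bool(features.check("raqm"))
    except (ImportError, ValueError):
        # Pillow is missing, or too old to know about the "raqm" feature.
        return False


def font_codepoints(font_path):
    """Return the set of Unicode code points mapped by a font file.

    Assumption: fontTools is available; it is imported lazily so the other
    helpers work without it.
    """
    from fontTools.ttLib import TTFont

    font = TTFont(font_path)
    try:
        # getBestCmap() returns {codepoint: glyph_name}, or None if the
        # font has no Unicode character map.
        return set(font.getBestCmap() or {})
    finally:
        font.close()


def missing_codepoints(text, supported_codepoints):
    """List characters in `text` that the font cannot map, in first-seen order.

    Whitespace is skipped because it needs no visible glyph.
    """
    missing = []
    for ch in text:
        if ch.isspace():
            continue
        if ord(ch) not in supported_codepoints and ch not in missing:
            missing.append(ch)
    return missing
```

A dataset generator could skip any (text, font) pair for which `missing_codepoints(text, font_codepoints(path))` is non-empty, and refuse to render complex scripts when `raqm_available()` is false. Note that character-map coverage is a necessary check only: for complex scripts, correct conjuncts also depend on the font's shaping tables, which this sketch does not inspect.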
Augmentor is not specific to text, so handling long lines of text is a bit difficult with it.