TextRecognitionDataGenerator icon indicating copy to clipboard operation
TextRecognitionDataGenerator copied to clipboard

A synthetic data generator for text recognition

Results 163 TextRecognitionDataGenerator issues
Sort by recently updated
recently updated
newest added

first of all I Would to thank you for this awesome project when use the arabic_reshaper repo when generating the the text for fix the Arabic shapes issue , i...

Add docstrings to TRDG to generate better automated references.

![image](https://user-images.githubusercontent.com/82756842/180180053-c7be619c-9f72-4b66-9ad5-4521f5d635ce.png)

colab code !mkdir /usr/local/lib/python3.7/dist-packages/trdg/fonts/kzdigits !cp /content/fonts/*.* /usr/local/lib/python3.7/dist-packages/trdg/fonts/kzdigits/ e.g. Verdana.ttf generator = GeneratorFromRandom( blur=0, random_blur=False, random_skew=True, language='kzdigits' ) print(generator.generator.fonts) #Verdana.ttf Sometimes the generator produces a valid image ![image](https://user-images.githubusercontent.com/24579937/179394479-8b2e1b14-20a0-4ba5-a91a-4070f116711c.png) Sometimes not ![image](https://user-images.githubusercontent.com/24579937/179394506-4708932f-ad3c-4a44-a582-3ab6ee3d21e5.png)...

bug

GeneratorFromStrings produces identical images every time random.seed(100) # doesn't help data = [] start = 200 for i in range(start, start + 3): data.append( str(i) + ' ' + '₸')...

to aid AI training for object localization. To generate a sample dataset, run: run.py -c 60 -b 100 --text_color black -num -let -id images -w 200 -w 300 -wd 600...

Added regex option for patterns like serial numbers or license plates

Added support for Yoruba Language Language Code = 'yo ![11507 Danpascu jẹ plánẹ tì kékeré ní ibi ìgbàjá ástẹ rọ ìdì_0](https://user-images.githubusercontent.com/38908008/136361842-28c456ca-93c0-40bb-9716-8895fb3ba337.jpg) ![Àyọkà yìí tàbí apá rẹ únfẹ àtúnṣe sí_1](https://user-images.githubusercontent.com/38908008/136361844-f46f01ce-adfe-4678-9595-b04dcd24150e.jpg) '

Farsi and Arabic is a right-left languages so we need to write its text in that way and this PR covers some report issues [#57](https://github.com/Belval/TextRecognitionDataGenerator/issues/57#issuecomment-811590185) and [#161](https://github.com/Belval/TextRecognitionDataGenerator/issues/161) to fixing and...

Hello, Your work is very amazing! But when I run command `trdg -l ja -c 10 -w 5` to synthesize 10 images with Japanese text, I have got 10 images...