Edouard Belval comments

Results 293 comments of


                                            Edouard Belval

Corrupt symbols in dict files (de)

Should be fixed by #174. Can you confirm?

who would like to share some chinese ttf fonts?

If you find any open source Chinese fonts, I'd be willing to add them to the project.

can it create chinese handwritten text?

Chinese script is not supported by the handwritten generative model. I'll add a more explicit explanation in the README.

can it create chinese handwritten text?

It's surprisingly hard to imitate because the original dataset dates from 2012 and used a method that isn't really common nowadays. I would look into GAN-based character generation like [ScrabbleGAN](https://openaccess.thecvf.com/content_CVPR_2020/papers/Fogel_ScrabbleGAN_Semi-Supervised_Varying_Length_Handwritten_Text_Generation_CVPR_2020_paper.pdf)...

Format of labels .txt is not as expected by easy OCR

You can use `--name_format 2` but that will give you spaces instead of csv. You could just make a code change here: https://github.com/Belval/TextRecognitionDataGenerator/blob/master/trdg/run.py#L473

projective

This project does not allow for affine/homography transformations. This is however a very good idea if you wish to make a PR. OpenCV has everything to do it, you could...

Masks for handwritten characters

The current version does not return a mask for handwritten text. I am not currently working on it so feel free to try and implement it.

Generate rendered text in PyTorch dataset

Your PyTorch dataset could hold the GeneratorFromStrings object and you can implement a lazy loading for the strings in your dataset. Since you dataset is only strings you could most...

Why use image height param 'format' as font size ?

I do agree that it ends up being a stupidly high value, the idea was just to keep the font size consistent so it wouldn't look terrible after up scaling....

OSError: [Errno 22] Invalid argument: 'out/megaphotographic Okarche angiotonase inevitableness it"ll_3955.jpg'

This is caused but the `"` in the filename. You can fix it by using a different output format with `-na 2`. Here is the description of the format parameter:...