
New branch for english corpus

Open ashwath98 opened this issue 5 years ago • 1 comments

Hey, I've been using your library to generate datasets for my OCR project. I think it would be useful to have a branch dedicated to English, with an English corpus file added to the data directory. While using your code I have also added some features, such as random subset selection (rendering a random part of the selected sentence instead of the whole sentence), which can be useful when training for applications where sentences are not all of a fixed length.
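A minimal sketch of what such random subset selection might look like, assuming the subset is a contiguous run of words taken from the chosen sentence. This is an illustration only, not the code from the fork described above: `random_word_span` and its parameters are hypothetical names and are not part of text_renderer.

```python
import random


def random_word_span(sentence, min_words=1, max_words=None, rng=None):
    """Return a random contiguous run of words from `sentence`.

    Hypothetical helper for illustration; not part of text_renderer.
    """
    # Use the caller's random.Random instance if given, else the module-level RNG.
    rng = rng if rng is not None else random
    words = sentence.split()
    if not words:
        return ""
    # Clamp the requested length bounds to what the sentence can provide.
    upper = len(words) if max_words is None else min(max_words, len(words))
    lower = min(max(1, min_words), upper)
    # Pick the span length first, then a start index that keeps it in range.
    length = rng.randint(lower, upper)
    start = rng.randint(0, len(words) - length)
    return " ".join(words[start:start + length])


if __name__ == "__main__":
    line = "the quick brown fox jumps over the lazy dog"
    # Each call yields a span of 2 to 5 words, so rendered lengths vary.
    for _ in range(3):
        print(random_word_span(line, min_words=2, max_words=5))
```

Splitting on whitespace suits English; a corpus without word delimiters would need the same idea applied to characters instead.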

Do you suggest I make a Pull Request containing these features?

ashwath98 avatar Jan 04 '20 08:01 ashwath98


Could you send your English version to my mailbox? I want to use it for study, not for commercial use. Thank you! My email: [email protected]

starry-xin avatar Feb 24 '22 13:02 starry-xin