Text-Image-Augmentation-python
Text-Image-Augmentation-python copied to clipboard
Python implementation of Text-Image-Augmentation
Text Image Augmentation
A general geometric augmentation tool for text images in the CVPR 2020 paper "Learn to Augment: Joint Data Augmentation and Network Optimization for Text Recognition". We provide the tool to avoid overfitting and gain robustness of text recognizers.
Note that this is a general toolkit. Please customize for your specific task. If the repo benefits your work, please cite the papers.
Requirements
Demo
- Distortion
- Stretch
- Perspective
Speed
To transform an image with size (H:64, W:200), it takes less than 14ms using a 2.5GHz CPU. It is possible to accelerate the process by calling multi-process batch samplers in an on-the-fly manner, such as setting "num_workers" in PyTorch.
Attention
Modify from https://github.com/Canjie-Luo/Text-Image-Augmentation.git.