Need some inspiration?
https://github.com/whitelok/image-text-localization-recognition https://github.com/qurator-spk/eynollah
I have definitely seen eynollah before (it's not fast enough as it is to integrate with tesseract) -- the others not so much. I've been preoccupied with a few other projects at work, but in a few weeks I hope to get back to this project, implement supporting compressing existing PDFs (for use with OCRMyPDF), and support the ocr_photo elements.
https://tel.archives-ouvertes.fr/tel-01221308/document
https://www.math.uni-sb.de/service/preprints/preprint269.pdf
https://arxiv.org/pdf/1712.08232.pdf
https://www.researchgate.net/publication/334130136_Compressing_Flow_Fields_with_Edge-aware_Homogeneous_Diffusion_Inpainting
https://github.com/RenYurui/StructureFlow https://github.com/topics/image-inpainting?l=matlab&o=asc&s=stars https://github.com/topics/inpainting?o=asc&s=updated
Wow! https://github.com/Djdefrag/QualityScaler