PaddleOCR
PaddleOCR copied to clipboard
Integration to ocrmypdf
PaddleOCR seems to be very nice way to OCR documents.
There is project called ocrmypdf https://github.com/ocrmypdf/OCRmyPDF which has plugin system, where HOCR -compliant OCR engines can be integrated (it is using currenctly Tesseract as OCR engine).
It would be nice to use PaddleOCR as OCR engine on ocrmypdf.