Any plans to add the ability to embed the extracted text back into the PDF?

Open npomfret opened this issue 2 years ago • 2 comments

May 31 '23 17:05 npomfret

I would also welcome this feature. I currently use ocrmypdf, but I think that the Apple OCR works better for the languages it has.

Jun 09 '23 20:06 grantbarrett

I'm also scoping out how best to incorporate this. I've successfully hacked ocrmypdf to work with custom inputs before, but that route might mean converting textra into a Python package that links against a precompiled Swift library. Figuring that out and reworking the library would take a while.

The other approach would be to translate ocrmypdf's approach into Swift for textra. But working with PDFs is very, very brittle, so using a library that has already demonstrated success is the safer option.
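
For a sense of what the custom-input route might involve, here is a rough sketch in Python. It assumes the OCR results arrive as (text, box) pairs with normalized boxes whose origin is at the bottom-left (the convention Apple's Vision framework uses) and converts them to hOCR, where boxes are integer pixels with the origin at the top-left. The function names `to_hocr_bbox` and `words_to_hocr` are hypothetical and are not part of textra or ocrmypdf; a real pipeline would probably also need to group words into `ocr_line` elements.

```python
from html import escape


def to_hocr_bbox(box, page_width, page_height):
    """Convert a normalized, bottom-left-origin box to hOCR pixel coordinates.

    box is (x, y, width, height) with values in 0..1 (assumed input shape).
    Returns (x0, y0, x1, y1) in pixels with the origin at the top-left.
    """
    x, y, w, h = box
    x0 = round(x * page_width)
    x1 = round((x + w) * page_width)
    # Flip the vertical axis: the box's top edge sits at y + h.
    y0 = round((1 - (y + h)) * page_height)
    y1 = round((1 - y) * page_height)
    return x0, y0, x1, y1


def words_to_hocr(words, page_width, page_height):
    """Render (text, box) pairs as a minimal single-page hOCR fragment."""
    spans = []
    for i, (text, box) in enumerate(words, start=1):
        x0, y0, x1, y1 = to_hocr_bbox(box, page_width, page_height)
        spans.append(
            f"<span class='ocrx_word' id='word_{i}' "
            f"title='bbox {x0} {y0} {x1} {y1}'>{escape(text)}</span>"
        )
    page = f"<div class='ocr_page' title='bbox 0 0 {page_width} {page_height}'>"
    return "\n".join([page, *spans, "</div>"])


if __name__ == "__main__":
    # Hypothetical sample data, not real OCR output.
    sample = [("Hello", (0.25, 0.5, 0.5, 0.25))]
    print(words_to_hocr(sample, 1000, 2000))
```

The coordinate flip is the part that is easy to get wrong; everything downstream of the hOCR (rendering an invisible text layer into the PDF) is what an existing library would handle.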

Jun 10 '23 15:06 freedmand