Any plans to add the ability to embed the extracted text back into the PDF?
I would also welcome this feature. I currently use ocrmypdf, but I think Apple's OCR works better for the languages it supports.
I'm also scoping out how best to incorporate this. I've successfully hacked ocrmypdf before to work with custom inputs, but doing that here might mean converting textra into a Python package that links against a precompiled Swift library. Figuring that out and reworking the library would take a while.
The other approach would be to port ocrmypdf's approach to Swift inside textra. But working with PDFs is very brittle, so building on a library that has already demonstrated success is the safer option.
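
For anyone curious what the "custom inputs" route might involve, here is a rough sketch of one small piece: mapping OCR bounding boxes into hOCR-style coordinates. It assumes the boxes arrive the way Apple's Vision framework reports them (normalized to 0..1 with a lower-left origin) and that hOCR is the intermediate format, which is my assumption and not something decided in this thread. The function names `vision_box_to_hocr_bbox` and `hocr_line` are hypothetical and are not part of textra or ocrmypdf.

```python
from html import escape


def vision_box_to_hocr_bbox(box, page_width_px, page_height_px):
    """Convert a normalized, lower-left-origin box to hOCR pixel coordinates.

    box is (x, y, width, height) with every value in the range 0..1.
    Returns (x0, y0, x1, y1) with a top-left origin, as hOCR expects.
    """
    x, y, w, h = box
    x0 = round(x * page_width_px)
    x1 = round((x + w) * page_width_px)
    # Flip the vertical axis: the top edge of the box sits at y + h.
    y0 = round((1.0 - (y + h)) * page_height_px)
    y1 = round((1.0 - y) * page_height_px)
    return x0, y0, x1, y1


def hocr_line(text, box, page_width_px, page_height_px):
    """Render one recognized line as an hOCR ocr_line span (hypothetical helper)."""
    x0, y0, x1, y1 = vision_box_to_hocr_bbox(box, page_width_px, page_height_px)
    return (
        f"<span class='ocr_line' title='bbox {x0} {y0} {x1} {y1}'>"
        f"{escape(text)}</span>"
    )
```

This only covers the coordinate flip and text escaping. Actually writing an invisible text layer into the PDF is the brittle part mentioned above, and this sketch leaves that to an existing library.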