naps2
Translate PDF while scanning / during OCR (ideally maintaining layout)
Is your feature request related to a problem? Please describe.
It would be nice to be able to translate a scanned document after OCR.
Describe the solution you'd like
There could be an option to translate selected pages, adding text in languages other than the language of the OCR text.
Describe alternatives you've considered
Extracting the OCR text and running it through a third-party translation service is less convenient.
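For illustration, the manual workaround looks roughly like the sketch below. This is only a sketch under assumptions: `translate_pages` and `fake_translate` are hypothetical names, not part of NAPS2, and the `translate` callable stands in for whatever third-party translation service would actually be used.

```python
from typing import Callable, Dict, Iterable, List


def translate_pages(
    ocr_pages: List[str],
    selected: Iterable[int],
    translate: Callable[[str], str],
) -> Dict[int, str]:
    """Translate the OCR text of the selected pages only.

    ocr_pages: OCR text per page, in page order (assumed already extracted).
    selected:  zero-based indices of the pages to translate.
    translate: hypothetical stand-in for a third-party translation call.
    """
    result: Dict[int, str] = {}
    for index in selected:
        if not 0 <= index < len(ocr_pages):
            raise IndexError(f"no such page: {index}")
        result[index] = translate(ocr_pages[index])
    return result


def fake_translate(text: str) -> str:
    # Placeholder only: upper-cases the text instead of translating it.
    return text.upper()


pages = ["first page", "second page", "third page"]
translated = translate_pages(pages, [0, 2], fake_translate)
```

Doing this by hand means exporting the text, calling the service, and then getting the result back next to the right pages, which is the inconvenient part a built-in option would remove.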
Additional context
Actually, it would be great if the original layout of the scanned document / PDF could also be maintained while translating, basically creating the appearance that the original paper document had been written in another language in the first place.
Some (maybe) helpful projects in this context:
- https://github.com/aws-samples/amazon-translate-pdf
- https://github.com/phkhanhtrinh23/translation_layoutrecovery