naps2 icon indicating copy to clipboard operation
naps2 copied to clipboard

Translate PDF while scanning / during OCR (ideally maintaining layout)

Open sschuberth opened this issue 6 months ago • 1 comments

Is your feature request related to a problem? Please describe.

I would be nice to translate a scanned document after OCR.

Describe the solution you'd like

There could be an option to translate selected pages, adding text in more languages than what the OCR text is.

Describe alternatives you've considered

Extracting OCT text and running a third-party translation service is less convenient.

Additional context

Actually, it would be great is also the original layout of the scanned document / PDF could be maintained while translating, basically creating the appearance as if the original paper document was in another language in the first place.

sschuberth avatar May 11 '25 11:05 sschuberth

Some (maybe) helpful projects in this context:

  • https://github.com/aws-samples/amazon-translate-pdf
  • https://github.com/phkhanhtrinh23/translation_layoutrecovery

sschuberth avatar May 11 '25 11:05 sschuberth