OCRmyPDF

After deskew, remove the grey frame or crop to A4?

Open Homer-Sim opened this issue 5 years ago • 5 comments

Hello,

I am running OCRmyPDF from the command line on freshly scanned files. Deskewing works fine, but it leaves a bit of the grey frame around the page. Is there a way to make that area plain white and crop the page to a clean rectangle? (Screenshots attached: 2020-04-01 12_33_20-Window, 2020-04-01 12_34_06-Window)

Homer-Sim avatar Apr 01 '20 10:04 Homer-Sim

One more question/feature request.

My scans contain several pages. Could it be that deskewing only works on the first one?

Homer-Sim avatar Apr 01 '20 12:04 Homer-Sim

Blanking out content on the edges like this is very hard to get right if you want it to work safely and reliably on thousands of pages without reviewing them all afterwards. Edge removal is better done by the scanner software, since it can tell where the page actually is. I realize these black edges are messy and rescanning is not usually an option.

Deskewing should be attempted on all pages. Some pages don't have a clear dominant skew angle, however. If you have a file that does not seem to deskew correctly, please share it.
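For anyone who wants to reproduce the behaviour on a multi-page file, here is a minimal sketch of a deskew run driven from Python. It assumes the `ocrmypdf` executable is on the PATH; the helper names and the file names are hypothetical and not taken from this thread. Only the `--deskew` option and the input/output positional arguments are used.

```python
import subprocess
from typing import List


def build_deskew_command(input_pdf: str, output_pdf: str) -> List[str]:
    """Build the argument list for an OCRmyPDF run with deskewing enabled.

    The file names are whatever the caller passes in; nothing here is
    specific to the files discussed in this issue.
    """
    return ["ocrmypdf", "--deskew", input_pdf, output_pdf]


def run_deskew(input_pdf: str, output_pdf: str) -> None:
    """Run OCRmyPDF and raise CalledProcessError if it exits with an error."""
    subprocess.run(build_deskew_command(input_pdf, output_pdf), check=True)
```

Calling `run_deskew("scan.pdf", "scan_deskewed.pdf")` (hypothetical file names) processes the whole file in one run, so comparing the pages of the output shows which ones were actually rotated.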

jbarlow83 avatar Apr 01 '20 23:04 jbarlow83

Can I share the file ONLY with you? I do not want to show it to everybody.

So you see no chance for black edges to be removed without scanner software?

Homer-Sim avatar Apr 02 '20 07:04 Homer-Sim

You can encrypt it with my public key as described here: https://github.com/jbarlow83/OCRmyPDF/wiki

That is the same key that is used to sign releases.

Let me put it this way. It's easy enough to get features like this right for 98-99% of pages. But an error rate of 1-2% per page is unacceptable at the kind of volume people use ocrmypdf for.
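To put rough numbers on that, here is a small sketch of the arithmetic. It assumes every page fails independently with the same probability, which is a simplification and not something measured on OCRmyPDF; the function names are made up for illustration.

```python
def expected_bad_pages(pages: int, error_rate: float) -> float:
    """Expected number of wrongly processed pages at a given per-page error rate."""
    return pages * error_rate


def prob_any_bad_page(pages: int, error_rate: float) -> float:
    """Probability that at least one page in a document is wrongly processed.

    Assumes page failures are independent, so the chance that every page
    is fine is (1 - error_rate) ** pages.
    """
    return 1.0 - (1.0 - error_rate) ** pages
```

Under that assumption, a 1% per-page error rate means about 100 damaged pages in a 10,000-page batch, and a single 100-page file has roughly a 63% chance of containing at least one damaged page.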

jbarlow83 avatar Apr 02 '20 07:04 jbarlow83

Any chance to send it by mail?

Homer-Sim avatar Apr 02 '20 11:04 Homer-Sim