Multipage PDF support, Pass Image files cleanly

Open bosd opened this issue 2 years ago • 3 comments

Fixes: multi-page PDF support (https://github.com/invoice-x/invoice2data/issues/271) and passing a clean image (https://github.com/invoice-x/invoice2data/issues/169).

All pages of a PDF file are converted to PNG images before being fed into tesseract. Because ImageMagick's convert function loses the DPI information, the resolution is set explicitly to 300 dpi.
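For readers who want to see the idea outside the diff, here is a minimal sketch of the flow described above. It is not the code from this PR: the helper names (`build_convert_cmd`, `build_tesseract_cmd`, `ocr_pdf`), the `page-%03d.png` output pattern, and the `-depth 8 -alpha off` cleanup flags are assumptions for illustration. It relies only on the ImageMagick `convert` and `tesseract` command-line tools being on `PATH`.

```python
import shutil
import subprocess
from pathlib import Path

DPI = 300  # resolution named in the PR description


def build_convert_cmd(pdf_path, out_dir, dpi=DPI):
    """Build an ImageMagick command that writes one PNG per PDF page.

    -density before the input sets the rasterization resolution;
    the %03d in the output name makes convert emit one file per page.
    """
    pattern = str(Path(out_dir) / "page-%03d.png")  # hypothetical naming
    return ["convert", "-density", str(dpi), str(pdf_path),
            "-depth", "8", "-alpha", "off", pattern]


def build_tesseract_cmd(png_path, dpi=DPI, lang="eng"):
    """Build a tesseract command that prints the OCR text to stdout.

    --dpi tells tesseract the resolution explicitly, since the PNG
    itself may not carry usable DPI metadata.
    """
    return ["tesseract", str(png_path), "stdout",
            "--dpi", str(dpi), "-l", lang]


def ocr_pdf(pdf_path, out_dir):
    """Rasterize every page, OCR each one, and join the page texts."""
    if not (shutil.which("convert") and shutil.which("tesseract")):
        raise RuntimeError("ImageMagick convert and tesseract are required")
    Path(out_dir).mkdir(parents=True, exist_ok=True)
    subprocess.run(build_convert_cmd(pdf_path, out_dir), check=True)
    texts = []
    # Zero-padded names keep the pages in order when sorted.
    for png in sorted(Path(out_dir).glob("page-*.png")):
        result = subprocess.run(build_tesseract_cmd(png), check=True,
                                capture_output=True, text=True)
        texts.append(result.stdout)
    return "\n".join(texts)
```

Calling `ocr_pdf("invoice.pdf", "tmp_pages")` would return the concatenated text of all pages. Writing one file per page is just one option; the pages could also be piped or appended into a single image.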

bosd avatar Mar 27 '22 12:03 bosd

@RossK1 Would you be so kind to review?

bosd avatar Apr 13 '22 08:04 bosd

@m3nu Can you trigger tests?

bosd avatar Aug 05 '22 09:08 bosd

Force-pushed to trigger tests.

bosd avatar Aug 08 '22 11:08 bosd

@m3nu Can you see the button?

bosd avatar Aug 26 '22 07:08 bosd

@rmilecki Can you review this one? :pray:

bosd avatar Sep 01 '22 15:09 bosd

Removed the filename change from this code, as I think the code in PR #384 is better. So this one is now only for the OCR fixes and improvements. Hope this one can be merged quickly (into a new release). @m3nu 🙏

bosd avatar Sep 19 '22 08:09 bosd

Looks fine I guess.

rmilecki avatar Sep 24 '22 17:09 rmilecki