jbarlow83

Results 350 comments of jbarlow83

Thoughts - it's hard to get monochrome right because there are various options to invert that are not always respected by all programs - because of the above, it's hard...

@alirf81 If you'd like to move things along faster, please submit a reproducible example PDF and command line.

We can do tesseract-timeout because it's still possible to produce a functional, mostly OCRed PDF if Tesseract fails on certain pages. But Ghostscript is a one-shot - it has to...
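The per-page tolerance described above can be sketched roughly as follows. This is only an illustration, not ocrmypdf's actual code: `PageTimeout`, `ocr_pages`, and `fake_ocr` are hypothetical names, and the timeout is simulated by raising an exception rather than by timing a real Tesseract run.

```python
class PageTimeout(Exception):
    """Hypothetical error: the OCR step for one page ran out of time."""


def ocr_pages(pages, ocr_page):
    """Run ocr_page on every page, skipping pages that time out.

    Returns the OCR results keyed by 1-based page number, plus the
    list of page numbers that were skipped.
    """
    results = {}
    skipped = []
    for number, page in enumerate(pages, start=1):
        try:
            results[number] = ocr_page(page)
        except PageTimeout:
            # One bad page does not sink the whole document.
            skipped.append(number)
    return results, skipped


def fake_ocr(page):
    """Stand-in for a real OCR call (assumption for this sketch)."""
    if page == "slow":
        raise PageTimeout(page)
    return page.upper()


results, skipped = ocr_pages(["a", "slow", "c"], fake_ocr)
print(results, skipped)
```

A one-shot step has no equivalent of the `except` branch: there is only a single call covering the whole file, so there is nothing partial to keep if it fails.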

`-r12106.537530x12106.537530` That's the problem... ocrmypdf picked too high a rendering resolution for the file for some reason. It tries to pick a resolution that will capture all details in the...

This looks like a problem in Windows Python 3.9 that is solved in Python 3.10. I suggest trying 3.10. https://github.com/python/cpython/pull/24793

Yes, just update to Python 3.10. The version of ocrmypdf shouldn't matter much for this issue.

That makes sense - unfortunately I still have not had a chance to replace `--remove-background`.

Use pdfminer.six for this. It's a whole PDF library dedicated to text processing - because it's very complicated. There are about six types of fonts, for example.

@stweil Any call to `exit()` from code intended for use in a library is a bug - it doesn't really matter how it gets triggered. For a GUI that binds...
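To illustrate the point about `exit()` in library code, here is a minimal sketch in Python. All names (`RecognitionError`, `recognize`, `main`) are hypothetical and not taken from any real library. The library function reports failure by raising, and only the command-line wrapper converts that into an exit status.

```python
import sys


class RecognitionError(Exception):
    """Hypothetical error type raised by the library instead of exiting."""


def recognize(image_path):
    """Library-style function: signals failure by raising, never by exiting."""
    if not image_path:
        raise RecognitionError("no input image given")
    return f"text from {image_path}"


def main(argv):
    """CLI wrapper: the only layer that maps errors to an exit status."""
    try:
        print(recognize(argv[1] if len(argv) > 1 else ""))
    except RecognitionError as exc:
        print(f"error: {exc}", file=sys.stderr)
        return 1
    return 0


# A command-line script would end with sys.exit(main(sys.argv)).
# A GUI or other host process can call recognize() directly and
# handle RecognitionError without being terminated.
status = main(["prog"])
```

With this split, a host application that embeds the library keeps control of its own process lifetime, which is exactly what a bare `exit()` inside the library would take away.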

C++ exceptions are zero cost when the exception does not occur.