Eric Ihli
Tidying up some old issues that haven't had activity for a long time.
Actually, I'll leave this open for a little longer in case someone wants to tackle this. https://kba.github.io/hocr-spec/1.2/
That's a tough one. I don't know how I'd do that. Maybe [combine all of the PDF's pages into a single image](https://softwarerecs.stackexchange.com/questions/6895/converting-pdf-with-several-pages-to-one-image) and then do something with that?
Sorry to say that I have very little knowledge of Windows 10. I'll leave this comment open for a while in case anyone else has a suggestion.
If anyone wants to submit a patch to make this more portable across Linux/Windows, please do!
I think this might be because `open` falls back to your locale's encoding when you don't pass an `encoding` kwarg, and your locale's encoding is ASCII. https://docs.python.org/3/library/functions.html#open...
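If that is the cause, passing `encoding` explicitly should sidestep the locale. Here is a minimal sketch, assuming the file is UTF-8; the helper names `write_text`, `read_text`, and `roundtrip_demo` are made up for illustration and are not part of this project:

```python
import os
import tempfile


def write_text(path, text, encoding="utf-8"):
    """Write text with an explicit encoding instead of the locale default."""
    with open(path, "w", encoding=encoding) as f:
        f.write(text)


def read_text(path, encoding="utf-8"):
    """Read text with an explicit encoding instead of the locale default."""
    with open(path, encoding=encoding) as f:
        return f.read()


def roundtrip_demo():
    """Write non-ASCII text, then read it back as ASCII and as UTF-8."""
    sample = "café naïve"
    fd, path = tempfile.mkstemp(suffix=".txt")
    os.close(fd)
    try:
        write_text(path, sample)
        # Forcing ASCII mimics what happens under an ASCII locale.
        try:
            read_text(path, encoding="ascii")
            ascii_failed = False
        except UnicodeDecodeError:
            ascii_failed = True
        return read_text(path), ascii_failed
    finally:
        os.remove(path)


if __name__ == "__main__":
    print(roundtrip_demo())
```

Reading with `encoding="ascii"` stands in for an ASCII locale here, so the sketch behaves the same regardless of the machine it runs on.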