Sandro Mani

Results 119 comments of Sandro Mani
trafficstars

Can you elaborate? When recognizing in plain text mode, the text from each recognized image is separated with a line break. If you recognize to hOCR, the output is split...

Something like the strip line break functionality of the plain text mode should be doable. Regarding applying styling to the HTML: as far as I know this is how hOCR...

Might make sense to collect all the missing fonts and then display a final warning after the export, rather than risking many popups appearing during the export.

This is more a tesseract training issue that something gImageReader can handle.

Consequence of [asyncio.loop.create_unix_server()](https://docs.python.org/3.13/library/asyncio-eventloop.html#asyncio.loop.create_unix_server) will now automatically remove the Unix socket when the server is closed. (Contributed by Pierre Ossman in https://github.com/python/cpython/issues/111246.) ? cont.end() will close the server which will now...

Ok for Fedora, F38 (oldest non EOL release) has python-3.11.

Fixed in https://github.com/manisandro/gImageReader/commit/e72d657a408dc6b77c48c086feede31e08700b4c

Yes, but you will most likely need to recompile the application against the older libtesseract.