jbarlow83

Results 350 comments of jbarlow83

@amitdo I will look. I'd consider using a separate `Tz` for the trailing space rather than modifying the font. `1.0 Tz [ ] TJ 0.001 Tz [ ] TJ` Seems...

The OCR engine sometimes cannot detect the skew angle. It's a heuristic/subjective calculation. You may get better results for this type of image by adding `--tesseract-thresholding sauvola`

On second thought, the issue is that `--tesseract-timeout 0` prevents detection of the skew angle, since that feature depends on Tesseract.
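As a rough illustration of the two comments above, here is a minimal sketch that assembles an `ocrmypdf` command line with deskewing and Sauvola thresholding, and refuses a Tesseract timeout of 0 because skew detection depends on Tesseract. The helper name `build_ocrmypdf_command` and the file names are hypothetical; only the `--deskew`, `--tesseract-thresholding` and `--tesseract-timeout` flags come from the OCRmyPDF CLI.

```python
import shlex


def build_ocrmypdf_command(input_pdf, output_pdf,
                           thresholding="sauvola", tesseract_timeout=None):
    """Return an argument list for the ocrmypdf CLI (hypothetical helper)."""
    # Skew detection relies on Tesseract, so a timeout of 0 would disable it.
    if tesseract_timeout is not None and tesseract_timeout <= 0:
        raise ValueError("--tesseract-timeout 0 prevents skew angle detection")

    cmd = ["ocrmypdf", "--deskew", "--tesseract-thresholding", thresholding]
    if tesseract_timeout is not None:
        cmd += ["--tesseract-timeout", str(tesseract_timeout)]
    cmd += [input_pdf, output_pdf]
    return cmd


if __name__ == "__main__":
    # Print the command instead of running it; pass the list to
    # subprocess.run(...) if ocrmypdf is installed.
    print(shlex.join(build_ocrmypdf_command("in.pdf", "out.pdf")))
```

Whether Sauvola thresholding actually helps will depend on the image, so treat this as a starting point to experiment with rather than a guaranteed fix.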

Sorry, I can't help with this. I really don't know anything about cx_Freeze or how to use it. I suggest going to the cx_Freeze support forums for guidance. ocrmypdf is...

If you use `--redo-ocr` you will get most of what you want. There is no OCR text to "redo" but this does what you want anyway, because * create an image...

Discussed here - #541. Multi-resolution compression is the main name for this. Image segmentation is the main technique to label pixels as belonging to different categories. DPI adjustments aren't necessarily...

@heinrich-ulbricht You are correct that what you are describing is not currently possible with OCRmyPDF.

It's possible to replace the OCR engine with a plugin of your choosing or use plugins to manipulate what the OCR engine sees. It's unlikely I'll have the time to work...

The `--force-ocr` option asks for the page to be rasterized, so the appearance will differ. When using this mode, ocrmypdf tries to guess an appropriate resolution to rasterize at. In...

I suspect you may not be using the most recent version of Tesseract. Using Tesseract v4.0, many of the "red arrow" areas are recognized. Note the improvement on the third page...