jbarlow83 comments

Results 350 comments of


jbarlow83

Searching math equations

It is possible to use Tesseract with the pseudo-language "equ" for equation detection. You could do `ocrmypdf -l eng+equ` for that (English + "equation language). I have not tried this...

Error: File did not complete the page properly and may be damaged.

Are the files visually damaged in any way, or is it just the error message that appears with no apparent error? Are you using Ghostscript 9.51 or 9.52? On Thu.,...

Chocolately package for Windows

I'm looking for someone to make `choco install ocrmypdf` do all of the steps, instead of the longer set of instructions on the site. You can manually download more languages...

Chocolately package for Windows

Maybe they are in a branch? On Thu., May 28, 2020, 06:39 ajweber, wrote: > FYI: choco instructions appear to have installed tesseract 5.0 (alpha). > Apparently the traineddata files...

Oversampling + image processing enlarges output image unnecessarily

`--oversample 10` would ensure images have a DPI of at least 10. (Where the DPI is calculated based on how the image is positioned on the canvas, not whatever DPI...

Oversampling + image processing enlarges output image unnecessarily

No, I just need to have the code make some decision about whether it needs to save the oversampled image. You could use a higher optimization setting to get smaller...

Azure ocr with ocrmypdf

OCRmyPDF has a plugin interface that would allow you to replace Tesseract with a different OCR engine such as Azure. To the best of my knowledge no one has published...

Documentation Bug: Mention of consulting inquiries includes no contact information

Please contact [email protected] Thank you!

support monochromatic conversion

See issue #541 for some discussion on this in the past, in particular for page segmentation. For a while my reply was that I needed someone to sponsor the work...

Command line option deskew not found but d is available

I'd never hear the end of it if this were true in all cases. How did you install ocrmypdf? Can you locate the file `ocrmypdf/cli.py` and paste it here?