jbarlow83

Results 350 comments of jbarlow83

It is possible to use Tesseract with the pseudo-language "equ" for equation detection. You could do `ocrmypdf -l eng+equ` for that (English + "equation language). I have not tried this...

Are the files visually damaged in any way, or is it just the error message that appears with no apparent error? Are you using Ghostscript 9.51 or 9.52? On Thu.,...

I'm looking for someone to make `choco install ocrmypdf` do all of the steps, instead of the longer set of instructions on the site. You can manually download more languages...

Maybe they are in a branch? On Thu., May 28, 2020, 06:39 ajweber, wrote: > FYI: choco instructions appear to have installed tesseract 5.0 (alpha). > Apparently the traineddata files...

`--oversample 10` would ensure images have a DPI of at least 10. (Where the DPI is calculated based on how the image is positioned on the canvas, not whatever DPI...

No, I just need to have the code make some decision about whether it needs to save the oversampled image. You could use a higher optimization setting to get smaller...

OCRmyPDF has a plugin interface that would allow you to replace Tesseract with a different OCR engine such as Azure. To the best of my knowledge no one has published...

See issue #541 for some discussion on this in the past, in particular for page segmentation. For a while my reply was that I needed someone to sponsor the work...

I'd never hear the end of it if this were true in all cases. How did you install ocrmypdf? Can you locate the file `ocrmypdf/cli.py` and paste it here?