jbarlow

Results 352 comments of jbarlow

The current behavior wasn't quite planned. Yes it does calculate the orientation for skipped pages and says they will be fixed but then does not fix them. For skipped pages,...

@ajab21 No change in behavior for 9.0.0, which I really wanted to get out for other reasons.

@jrk2401 I think you could monkeypatch it but this isn't quite exposed behavior. I'm still working on plugins and hooks at major decision points, but I cut that feature from...

You could use a plugin that allows rotation detection to occur as normal, but suppresses regular OCR. That wasn't available when this issue was live. Pretty much just "subclassing" the...

From #647 the opener of this issue was complaining that even with --tesseract-timeout=0, ocrmypdf still seemed slow. The reason is partially that tesseract timeout is a bit of hack -...

A /Form is a "Form XObject" or a group object that can contain one or more images among other object types, including recursively other forms. A /Form is not an...

Can you provide a file that demonstrates this and the command line you used? There are multiple ways software can create a hyperlink in a PDF. On Thu., Jul. 30,...

By design `--force-ocr` is going to discard hyperlinks and other active content. A major use case of this feature is getting as much content out of possible out of damaged...

`--force-ocr` is going discard hyperlinks, that is intended behavior. If you use `--skip-text` to skip OCR on pages that already have printable text (i.e. all pages on this file), and...