Jeremy Singer-Vine
Jeremy Singer-Vine
FWIW, it seems that the same text is not copy/paste-able from a standard PDF viewer into a plain text editor. I haven't examined the potential cause closely, but this suggests...
Interesting edge-case, @bronislav. Thanks for sharing and flagging. I think passing `edge_min_length` to the `filter_edge` calls _should_ work ... but perhaps I'm overlooking something?
Ah, I see what you mean. Yes, I think you're correct that it's a good idea to allow these to be set separately. Here's an attempt to implement that, now...
Hi @ennamarie19, and thank you for reaching out. My general inclination is to keep this repository as lean as is feasible. Would it be possible to run the fuzz harness...
Thanks for following up. No objection from me, and you can use the email address listed in my GitHub profile. Looking forward to seeing what the fuzzers generate.
Version v0.11.1, just released, attempts to fix `repair=True`. Can you upgrade your version of `pdfplumber` (`pip install -U pdfplumber`) and try again?
Hi @luanmota, a couple of additional notes: - I've never heard of color components using string literals. (Interesting!) My guess is that this is against the PDF spec, although I...
@luanmota, no apology necessary! That's a utility method that's mainly used internally (and thus not listed in the core documentation), but might be useful here for your edge-case. It resolves...
Thanks for the kind words @luanmota, and thanks for the very interesting example. The [PDF specification](https://ghostscript.com/~robin/pdf_reference17.pdf) has a section ("4.6 Patterns") on patterns, and it seems like this is what...
Fascinating! Thank you for sharing. An idle thought: What if `pdfplumber` could allow users to _choose_ their parsing backend? Would require `pdfplumber` to develop some additional abstractions, but might be...