Jeremy Singer-Vine
Jeremy Singer-Vine
Hi @rerik, and thanks for your interest in `pdfplumber`. Can you share the PDF and a minimal Python script that reproduces the problem?
Thank you, this is very helpful. I can reproduce the issue, and will see if I can find a solution.
Hi @stolarczyk, and thank you for this suggestion. It makes sense to provide such warnings, although I'd lean toward a more generalizable approach rather than specifying parameters for each type...
My apologies for the misunderstanding! I think the name of the proposed parameter threw me off, but I also should have looked more closely. I think I understand it now....
Thanks, @stolarczyk — I've pushed a small tweak, above, so that the linter is happy. But looks like we're missing a bit of test coverage:
Thanks! Merged into `develop`
Impressive! Do you think there's an approach to implementing this where `pdfminer.six` and `paves` could be interchangeable? I.e., the user could select which engine to use?
Thank you for flagging. Could you provide a PDF, as well as a reproducible script, that replicates this issue?
Hi @alzambranolu13, try adjusting the [`x_tolerance` parameter ](https://github.com/jsvine/pdfplumber#extracting-text) to a value larger than the default of `3`. Does that work for you?
Thanks for checking. Can you share the PDF? (It'll be hard to diagnose your issue without it.)