Jeremy Singer-Vine

Results 105 comments of Jeremy Singer-Vine

Hi @SpencerNorris, and thanks for pushing the limits of `pdfplumber`! While it's possible there's a memory leak in `pdfplumber` itself, it's hard to debug this issue without the PDF in...

Yes, this is a problem, and one I'd like to fix. Based on a bit of exploration, it seems that the memory issues might stem from within `pdfminer.six`, possibly in...

Thanks for re-flagging this. Based on some testing, I think there's a more straightforward solution — one which does not require you to open and close the PDF multiple times:...

Update: Hah, I forgot that `pdfplumber` already has an (undocumented) way of doing this :) ```python with pdfplumber.open("data/my.pdf") as pdf: for page in pdf.pages: run_my_code() page.flush_cache() ```

Hi @navkirat, and thanks for flagging. Are you able to share the PDF that triggered the memory issues?

I agree! That could be a very nice feature. I will consider adding it in the future. In the meantime, PRs are welcome on this.

Have you examined the way in which the dotted lines are represented in either (a) the raw PDF file, or (b) `page.objects`? That's usually my first step in trying to...

Can you paste some examples of line objects that represent the dotted lines vs. line objects that represent the solid lines? The more explicit and detailed issues are, the easier...

Thanks, @samkit-jain! From reading the raw directives in the `Buprenorphine.pdf` file, it appears that this PDF is setting the graphics state's "line dash pattern" (see p. 217 of [the PDF...

Thanks for sharing, @matheusefagundes!