pypdf
A pure-Python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
We currently have a configuration for *mutmut* (https://mutmut.readthedocs.io/en/latest/index.html) inside the repository, but do not seem to really use or have a look at it. This does not really make much...
While trying to extract lemmas from this page, I found that some text "nodes" (not sure what the technical term is, I'll refer to them as nodes in this issue)...
I'm raising this issue as a result of a super useful (and helpful!) chat with @MartinThoma. For simplicity, I am trying to extract the first page of the 'SECTOR ANALYSIS'...
The [`PdfWriter.open_destination`](https://pypdf2.readthedocs.io/en/latest/modules/PdfWriter.html#PyPDF2.PdfWriter.open_destination) docs aren't parsed "right" and so look a bit incorrect: Looking at the other `@property` values, though, it doesn't seem like having a `:param:` is the right way...
Instead of overwriting the page every time and loading it again (which takes a lot of time) you might want to create an empty page and merge the watermark...
After I fill in the form fields, they remain visible on top of the filled-in text, causing the filled-in text to be hidden under them...
Also change TABLE to table
Closes #2613. Added code to detect patterns in `_get_ids_image`. To avoid any conflicts with images that could be located directly in a page or images using the same ID in...
When merging PDFs containing images (one per page), some images were altered in the resulting merged file. The issue was discussed on Stack Overflow here: https://stackoverflow.com/questions/78508800/pypdf-does-not-give-me-the-right-image where it was proposed to...