pypdf
A pure-Python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
**"TypeError: argument of type 'NoneType' is not iterable"** Got this when I tried to read the outlines of a PDF file with `PdfReader.outlines`. ## Environment Which environment were you using...
If you have links in a PDF file (HTML anchor tags with an element id as href), they do not work after merging. ```html Go to target ....some content
## Explanation PDF documents can contain objects which are not referenced anywhere in the document. Those objects could be deleted without affecting how the document looks for file size...
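The idea of unreferenced objects can be sketched with a toy reachability check. This is only an illustration under stated assumptions: the document is modeled as a plain dictionary of object IDs to the IDs they reference, and `find_unreferenced` and `toy_document` are hypothetical names, not part of pypdf's API.

```python
from collections import deque


def find_unreferenced(objects, roots):
    """Return the IDs in `objects` that cannot be reached from `roots`.

    `objects` maps an object ID to the list of IDs it references
    (a hypothetical stand-in for a PDF's indirect objects).
    """
    reachable = set()
    queue = deque(r for r in roots if r in objects)
    while queue:
        obj_id = queue.popleft()
        if obj_id in reachable:
            continue
        reachable.add(obj_id)
        for ref in objects[obj_id]:
            # Ignore dangling references to IDs that do not exist.
            if ref in objects and ref not in reachable:
                queue.append(ref)
    return set(objects) - reachable


# Hypothetical document: object ID -> IDs it references.
toy_document = {
    1: [2],     # catalog -> page tree
    2: [3, 4],  # page tree -> two pages
    3: [5],     # page -> content stream
    4: [],      # page with no further references
    5: [],      # content stream
    6: [7],     # orphan, only referenced by another orphan
    7: [6],     # orphan cycle
}

orphans = find_unreferenced(toy_document, roots=[1])
print(sorted(orphans))  # [6, 7]
```

Objects 6 and 7 reference each other but are never reached from the root, so a simple reference count would not catch them; a traversal from the root does.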
I wanted to extract text from a PDF ## Environment Which environment were you using when you encountered the problem? ```bash $ python -m platform Linux-5.4.0-121-generic-x86_64-with-glibc2.31 $ python -c "import...
I've been trying out PyPDF2 and encountered cases where it is skipping text. It has no problem with one file (https://github.com/rnzucker/MadLib/blob/master/test-1.pdf), beyond adding newlines at 80 characters. But with another...
So I am trying to split a PDF that has two pages on each sheet and then extract the text. The file is at https://disclosure.bursamalaysia.com/FileAccess/apbursaweb/download?id=212505&name=EA_DS_ATTACHMENTS You can see that on each sheet there are two...
Failure using mergePage() with pages that do not have a resource dictionary. This appears to be a valid condition, and the page should then inherit dictionary content from its parent....
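The inheritance described in this report can be sketched as a walk up the page tree. This is a minimal illustration using plain dictionaries, not pypdf objects: `lookup_inherited` and the sample nodes are hypothetical, and only the `/Resources` and `/Parent` key names are taken from the PDF format.

```python
def lookup_inherited(node, key):
    """Return `key` from `node` or its nearest ancestor, else None.

    Each node is a plain dict; "/Parent" points at the parent node.
    """
    while node is not None:
        if key in node:
            return node[key]
        node = node.get("/Parent")
    return None


# Hypothetical page tree: the page has no "/Resources" of its own.
pages_node = {"/Type": "/Pages", "/Resources": {"/Font": {"/F1": "Helvetica"}}}
page = {"/Type": "/Page", "/Parent": pages_node}

resources = lookup_inherited(page, "/Resources")
print(resources)  # {'/Font': {'/F1': 'Helvetica'}}
```

A merge routine that reads `page["/Resources"]` directly would fail on such a page, whereas a lookup that falls back to the parent chain would not.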
When using `pdftk input1.pdf input2.pdf cat output output.pdf` to concatenate two input PDFs containing the same AcroForm form fields (i.e. two separately saved variants of the same PDF with different...
When I create a PDF by merging multiple PDFs then only the first page shows up correctly but all subsequent pages show blank form fields. This happens only when this...
From https://github.com/py-pdf/PyPDF2/discussions/1046, the following may be useful to cherry-pick if possible: > Improved embedded file handling (Rüdiger Jungbeck, rjungbeck) Allow attachment of more than 1 file with PdfFileWriter.addAttachment()...