pypdf
A pure-Python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files.
Replace this: What happened? What were you trying to achieve? ## Environment Which environment were you using when you encountered the problem? ```bash $ python -m platform # Windows-10-10.0.22000-SP0 $...
Attempt to work around #2234
Many years ago I used pypdf to create links for a book of maps for our storm sewer system. I had an index page that had links to all of...
Trying to extract text from one PDF page, but parsing crashes. ## Environment Which environment were you using when you encountered the problem? ```bash $ python -m platform Darwin-22.6.0-x86_64-i386-64bit $ python...
Closes #2087. Closes #2090. Also adds compatibility with RF (adding list). Still in progress.
When extracting an image from some specific PDF file, the corresponding output file is distorted. Evince and *pdfimages* are able to extract correct images. ## Environment Which environment were you...
## Explanation To be able to reconstruct a document (like an HTML page), it would be necessary to add a tag like [tagimage]1[/tagimage] in the extracted text at the place...
Currently, I am trying to extract text from PDF files which partially report some warnings like ``` /home/stefan/temp/venv/lib/python3.9/site-packages/pypdf/_cmap.py:183: PdfReadWarning: Advanced encoding /GBK2K-H not implemented yet warnings.warn( /home/stefan/temp/venv/lib/python3.9/site-packages/pypdf/_cmap.py:183: PdfReadWarning: Advanced encoding...
I added a FreeText annotation to a PDF, but noticed that it's not shown in some PDF viewers. The input PDF seems not to matter. ## Environment - Windows-10-10.0.22000-SP0 -...
I am having problems adding Ink annotations back to a PDF using PdfWriter.add_annotation(). I think the problem is related to the PDF viewer. When I open the file after adding...