pypdf
pypdf copied to clipboard
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
I'm trying to extract text (see https://stackoverflow.com/q/75587416/562769 ) ## Environment Which environment were you using when you encountered the problem? ```bash $ python -m platform Linux-5.4.0-139-generic-x86_64-with-glibc2.31 $ python -c "import...
Hi @pubpub-zz, absolutely appreciate your effort in fixing py-pdfs form capabilities. I tested 3.11.1 and replaced pdftk with pypdf in our app. Its already looking quite good but there are...
2 issues to report. Not sure if these are bugs or feature. First, often, end of line words are concatenated with begining of next line words. For example: I used...
The annotation lines are very thin. I can only see them if i put a shape fill, but that means I get a filled-in shape as well. If I draw...
I'm trying to add a polyline annotation to a pdf file. It does not work whether it is an existing file or a new file (add_blank_page). ## Environment Which environment...
When trying to create a rectangle on pdf, there is no box displayed. If the argument interiour_color is passed, then the rectangle is displayed. How to create just a bordered...
If I create a link with positional, it jumps to an unintended page. ## Environment ```bash $ python -m platform Windows-10-10.0 $ python -c "import PyPDF2;print(PyPDF2.__version__)" 2.10.7 ``` ## Code...
I've just noticed that PR #1172 introduced a space regression issue for text extraction. A lot of spaces got removed. Those spaces should have stayed. ## Code + PDF Just...
extracted from #1379 PS : in the extraction result, the arabic characters are replaced with /afiinnnn. this is because the data uses the iso 10036 standard that I've not been...
closes #2633