pypdf icon indicating copy to clipboard operation
pypdf copied to clipboard

A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files

Results 221 pypdf issues
Sort by recently updated
recently updated
newest added

I'm trying to extract text (see https://stackoverflow.com/q/75587416/562769 ) ## Environment Which environment were you using when you encountered the problem? ```bash $ python -m platform Linux-5.4.0-139-generic-x86_64-with-glibc2.31 $ python -c "import...

workflow-text-extraction
help wanted

Hi @pubpub-zz, absolutely appreciate your effort in fixing py-pdfs form capabilities. I tested 3.11.1 and replaced pdftk with pypdf in our app. Its already looking quite good but there are...

workflow-forms

2 issues to report. Not sure if these are bugs or feature. First, often, end of line words are concatenated with begining of next line words. For example: I used...

workflow-text-extraction
whitespace

The annotation lines are very thin. I can only see them if i put a shape fill, but that means I get a filled-in shape as well. If I draw...

nf-documentation
Easy

I'm trying to add a polyline annotation to a pdf file. It does not work whether it is an existing file or a new file (add_blank_page). ## Environment Which environment...

nf-documentation
workflow-annotation

When trying to create a rectangle on pdf, there is no box displayed. If the argument interiour_color is passed, then the rectangle is displayed. How to create just a bordered...

workflow-annotation

If I create a link with positional, it jumps to an unintended page. ## Environment ```bash $ python -m platform Windows-10-10.0 $ python -c "import PyPDF2;print(PyPDF2.__version__)" 2.10.7 ``` ## Code...

is-bug

I've just noticed that PR #1172 introduced a space regression issue for text extraction. A lot of spaces got removed. Those spaces should have stayed. ## Code + PDF Just...

is-bug
workflow-text-extraction
whitespace

extracted from #1379 PS : in the extraction result, the arabic characters are replaced with /afiinnnn. this is because the data uses the iso 10036 standard that I've not been...

help wanted
workflow-arabic-text-extraction