pypdf icon indicating copy to clipboard operation
pypdf copied to clipboard

A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files

Results 221 pypdf issues
Sort by recently updated
recently updated
newest added

Added correct_annotation_encodings method in writer class to automatically deal with issue where output PDF form with undefined '/Encoding' in the "/DR" section in the fields/annotations. It will be called in...

needs-discussion

I need to extract text from a PDF document using the `page.extract_text` function, but all the extracted Chinese characters are garbled. I suspect that this PDF document uses several special...

workflow-text-extraction
is-cjk-issue

pypdf version: 4.2.0 platform: Linux-6.5.0-1018-oem-x86_64-with-glibc2.35 Python: 3.10.12 Traceback error ``` File "/home/suresh/venv-lanchain/lib/python3.10/site-packages/pypdf/_page.py", line 2083, in extract_text return self._extract_text( File "/home/suresh/venv-lanchain/lib/python3.10/site-packages/pypdf/_page.py", line 1804, in _extract_text for operands, operator in content.operations: File...

generic
is-robustness-issue

This functionality originally added back in ced2890104dd4633132fe981b7f6bdd67aa17ed8 Reduces memory usage by size of loaded file. Benchmark script ``` from pypdf import * filename = '/home/msirabella/tmp/100MB-TESTFILE.ORG.pdf' writer = PdfWriter(clone_from=filename) writer.write("out.pdf") ```...

PdfReader

## Explanation I read up on a couple of old issues referencing transfer_rotation_to_content as well as Annotations an came to the conclusion that this seems to be an egde-case issue:...

workflow-annotation

Running the tests with `Pillow==10.3.0` breaks `test_filters.test_rgba`. `Pillow==10.2.0` works correctly. ## Environment Which environment were you using when you encountered the problem? ```bash $ python -m platform Linux-5.14.21-150400.24.100-default-x86_64-with-glibc2.31 $ python...

is-bug
workflow-images

At the moment, the requirement files used by the CI are only updated partially and if required. The same applies to the pre-commit configuration. There have been some related issues...

Meta

We should review the following pinned dependencies in our include files for the `requirements.txt` files to ensure that we keep being compatible with recent Python versions: * Recent `fpdf` versions...

dependencies

pypdf uses id(obj) to keep track of objects in `_id_translated`. This identifier is **not** unique for different objects, only for objects existing at the same time. e.g. In CPython id(obj),...

## Explanation I'd love to be able to change field properties. Specifically, in Adobe, it is possible to change the background color of a signature field to yellow, red, or...

workflow-forms