pypdf issues

ENH: Add AnnotationBuilder.widget(...)

2

See #107

MartinThoma

help wanted

BUG: sweep indirect references when cloning from PdfReader object

4

Addresses #1185

mtd91429

BUG: Fix Parsing of Inline Images

7

The inline image parser does not look for whitespace before the `EI` keyword as it should. Thus if you have a content stream as follows, the parser would crash: ```...

speedplane

is-bug

workflow-images

needs-test

needs-rebase

Add popup annotation support

1

## Explanation More complete feature support for annotations. ## Code Example Just like the other annotations ## Hints * TABLE 8.34 Additional entries specific to a pop-up annotation * See...

MartinThoma

is-feature

Resolve indirect objects when reading metadata

1

## Explanation The pattern `writer.add_metadata(reader.metadata)` doesn't work as there can be indirect objects. This is unfortunate and an unnecessary complexity for the user. ## Code Example ```python from PyPDF2 import...

MartinThoma

PdfReader

is-feature

ENH: Support adding multiple file attachments

1

Hi! I would like the possibility of merging in pages as hidden attachments, is this possible with pypdf at all? Edit: This seems to have been added now (thanks!), but...

moodh

is-feature

Huge memory/cpu utilization for 1 page PDF extraction

8

extractText() cpu/memory utilization is massive for the following 1 page 3 MB file. The extraction doesn't complete and the process has to be killed. http://www.dora.state.co.us/pls/efi/efi_p2_v2_demo.show_document?p_dms_document_id=105933&p_session_id=

alisufian

nf-performance

workflow-text-extraction

Has MCVE

is-maintenance

Issue in text extraction (spaces)

Issue with text extraction (spacing) ## Environment Which environment were you using when you encountered the problem? windows 10 ``` $ python -c "import PyPDF2;print(PyPDF2.__version__)" 2.7.0 ``` ## Code +...

pubpub-zz

is-bug

workflow-text-extraction

whitespace

Support reading PDF tags

Chapter "10.7 Tagged PDF" (PDF 1.7 standard) is something that PyPDF2 doesn't support at all... at least I think so. I need to read that chapter (+ several related other...

MartinThoma

is-feature

PyPDF2.errors.PdfReadError: EOF marker not found

2

The following script originally hanged, but with PyPDF2==2.4.2 we get `PdfReadError: EOF marker not found`. ## MCVE: PDF + Code [This file](https://www.puc.nh.gov/Regulatory/CASEFILE/2001/01-006%20THROUGH%20MARCH%202010/01-006%202009-04-30%20FRP%20NON%20CONFIDENTIAL%20PAP%20FILING.PDF) is 298MB with 21 pages. ```python from PyPDF2...

alisufian

PdfReader

is-robustness-issue

Has MCVE

pypdf
pypdf copied to clipboard

Metadata

ENH: Add AnnotationBuilder.widget(...)

BUG: sweep indirect references when cloning from PdfReader object

BUG: Fix Parsing of Inline Images

Add popup annotation support

Resolve indirect objects when reading metadata

ENH: Support adding multiple file attachments

Huge memory/cpu utilization for 1 page PDF extraction

Issue in text extraction (spaces)

Support reading PDF tags

PyPDF2.errors.PdfReadError: EOF marker not found

← Metadata

Owner

Metadata

pypdf pypdf copied to clipboard

Metadata

← Metadata

Owner

Metadata

pypdf
pypdf copied to clipboard