pdfbox issues

lenient DomXmpParser

The XMP box library is nice, but in the wild there are PDF files that fail parsing. For example, dc.create is a Bag instead of a Seq. Ideally the parser...

gunnar-ifp

PDFBOX-4073 Choosable Coordinate-Unitsystem

1

This pull request is discussed in Jira ticket https://issues.apache.org/jira/browse/PDFBOX-4073. Our take on this: there could be a need to work with millimetres or inches instead of points. @THausherr commented that...

habbisify
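For context on the unit question raised in this pull request: PDF default user space uses 1/72 inch per unit, so converting between points, inches and millimetres is plain arithmetic. The following is a minimal sketch of that arithmetic only; the class and method names are hypothetical and are not taken from the pull request or from PDFBox.

```java
// Hypothetical helper for illustration; not part of PDFBox or the pull request.
class UnitSketch {
    // One PDF point is 1/72 inch in default user space.
    static final float POINTS_PER_INCH = 72f;
    // One inch is exactly 25.4 millimetres.
    static final float MM_PER_INCH = 25.4f;

    static float inchesToPoints(float inches) {
        return inches * POINTS_PER_INCH;
    }

    static float mmToPoints(float mm) {
        return mm / MM_PER_INCH * POINTS_PER_INCH;
    }

    static float pointsToMm(float points) {
        return points / POINTS_PER_INCH * MM_PER_INCH;
    }
}
```

With these helpers, an A4 page width of 210 mm comes out at roughly 595.28 points.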

Unread trailing e when scientific notation was expected

Addresses issue https://issues.apache.org/jira/browse/PDFBOX-5025. Unreading the trailing 'e' from the endobject string allows parsing to continue and complete as expected.

cwholmes
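The idea of unreading a trailing 'e' can be sketched with a `PushbackInputStream`: when an 'e' follows a number but no exponent digits come after it, the character is pushed back so the next token (for example a keyword starting with 'e') is read intact. This is a simplified illustration under that assumption, not the code from the pull request; `NumberTokenSketch` and its methods are hypothetical names.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.PushbackInputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;

// Hypothetical illustration; this is not PDFBox's parser code.
class NumberTokenSketch {

    private static boolean isDigit(int c) {
        return c >= '0' && c <= '9';
    }

    private static boolean isSign(int c) {
        return c == '+' || c == '-';
    }

    // Reads one numeric token and leaves the stream positioned right after it.
    static String readNumber(PushbackInputStream in) throws IOException {
        StringBuilder sb = new StringBuilder();
        boolean seenExponent = false;
        int c;
        while ((c = in.read()) != -1) {
            boolean afterExponent = sb.length() > 0
                    && Character.toLowerCase(sb.charAt(sb.length() - 1)) == 'e';
            if (isDigit(c) || c == '.' || (isSign(c) && (sb.length() == 0 || afterExponent))) {
                sb.append((char) c);
            } else if ((c == 'e' || c == 'E') && sb.length() > 0 && !seenExponent) {
                // Peek one character to see whether an exponent really follows.
                int next = in.read();
                if (next != -1) {
                    in.unread(next);
                }
                if (isDigit(next) || isSign(next)) {
                    sb.append((char) c);
                    seenExponent = true;
                } else {
                    // Not scientific notation: give the 'e' back to the stream.
                    in.unread(c);
                    break;
                }
            } else {
                in.unread(c);
                break;
            }
        }
        return sb.toString();
    }

    // Convenience wrapper: returns { number token, remaining input }.
    static String[] split(String input) {
        byte[] bytes = input.getBytes(StandardCharsets.ISO_8859_1);
        // Pushback buffer of 2 so both the peeked character and the 'e' fit.
        try (PushbackInputStream in =
                 new PushbackInputStream(new ByteArrayInputStream(bytes), 2)) {
            String number = readNumber(in);
            StringBuilder rest = new StringBuilder();
            int c;
            while ((c = in.read()) != -1) {
                rest.append((char) c);
            }
            return new String[] { number, rest.toString() };
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

For the input `5endobj`, `split` returns the number `5` and leaves `endobj` in the stream, while `1.5e-3 0` is still read as the single number `1.5e-3`.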

Rebuild the trailer when missing pages item

Addresses issue https://issues.apache.org/jira/browse/PDFBOX-5026. Rebuilding the trailer when the pages item is missing can allow the PDF to be built when lenient parsing is enabled.

cwholmes

[PDFBOX-3812] Support auto size font for multiline PDTextField

6

https://issues.apache.org/jira/browse/PDFBOX-3812

dannymcpherson

PDFBOX-4952 PDF compression - object stream creation

This pull request is discussed in JIRA ticket https://issues.apache.org/jira/browse/PDFBOX-4952. I implemented a basic starting point to realize PDF compression based on PDFBox 2.0.22-SNAPSHOT. I want to use this ticket,...

christianAppl

COSName should be written with same charset as it was read. PDFBOX-4728

3

If reading is done using Windows-1252 and writing using UTF-8, then a PDF containing Windows-1252 encoded XObject dictionary names will be broken after doing a load and save: PDDocument document = PDDocument.load(sourcePath);...

oikku
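The charset mismatch described in this pull request can be reproduced with plain JDK string handling, independent of PDFBox. The sketch below decodes raw name bytes with one charset and encodes them with another; `NameCharsetSketch` and `roundTrip` are hypothetical names, and the code assumes the JDK in use provides the `windows-1252` charset.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical illustration of the round trip; not PDFBox's COSName code.
class NameCharsetSketch {
    // Assumption: this charset is available in the running JDK.
    static final Charset WINDOWS_1252 = Charset.forName("windows-1252");

    // Decodes the raw bytes with one charset and re-encodes with another.
    static byte[] roundTrip(byte[] raw, Charset readCharset, Charset writeCharset) {
        String name = new String(raw, readCharset);
        return name.getBytes(writeCharset);
    }

    static boolean survives(byte[] raw, Charset readCharset, Charset writeCharset) {
        return Arrays.equals(raw, roundTrip(raw, readCharset, writeCharset));
    }

    public static void main(String[] args) {
        // "Im" followed by 0xE4, which is 'ä' in Windows-1252.
        byte[] raw = { 'I', 'm', (byte) 0xE4 };
        System.out.println(survives(raw, WINDOWS_1252, WINDOWS_1252));
        System.out.println(survives(raw, WINDOWS_1252, StandardCharsets.UTF_8));
    }
}
```

Writing with the same charset reproduces the original bytes, whereas writing with UTF-8 turns the single byte 0xE4 into the two bytes 0xC3 0xA4, so any reference that still uses the original byte sequence no longer matches.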