
Handling control characters and overstrike with backspace

Open BobLd opened this issue 6 years ago • 0 comments

The following document contains several control characters. For example, page 8 contains \u0002, \u0003, \b, \u0005, \u0006, \a, \t, \n, \v, \f. The problem with some of these control characters is that their bounding boxes cause issues when building words, lines and text blocks. I guess they should be removed from the page's letters (apart from the backspace, see below).
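To make the proposal concrete, here is a minimal, library-agnostic sketch of the filtering step. It is not PdfPig code: the `Letter` class, its fields, and `strip_control_letters` are hypothetical names used only for illustration, and treating every Unicode `Cc` character as removable is an assumption.

```python
import unicodedata
from dataclasses import dataclass


@dataclass
class Letter:
    """Hypothetical stand-in for a page letter: text plus bounding box."""
    value: str
    x: float = 0.0
    y: float = 0.0
    width: float = 0.0
    height: float = 0.0


def is_control(value: str) -> bool:
    # True when the text is non-empty and made only of Unicode control
    # characters (general category "Cc"), e.g. \u0002, \t, \n, \v, \f.
    return len(value) > 0 and all(unicodedata.category(c) == "Cc" for c in value)


def strip_control_letters(letters, keep=("\b",)):
    # Drop control-character letters so their bounding boxes cannot distort
    # word/line/block detection. Backspace is kept (assumption: it is
    # handled by a later overstrike step).
    return [l for l in letters if l.value in keep or not is_control(l.value)]


letters = [Letter("A"), Letter("\u0002"), Letter("\b"), Letter("\t"), Letter("B")]
cleaned = strip_control_letters(letters)
```

With the sample input above, only `A`, the backspace and `B` remain in `cleaned`.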

The backspace character (\b) is used on this page to overstrike the = sign and turn it into an equal sign with a strikethrough. This backspace behavior should be handled so that the correct character is returned.
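One possible way to handle this is sketched below, working on plain characters rather than on PdfPig letters. Everything here is an assumption for illustration: the function name `resolve_overstrikes`, the `OVERSTRIKE` lookup table, the idea that the overstriking glyph is `/`, and the choice of `≠` (U+2260) as the resulting character. The actual glyph pair in the document may differ.

```python
# Hypothetical lookup: (base character, overstriking character) -> result.
# The ("=", "/") pair and the U+2260 result are assumptions, not taken
# from the document in question.
OVERSTRIKE = {("=", "/"): "\u2260"}


def resolve_overstrikes(chars: str, table=OVERSTRIKE) -> str:
    out = []
    i = 0
    while i < len(chars):
        c = chars[i]
        if c == "\b" and out and i + 1 < len(chars):
            # A backspace means the next character is drawn on top of the
            # previous one; look the pair up in either order.
            base, over = out[-1], chars[i + 1]
            merged = table.get((base, over)) or table.get((over, base))
            if merged is not None:
                out[-1] = merged
                i += 2  # consume the backspace and the overstriking char
                continue
        if c != "\b":
            out.append(c)  # ordinary character: keep as-is
        i += 1  # unmatched backspaces are simply dropped
    return "".join(out)


result = resolve_overstrikes("a =\b/ b")
```

Pairs that are not in the table are left as two separate characters, which keeps the step conservative: only known overstrike combinations are rewritten.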

BobLd, Oct 21 '19 19:10