pdfbox
PDFBOX-5747: Surrogate pairs with combining diacritics are incorrectly ordered on text extraction
- Changed TextPosition.insertDiacritic() to preserve surrogate pairs
- Added unit test
- Included example test PDF file attached to PDFBOX-5747
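
For context, a minimal sketch of the kind of ordering problem described above. This is not the actual PDFBox code: `SurrogateSafeInsert`, `insertAfterBase` and `naiveInsert` are hypothetical names used only for illustration. It assumes the diacritic has to be placed after a base character that may occupy two UTF-16 code units, and uses only `java.lang.String` and `java.lang.Character` methods.

```java
// Illustration only: hypothetical helper, not TextPosition.insertDiacritic() itself.
class SurrogateSafeInsert {

    // Inserts the diacritic after the whole code point that starts at 'index',
    // so a high/low surrogate pair is never split.
    static String insertAfterBase(String text, int index, String diacritic) {
        int codePoint = text.codePointAt(index);
        int end = index + Character.charCount(codePoint); // 1 for BMP, 2 for a surrogate pair
        return text.substring(0, end) + diacritic + text.substring(end);
    }

    // Naive variant that assumes one char per character; it lands between
    // the two surrogates when the base character is outside the BMP.
    static String naiveInsert(String text, int index, String diacritic) {
        return text.substring(0, index + 1) + diacritic + text.substring(index + 1);
    }

    public static void main(String[] args) {
        String base = "\uD835\uDC00b"; // U+1D400 (a surrogate pair) followed by 'b'
        String acute = "\u0301";       // COMBINING ACUTE ACCENT

        String safe = insertAfterBase(base, 0, acute);
        String naive = naiveInsert(base, 0, acute);

        // The safe variant keeps the pair intact; the naive one breaks it.
        System.out.println(Character.isSurrogatePair(safe.charAt(0), safe.charAt(1)));
        System.out.println(Character.isSurrogatePair(naive.charAt(0), naive.charAt(1)));
    }
}
```

Stepping by `Character.charCount(codePoint)` rather than by one `char` is what keeps both halves of the pair together in this sketch; the actual change in `TextPosition.insertDiacritic()` may be structured differently.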