pdf2docx icon indicating copy to clipboard operation
pdf2docx copied to clipboard

All strings must be XML compatible: Unicode or ASCII, no NULL bytes or control characters...still exist

Open cloudtuotuo opened this issue 1 year ago • 0 comments

Description of the bug

We have a pdf file which has some invalid characters, like \uffff. An error occurred during the conversion.

Please help.

How to reproduce the bug

\uffff in doc

pdf2docx version

0.5.8

Operating system

Windows

Python version

3.11

cloudtuotuo avatar Oct 25 '24 01:10 cloudtuotuo