[ERROR] Ignore page 1 due to making page error: invalid literal for int() with base 16: 'x1'

Open jangkim7 opened this issue 11 months ago • 11 comments

Description of the bug

I am unable to convert the attached PDF to DOCX.

sample.pdf

I am using the parse method to simply convert all pages.

from pdf2docx import parse

parse("sample.pdf", "sample.docx")

Here is the output that I get:

[INFO] Start to convert sample.pdf
[INFO] [1/4] Opening document...
[INFO] [2/4] Analyzing document...
[INFO] [3/4] Parsing pages...
[INFO] (1/1) Page 1
[INFO] [4/4] Creating pages...
[INFO] (1/1) Page 1
[ERROR] Ignore page 1 due to making page error: invalid literal for int() with base 16: 'x1'
[INFO] Terminated in 0.21s.
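For reference, the text after "making page error:" is the standard message of the ValueError that Python raises when int() is asked to read a non-hexadecimal string in base 16. The sketch below reproduces that message in plain Python, without pdf2docx. The helper names parse_hex and try_parse_hex are hypothetical and exist only for illustration; where pdf2docx actually makes this call, and which value in the PDF yields 'x1', is not known from the log.

```python
def parse_hex(text):
    """Convert a hexadecimal string to an int, exactly as int(text, 16) does."""
    return int(text, 16)


def try_parse_hex(text):
    """Return (value, None) on success or (None, error message) on failure.

    Hypothetical helper for illustration only; not part of pdf2docx.
    """
    try:
        return parse_hex(text), None
    except ValueError as exc:
        return None, str(exc)


# "0x1" is accepted: base 16 allows an optional "0x" prefix.
print(try_parse_hex("0x1"))

# "x1" is rejected: without the leading "0", "x" is not a hex digit.
print(try_parse_hex("x1"))
```

The second call yields the same message as the [ERROR] line, which suggests (as an assumption, not a confirmed diagnosis) that a hex value read while creating the page was missing part of its expected prefix or digits.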

How to reproduce the bug

See my description. It has the sample PDF file and the code snippet causing the error.

pdf2docx version

0.5.8

Operating system

macOS

Python version

3.10

jangkim7, Dec 06 '24 06:12