pdf2docx icon indicating copy to clipboard operation
pdf2docx copied to clipboard

Issue with graphs and bullets

Open GhazanfarAteeb opened this issue 2 years ago • 2 comments
trafficstars

I tried pdf2docx and found it great to work with and easy to use. However I faced some issue while converting the graphs and list with bullets. Currently the graphs being rendered as tables and bullets as ☐ with ?. I'm attaching the screenshots of the output file. Screenshot of the converted docx file

The screenshot of the converted docx file

Screenshot of the original pdf file

The screenshot of the original file Please let me know what can be done with it in this regard.

GhazanfarAteeb avatar May 24 '23 06:05 GhazanfarAteeb

Sorry for my so late reply.

  • I guess the bullets issue is caused by a wrong font name, so it can't be rendered correctly. The font name issue was fixed recently. So, please upgrade pdf2docx and have a try.

  • The graph issue is definitely caused by vector graph detecting issue. The chart is formed with lots of shapes like lines and rectangles, rather than a complete image. Due to the limitation of current algorithm, pdf2docx thinks that's a table. 🤣

It'll be much appreciated if you can provide this pdf file for further testing.

dothinking avatar Jan 23 '24 15:01 dothinking