ConvertX icon indicating copy to clipboard operation
ConvertX copied to clipboard

[Feature Request] PDF from/to DOCX

Open ravensorb opened this issue 7 months ago • 6 comments

Any thought on adding support to convert between DOCX and PDF formats?

ravensorb avatar May 20 '25 19:05 ravensorb

have you tested pandoc?

C4illin avatar May 22 '25 15:05 C4illin

> pandoc .\iampdf.pdf -o imword.docx
Unknown input format pdf
Pandoc can convert to PDF, but not from PDF.

using pandoc for docx -> pdf should work. pdf -> docx probably won't

thejjw avatar May 23 '25 08:05 thejjw

There are tools on the internet to convert PDF to DOCX, but they just don't work as they should, and the formatting is all broken. It's probably not possible or really hard to do that conversion properly. Extracting text and formatting it manually is probably much simpler and also faster than fixing broken formatting.

M3dvidek avatar May 23 '25 13:05 M3dvidek

Word has built in docx to pdf which works okayish

C4illin avatar May 23 '25 13:05 C4illin

For me word's pdf to docx broke nearly everything it could but it was probably a pdf from made in old word version. Not sure

M3dvidek avatar May 23 '25 14:05 M3dvidek

and docx to pdf doesnt always work--especially with non-latin characters I think:

> pandoc .\somecjkdocx.docx -o output.pdf
Error producing PDF.
! LaTeX Error: Unicode character 湲?pandoc.exe: <stderr>: hPutChar: invalid argument (Invalid argument)

nothing wrong with convertx just pandoc (and probably pdflatex more than pandoc itself) limitation i suppose

thejjw avatar May 26 '25 01:05 thejjw