No translation of option buttons when converting docx to Markdown

Open Cybonto opened this issue 11 months ago • 2 comments

Bug

When converting .docx content to Markdown, option buttons (such as check boxes) are dropped entirely from the output. The impact can be significant when the content items matter and the states of the option buttons provide context for them. ...

Steps to reproduce

  • Download a sample .docx file that contains option buttons such as check boxes.
  • Convert the file to Markdown with Docling.
  • Inspect the output: the option buttons are missing.
  • Convert the same file to Markdown with another tool such as pandoc: the option buttons appear with their correct states. ...
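
The inspection step above can be sketched in Python. This is a minimal sketch, not part of the original report: the helper names `count_checkbox_markers` and `docx_to_markdown`, the file name `sample.docx`, and the set of markers treated as a checkbox are all assumptions made for illustration. The conversion call uses Docling's `DocumentConverter` and `export_to_markdown`.

```python
import re

# Markers that commonly stand for a checkbox in Markdown or plain text.
# This set is an assumption for illustration, not an exhaustive list.
CHECKBOX_PATTERN = re.compile(r"\[[ xX]\]|[\u2610\u2611\u2612]")


def count_checkbox_markers(markdown: str) -> int:
    """Return how many checkbox-like markers appear in the Markdown text."""
    return len(CHECKBOX_PATTERN.findall(markdown))


def docx_to_markdown(path: str) -> str:
    """Convert a .docx file to Markdown with Docling."""
    # Imported lazily so count_checkbox_markers works without Docling installed.
    from docling.document_converter import DocumentConverter

    result = DocumentConverter().convert(path)
    return result.document.export_to_markdown()
```

With a file at hand, `count_checkbox_markers(docx_to_markdown("sample.docx"))` returning 0 for a document known to contain check boxes would be consistent with the behavior reported here. The same counter can be run on another tool's output for comparison.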

Docling version

v.2.17.0 ...

Python version

Python 3.9.6 ...

Cybonto, Jan 31 '25 19:01

Hello @Cybonto, thanks for opening this issue. Option buttons in lists, along with their checked or unchecked state, are not currently supported in .docx document conversion, so they do not appear in Markdown exports as task list item markers. I have set this issue as an enhancement. The implementation of this feature should be rather simple, and we will follow up once we know whether and when it can be implemented.

ceberam, Feb 11 '25 12:02

Thanks for the update. I did find a workaround: I used another docx processing package to replace the dynamic form items with plain text.
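
The package used for this workaround is not named in the thread. As one possible illustration of the idea, the sketch below uses only the Python standard library and assumes the option buttons are Word checkbox content controls (`w:sdt` elements carrying a `w14:checkbox` property). The function name `flatten_checkboxes` and the `[x]` / `[ ]` markers are hypothetical choices; legacy form-field check boxes are not handled.

```python
import xml.etree.ElementTree as ET

# WordprocessingML namespaces used by checkbox content controls.
W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
W14 = "http://schemas.microsoft.com/office/word/2010/wordml"


def flatten_checkboxes(root: ET.Element) -> int:
    """Replace checkbox glyphs with "[x]" or "[ ]" in place.

    Returns the number of checkbox content controls rewritten.
    """
    rewritten = 0
    for sdt in root.iter(f"{{{W}}}sdt"):
        checkbox = sdt.find(f"{{{W}}}sdtPr/{{{W14}}}checkbox")
        if checkbox is None:
            continue  # some other kind of content control
        checked = checkbox.find(f"{{{W14}}}checked")
        # Assumption: a missing state element or attribute means "unchecked".
        value = checked.get(f"{{{W14}}}val") if checked is not None else "0"
        marker = "[x]" if value in ("1", "true") else "[ ]"
        content = sdt.find(f"{{{W}}}sdtContent")
        if content is None:
            continue
        text_nodes = list(content.iter(f"{{{W}}}t"))
        # Put the marker in the first text node and blank out the rest.
        for index, node in enumerate(text_nodes):
            node.text = marker if index == 0 else ""
        if text_nodes:
            rewritten += 1
    return rewritten
```

In a real file the XML to parse lives in `word/document.xml` inside the .docx archive. Writing the modified tree back with `ElementTree` can rewrite namespace prefixes, so a docx-aware library may be the safer choice for producing the final file.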

Cybonto, Feb 14 '25 20:02