
Improve JATS parsing with nested lists, inline formulas, ordered lists, content layers

Open ceberam opened this issue 10 months ago • 0 comments

Requested feature

To catch up with the latest docling and docling-core developments, we should add the following features to the JATS XML parser backend:

  • [ ] parse nested lists (currently only level 1 is addressed)
  • [ ] add inline formulas in inline groups
  • [ ] parse ordered lists (currently they are parsed as unordered lists)
  • [ ] distribute content in body or furniture content layers
  • [ ] support formatted text (bold, italic, underline, superscript, ...)
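
To make the first and third items concrete, here is a minimal sketch of how nested and ordered lists could be walked. It uses only the Python standard library and is not the docling backend code: `walk_list`, `ORDERED_TYPES`, and the sample snippet are hypothetical names made up for illustration. It assumes nested `<list>` elements appear as direct children of `<list-item>`, and that the `list-type` attribute carries one of the values suggested by the JATS tag library.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample input: an ordered list with a nested bullet list.
JATS_SNIPPET = """
<list list-type="order">
  <list-item><p>First step</p></list-item>
  <list-item>
    <p>Second step</p>
    <list list-type="bullet">
      <list-item><p>Detail A</p></list-item>
      <list-item><p>Detail B</p></list-item>
    </list>
  </list-item>
</list>
"""

# list-type values treated as ordered in this sketch (assumption).
ORDERED_TYPES = {"order", "alpha-lower", "alpha-upper", "roman-lower", "roman-upper"}


def walk_list(list_el, level=1):
    """Yield (level, ordered, text) per item, recursing into nested lists."""
    ordered = list_el.get("list-type") in ORDERED_TYPES
    for item in list_el.findall("list-item"):
        # Collect the text of the item's own paragraphs, normalizing whitespace.
        text = " ".join(
            " ".join("".join(p.itertext()).split()) for p in item.findall("p")
        )
        yield level, ordered, text
        # Assumption: nested lists are direct children of <list-item>.
        for nested in item.findall("list"):
            yield from walk_list(nested, level + 1)


items = list(walk_list(ET.fromstring(JATS_SNIPPET)))
for level, ordered, text in items:
    marker = "1." if ordered else "-"
    print("  " * (level - 1) + f"{marker} {text}")
```

The recursion carries the nesting level and the ordered flag along with each item, which is the information a backend would need to create nested list groups and choose between ordered and unordered list items.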

ceberam, Feb 27 '25 09:02