Issue with Heading Extraction
Hi, Currently, all headings, including subheadings and child headings, are marked with ##, making them indistinguishable from one another. There is no clear differentiation between parent and nested headings.
Anyone else facing this issue?
Checked the same document with LlamaParse, and it identifies headers correctly @dolfim-ibm any ideas of how we can improve headers?
Same here, I tried to html the .pdf and all headers are identified as h2, plus level is always set to level = 1, so there's no way to easily identify h1, h2, h3...
dupe of #287?
Is there anyone following up on this issue?
Would be great if we get a fix around this. Is this in pipeline to be handled @dolfim-ibm ? Thank you!
Closing as duplicate of https://github.com/docling-project/docling/issues/287