unstructured icon indicating copy to clipboard operation
unstructured copied to clipboard

feat: custom HTML tag support

Open scanny opened this issue 1 year ago • 0 comments

HTML content contained in custom HTML tags is currently skipped during partitioning.

Enhance the HTML parser to include content in custom HTML-tags.

Additional Context Challenges include determining whether to treat a custom HTML element as a block-item (flow-item, containing one or more paragraphs) or inline (phrasing, run-level character formatting).

See also #3708

scanny avatar Dec 18 '24 06:12 scanny