crawl4ai
crawl4ai copied to clipboard
(Question) How to retain specific HTML tags (e.g., <span class="entity-embed">) in HTML-to-Markdown conversion without converting them?
I'm working on a web crawling project where I need to convert HTML content into Markdown. However, I want certain HTML tags, like
, to remain in their original HTML form in the Markdown output, without being converted.Currently, when I run the conversion, all tags are transformed into Markdown, which removes specific structures I need to keep intact. Is there a way to retain specific tags or classes during the HTML-to-Markdown conversion?