crawl4ai

(Question) How to retain specific HTML tags (e.g., <span class="entity-embed">) in HTML-to-Markdown conversion without converting them?

Open · truonghoangnguyen opened this issue 3 months ago • 1 comment

I'm working on a web crawling project where I need to convert HTML content into Markdown. However, I want certain HTML tags, like ..., to remain in their original HTML form in the Markdown output, without being converted.

Currently, when I run the conversion, all tags are transformed into Markdown, which removes specific structures I need to keep intact. Is there a way to retain specific tags or classes during the HTML-to-Markdown conversion?
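One library-agnostic way to approach this is to mask the tags before conversion and restore them afterwards. The sketch below is not crawl4ai's API. The names `KEEP_PATTERN` and `convert_preserving` are hypothetical, the `convert` argument stands in for whatever HTML-to-Markdown function is in use (for example `html2text.html2text`), and the regex assumes the preserved `<span>` elements do not contain nested `<span>` tags.

```python
import re
from typing import Callable, List

# Hypothetical pattern: a non-nested <span> whose class list contains "entity-embed".
# Regex matching of HTML is fragile; nested <span> elements are NOT handled here.
KEEP_PATTERN = re.compile(
    r'<span\b[^>]*\bclass="[^"]*\bentity-embed\b[^"]*"[^>]*>.*?</span>',
    re.DOTALL | re.IGNORECASE,
)


def convert_preserving(
    html: str,
    convert: Callable[[str], str],
    keep: "re.Pattern[str]" = KEEP_PATTERN,
) -> str:
    """Convert HTML to Markdown while leaving fragments matched by `keep` untouched."""
    kept: List[str] = []

    def stash(match: "re.Match[str]") -> str:
        # Swap the raw HTML for an alphanumeric token that a converter
        # is unlikely to escape or rewrite.
        kept.append(match.group(0))
        return f"KEEPTAG{len(kept) - 1}END"

    masked = keep.sub(stash, html)
    markdown = convert(masked)

    # Put the original HTML fragments back in place of the tokens.
    return re.sub(r"KEEPTAG(\d+)END", lambda m: kept[int(m.group(1))], markdown)
```

Usage would look like `convert_preserving(page_html, html2text.html2text)`, assuming the `html2text` package is installed. The token format is an arbitrary choice; it only needs to be a string that does not occur in the page and that the converter passes through unchanged.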

truonghoangnguyen · Oct 30 '24 02:10