firecrawl icon indicating copy to clipboard operation
firecrawl copied to clipboard

v1: Extract LinksOnPage from HTML that has includedTags / excludedTags parameters already applied

Open calebpeffer opened this issue 6 months ago • 0 comments

A customer who is using the linksOnPage field noticed that it still includes links from headers and footers, even though they have been removed from the content.

Move the URL extraction code to extract after the tags have been pruned.

@nickscamara Assigning to you so you can Quarterback (pick whoever is best suited for the task and assign)

calebpeffer avatar Aug 01 '24 20:08 calebpeffer