headless-chrome-crawler
headless-chrome-crawler copied to clipboard
Suggestion: Collect links should be extendable and/or have more infos than their URL
What is the current behavior?
_collectLinks only keep the href of URLs.
What is the expected behavior?
Would be nice to have, or be able to request also:
- The anchor tag if any
- The node (iframe? a? link?)
- The rel attribute (was there a nofollow? a follow?)
What is the motivation / use case for changing the behavior?
You may decide to follow (or not) some type of links based on more than their depth.
@panthony Thanks for a good issue!