headless-chrome-crawler

Suggestion: Collected links should be extensible and/or carry more information than their URL

Open • panthony opened this issue 6 years ago • 1 comment

What is the current behavior?

_collectLinks only keeps the href of each link.

What is the expected behavior?

It would be nice to also have, or be able to request:

  • The anchor tag, if any
  • The node type (iframe? a? link?)
  • The rel attribute (was there a nofollow? a follow?)
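
To make the request concrete, here is a minimal sketch of what a richer link record could look like. Everything in it is hypothetical: `describeLink`, `collectLinkInfo`, `fakeElement` and the `{ url, tag, text, rel }` shape are illustrative names, not part of headless-chrome-crawler, and "anchor" is read here as the link's visible text, which is an assumption.

```javascript
// Hypothetical sketch: none of these names exist in headless-chrome-crawler.

// Turn one DOM-like element into a richer link record.
function describeLink(el) {
  const tag = el.tagName.toLowerCase();
  // iframes carry their target in `src`; <a> and <link> use `href`.
  const url = el.getAttribute(tag === 'iframe' ? 'src' : 'href');
  // `rel` is a space-separated token list, e.g. "nofollow noopener".
  const rel = (el.getAttribute('rel') || '')
    .toLowerCase()
    .split(/\s+/)
    .filter(Boolean);
  return {
    url,
    tag,
    // Assumption: "anchor" means the visible text of an <a> element.
    text: tag === 'a' ? (el.textContent || '').trim() : null,
    rel,
  };
}

// Build records for every element and drop those without a URL.
function collectLinkInfo(elements) {
  return Array.from(elements, describeLink).filter(link => link.url);
}

// Minimal stand-in for a DOM element so the sketch runs in plain Node.js.
function fakeElement(tagName, attrs, textContent = '') {
  return {
    tagName: tagName.toUpperCase(),
    textContent,
    getAttribute: name => (name in attrs ? attrs[name] : null),
  };
}

const links = collectLinkInfo([
  fakeElement('a', { href: 'https://example.com/a', rel: 'nofollow noopener' }, ' Read more '),
  fakeElement('iframe', { src: 'https://example.com/frame' }),
  fakeElement('a', {}, 'no href'),
]);
console.log(links);
```

In a page context the same function could presumably be fed `document.querySelectorAll('a[href], iframe[src], link[href]')` instead of the fake elements.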

What is the motivation / use case for changing the behavior?

You may want to decide whether to follow certain types of links based on more than just their depth.
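
As an illustration of that use case, a follow policy could look roughly like the sketch below. It assumes each link is available as a plain `{ url, tag, rel, depth }` object; that shape, the `shouldFollow` name and its options are all hypothetical and not something the crawler provides today.

```javascript
// Hypothetical policy: the record shape { url, tag, rel, depth } is an assumption.
function shouldFollow(link, options = {}) {
  const { maxDepth = 2, allowedTags = ['a'], respectNofollow = true } = options;
  if (link.depth > maxDepth) return false;            // the usual depth limit
  if (!allowedTags.includes(link.tag)) return false;  // e.g. skip iframes
  if (respectNofollow && link.rel.includes('nofollow')) return false;
  return true;
}

const candidates = [
  { url: 'https://example.com/docs', tag: 'a', rel: [], depth: 1 },
  { url: 'https://example.com/ad', tag: 'a', rel: ['nofollow'], depth: 1 },
  { url: 'https://example.com/embed', tag: 'iframe', rel: [], depth: 1 },
  { url: 'https://example.com/deep', tag: 'a', rel: [], depth: 3 },
];

// Keep only the URLs the policy accepts.
const toFollow = candidates
  .filter(link => shouldFollow(link))
  .map(link => link.url);
console.log(toFollow);
```

With the default options only the first candidate passes; passing `{ respectNofollow: false }` would also let the `nofollow` link through.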

panthony • Mar 31 '18 09:03

@panthony Thanks for a good issue!

yujiosaka • Apr 02 '18 06:04