spider
spider copied to clipboard
Allow passing a parser as closuer when a route is matched
Making this a draft for now until I add more documentation and testing.
This basically adds a method to execute a closure so that we can start parsing pages before the completion of the whole crawl.
Many websites are big enough that crawling will take hours while we need a quick access to crawled data. Also, if anything goes wrong, we lose all the efforts spend.
This PR depends on #9. A rebase might be needed after the former is merged.
Ahh nice, I started working on this here https://github.com/madeindjs/spider/pull/13
There is a alight difference. My PR adds a list of callbacks.
If you manage to write tests for it. It would be perfect.
Thanks.
On February 11, 2022 3:05:07 AM GMT+01:00, Jeff @.***> wrote:
Ahh nice, I started working on this here https://github.com/madeindjs/spider/pull/13
-- Reply to this email directly or view it on GitHub: https://github.com/madeindjs/spider/pull/10#issuecomment-1035737210 You are receiving this because you authored the thread.
Message ID: @.***> Cordialement / Regards, Touhami https://touha.me
@Dragnucs We are shifting the branch target to main which will close the PR. Feel free to put it up again on main, thanks!