website-evidence-collector
Added spider/crawler functionality
Added a spider capability to the website-evidence-collector so that it can be instructed to gather evidence from all pages of a website.
Hi,
how can I install this?
I don't think GitHub offers a way to download a zip from a pull request, but you can download it from the fork I created, on which the pull request is based: https://github.com/bitnesswise/website-evidence-collector/tree/spider
I just noticed that both #65 and #66 introduce a great many changes, so it is likely better to integrate this feature only after the proposed refactoring.
Dear @bitnesswise,
this is just to let you know that I eventually managed to merge #66, which comes with many changes. I do not know how best to proceed with this PR now. I see the following options:
- rebase this branch on master
- write a separate spider script that uses the modules that WEC exposes now (possibly using a new distinct PR)
- abandon this PR for the time being
The second option possibly offers the most flexibility. The new script can live in the WEC repo or in a separate repo. If this feature does not require a lot of code, it can also be just one more option, as is the case in the current PR.
What's your view on this?