crawlpy
crawlpy copied to clipboard
__ROADMAP__
Roadmap
v0.4
- [ ] First parse JS, then crawl (possibly via phantomJS)
v0.3
- [ ] Make self-contained spider
v0.2
- [X] ~~Be able to specify http status codes allowed for crawling (not just only
200
)~~ - [X] ~~overwrite internal DEPTH_LIMIT~~
- [x] ~~Ignore list of url patterns~~
v0.1
- [X] ~~Make it work~~
Hi! I can help.