itsy

A threaded web-spider written in Clojure

Results 5 itsy issues

It'd be nice to change the location of itsy.log. My main partition is pretty small and would fill up pretty quickly during the run I'm doing right now. Not a...

I have added a default-delay-policy (wait 3 seconds before the next request). A ready queue is now used to look at the next set of URLs. Increments are done once...
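For readers unfamiliar with the idea, here is a minimal sketch of a per-host delay policy backed by a ready queue. It is written in Python for illustration and is not itsy's code: the class `PoliteQueue`, its methods, and the choice to pass the current time in explicitly are all assumptions made for this example. Only the 3-second default comes from the issue text.

```python
import heapq
from urllib.parse import urlsplit


class PoliteQueue:
    """Hypothetical per-host delay queue (illustration only, not itsy's API)."""

    def __init__(self, delay_seconds=3.0):
        self.delay = delay_seconds
        self._next_allowed = {}  # host -> earliest time the next request may start
        self._heap = []          # entries are (ready_time, sequence, url)
        self._seq = 0            # tie-breaker that preserves insertion order

    def add(self, url, now):
        """Schedule url no earlier than its host's next allowed time."""
        host = urlsplit(url).netloc
        ready_at = max(now, self._next_allowed.get(host, now))
        # Reserve the slot: the next URL for this host waits one more delay.
        self._next_allowed[host] = ready_at + self.delay
        heapq.heappush(self._heap, (ready_at, self._seq, url))
        self._seq += 1

    def pop_ready(self, now):
        """Return every URL whose scheduled time has arrived, in order."""
        ready = []
        while self._heap and self._heap[0][0] <= now:
            ready.append(heapq.heappop(self._heap)[2])
        return ready
```

A worker would call `pop_ready(time.monotonic())` in its loop and fetch whatever comes back. URLs for different hosts become ready immediately, while a second URL for the same host is held back by the delay.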

Any plans for robots.txt compliance?

Hi, right now your crawler polls sites quite aggressively. It would be appropriate to wait a short while before sending another request to the same host.

Hey dakrone, you mention Itsy's domain-limiting capabilities. Can you elaborate? In this case, I'd like to, e.g., extract only pages->text which have a certain domain pattern. Of course I...