crusty

Broad Web Crawler

Results 6 crusty issues

Hello, let4be. First of all, I want to say that what you have built is really impressive. I am really amazed, so congratulations. Furthermore, I see that you wrote in...

Figure out a way to auto-tune domain concurrency (there is a ~perfect N based on the CPU and network bandwidth available). Will need some kind of graceful adaptive algo which will look at...

enhancement
Low prio
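
One possible shape for such an adaptive algorithm is hill climbing on measured throughput: keep moving N in the same direction while throughput improves, and reverse when it drops. The sketch below is illustrative only. The issue text is truncated, so the signal the tuner looks at (throughput per sampling window, e.g. pages per second) is an assumption, and `ConcurrencyTuner` with its methods is a hypothetical name, not part of crusty.

```rust
/// Hypothetical hill-climbing tuner for domain concurrency (not crusty's API).
#[derive(Debug)]
pub struct ConcurrencyTuner {
    n: usize,                     // current concurrency N
    min: usize,                   // lower bound for N
    max: usize,                   // upper bound for N
    step: usize,                  // how far N moves per observation
    increasing: bool,             // current search direction
    last_throughput: Option<f64>, // throughput seen in the previous window
}

impl ConcurrencyTuner {
    pub fn new(initial: usize, min: usize, max: usize, step: usize) -> Self {
        assert!(min >= 1 && min <= max && step >= 1);
        Self {
            n: initial.clamp(min, max),
            min,
            max,
            step,
            increasing: true,
            last_throughput: None,
        }
    }

    /// Concurrency to use for the current sampling window.
    pub fn current(&self) -> usize {
        self.n
    }

    /// Feed the throughput measured at the current N (assumed metric:
    /// pages per second) and get the N to try in the next window.
    pub fn observe(&mut self, throughput: f64) -> usize {
        // A drop relative to the previous window means the last move
        // overshot the optimum, so reverse the search direction.
        if let Some(prev) = self.last_throughput {
            if throughput < prev {
                self.increasing = !self.increasing;
            }
        }
        self.last_throughput = Some(throughput);

        // Take one step in the current direction, staying within bounds.
        self.n = if self.increasing {
            self.n.saturating_add(self.step).min(self.max)
        } else {
            self.n.saturating_sub(self.step).max(self.min)
        };
        self.n
    }
}
```

A caller would sample throughput once per window, pass it to `observe`, and resize its worker pool to the returned value. A fixed step keeps the sketch short; a real tuner would likely also smooth noisy measurements and shrink the step as it settles near the optimum.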

It seems like it could significantly simplify the code

enhancement

It's especially noticeable when using shorter-lived jobs (`soft_timeout` = 30s, `hard_timeout` = 60s): throughput falls almost 3 times...

bug
enhancement

Seems like https://github.com/jedireza/warc could help

enhancement
Low prio

New one looks a lot tastier :)

enhancement
Low prio