crawlee
crawlee copied to clipboard
Handle `Crawl-delay` directive in robots.txt
It's not part of the original robots.txt spec, but it may be helpful to support it. We should probably look into how common it actually is.
we should revisit this now that we have the respectRobotsTxtFile option. but for this to work, we'll likely need to resolve the bugs around sameDomainDelaySecs (they might as well be issues with RQ2 in disguise)