robots-txt topic

List robots-txt repositories

gatsby-plugin-robots-txt

106 stars, 26 forks

Gatsby plugin that automatically creates robots.txt for your site

advertools

1.1k stars, 201 forks

Online marketing productivity and analysis tools

robotstxt

267 stars, 55 forks

An implementation of the robots.txt exclusion protocol for the Go language

crawler-commons

229 stars, 73 forks

A set of reusable Java components that implement functionality common to any web crawler

InfinityCrawler

239 stars, 35 forks

A simple but powerful web crawler library for .NET

ultimate-sitemap-parser

171 stars, 64 forks

Ultimate Website Sitemap Parser

fetchbot

781 stars, 94 forks

A simple and flexible web crawler that follows the robots.txt policies and crawl delays.

gocrawl

2.0k stars, 194 forks

Polite, slim and concurrent web crawler.

robots

392 stars, 30 forks

NuxtJS module for robots.txt

robots-txt

213 stars, 35 forks

Determine if a page may be crawled from robots.txt, robots meta tags and robots headers