crawlers topic
crawlers
Norconex Crawlers (or spiders) are flexible web and filesystem crawlers for collecting, parsing, and manipulating data from the web or filesystem to various data repositories such as search engines.
serritor
Serritor is an open source web crawler framework built upon Selenium and written in Java. It can be used to crawl dynamic web pages that require JavaScript to render data.
user-agents
User agent database in JSON format of bots, crawlers, certain malware, automated software, scripts and uncommon ones.
licitacoes-de-feira
Licitações de Feira de Santana de fácil acesso aos cidadãos 🏦
laravel-block-bots
Block crawlers and high traffic users on your site by IP using Redis
sneakpeek
Sneakpeek is a framework that helps to quickly and conviniently develop scrapers. It’s the best choice for scrapers that have some specific complex scraping logic that needs to be run on a constant ba...
GooglePlayWebServiceAPI
Tiny script to crawl information of a specific application in the Google play/store base on PHP.
zcrawl
An open source web crawling platform
ai.robots.txt
A list of AI agents and robots to block.