crawlers topic

List crawlers repositories

crawlersuseragents

19
Stars
3
Forks
Watchers

Python script to check if there is any differences in responses of an application when the request comes from a search engine's crawler.

DrissionPage

5.3k
Stars
530
Forks
Watchers

基于python的网页自动化工具。既能控制浏览器,也能收发数据包。可兼顾浏览器自动化的便利性和requests的高效率。功能强大,内置无数人性化设计和便捷功能。语法简洁而优雅,代码量少。

APSoft-Web-Scanner-v2

107
Stars
33
Forks
Watchers

Powerful dork searcher and vulnerability scanner for windows platform

Rcrawler

347
Stars
95
Forks
Watchers

An R web crawler and scraper

isbot

835
Stars
72
Forks
Watchers

🤖/👨‍🦰 Detect bots/crawlers/spiders using the user agent string

hproxy

66
Stars
13
Forks
Watchers

hproxy - Asynchronous IP proxy pool, aims to make getting proxy as convenient as possible.(异步爬虫代理池)

social-scraper

68
Stars
40
Forks
Watchers

Vietnamese text data crawler scripts for various sites (including Youtube, Facebook, 4rum, news, ...)

wget-lua

82
Stars
14
Forks
Watchers

Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.

robots.txt

83
Stars
37
Forks
Watchers

Simple robots.txt template. Keep unwanted robots out (disallow). White lists (allow) legitimate user-agents. Useful for all websites.