webcrawling topic
scrapyrt
HTTP API for Scrapy spiders
opensearchserver
Open-source Enterprise Grade Search Engine Software
seleniumcrawler
An example using Selenium webdrivers for python and Scrapy framework to create a web scraper to crawl an ASP site
FinnewsHunter
FinnewsHunter: 基于 AgenticX 的多智能体金融情报系统。实时监控全网财经资讯,并进行深度解读与情感分析,挖掘投资阿尔法信号
DotnetCrawler
DotnetCrawler is a straightforward, lightweight web crawling/scrapying library for Entity Framework Core output based on dotnet core. This library designed like other strong crawler libraries like Web...
heritrix3
Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.
Raspagem-de-dados-para-iniciantes
Raspagem de dados para iniciante usando Scrapy e outras libs básicas
ralger
ralger makes it easy to scrape a website. Built on the shoulders of titans: rvest, xml2.
gotor
This program provides efficient web scraping services for Tor and non-Tor sites. The program has both a CLI and REST API.
newspaperjs
News extraction and scraping. Article Parsing