web-crawler topic

List web-crawler repositories

WebsiteParser

5
Stars
0
Forks
Watchers

Simple library which parses web pages into objects usin attributes

bathyscaphe

92
Stars
25
Forks
Watchers

Fast, highly configurable, cloud native dark web crawler.

ChiChew

25
Stars
7
Forks
Watchers

:notebook_with_decorative_cover: 教育部《重編國語辭典修訂本》 網路爬蟲 :: A live web crawler for the Chinese-Chinese dictionary published by the Ministry of Education in Taiwan

CrawlBox

139
Stars
39
Forks
Watchers

Easy way to brute-force web directory.

kochat

444
Stars
180
Forks
Watchers

Opensource Korean chatbot framework

ComicBookMaker

34
Stars
7
Forks
Watchers

Script to fetch webcomics and use them to create ebooks.

CVPR2019

70
Stars
12
Forks
Watchers

Displays all the 2019 CVPR Accepted Papers in a way that they are easy to parse.

supercrawler

370
Stars
63
Forks
Watchers

A web crawler. Supercrawler automatically crawls websites. Define custom handlers to parse content. Obeys robots.txt, rate limits and concurrency limits.

GoodreadsScraper

117
Stars
30
Forks
Watchers

Scrape data from Goodreads using Scrapy and Selenium :books:

ant

276
Stars
16
Forks
Watchers

A web crawler for Go