extractor topic

List extractor repositories

seo-audits-toolkit

547
Stars
114
Forks
Watchers

SEO & Security Audit for Websites. Lighthouse & Security Headers crawler, Sitemap/Keywords/Images Extractor, Summarizer, etc ...

RuiJi.Net

262
Stars
47
Forks
Watchers

crawler framework, distributed crawler extractor

open-semantic-etl

252
Stars
68
Forks
Watchers

Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelin...

TorCrawl.py

219
Stars
48
Forks
Watchers

Crawl and extract (regular or onion) webpages through TOR network

undock

162
Stars
12
Forks
Watchers

Extract contents of a container image in a local folder

OpenBackupExtractor

154
Stars
25
Forks
Watchers

A free program for extracting data (like voicemails) from iPhone and iPad backups.

RecursiveExtractor

183
Stars
26
Forks
Watchers

RecursiveExtractor is a .NET Standard 2.0 archive extraction Library, and Command Line Tool which can process 7zip, ar, bzip2, deb, gzip, iso, rar, tar, vhd, vhdx, vmdk, wim, xzip, and zip archives an...

youtube-jextractor

111
Stars
26
Forks
Watchers

Android based library that allows you to download or play audio and video from Youtube, in other words - youtube-dl for android

runescape-cache-tools

78
Stars
17
Forks
Watchers

A .NET library and command-line interface to interact with RuneScape's cache.

YaEtl

63
Stars
16
Forks
Watchers

Yet Another ETL in PHP