data-extraction topic
hred
Reduce HTML and XML to JSON from the command line, using an expressive query language inspired by CSS selectors.
Scrapegraph-ai
Python scraper based on AI
firecrawl
🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data
wildberries-parser-in-python
WildBerries Parser is a Python script that extracts item information from Wildberries.ru and saves it in an Excel file. It supports parsing by directory or search keyword, collecting data like link, I...
youtube_data_engineering_project
Data Engineering Project: Extracting music video metrics of Twice using YouTube API, AWS, and Tableau
Exif
ExifTool is a powerful command-line tool that can be used to extract and edit metadata in a wide range of media files, including images, audio, and video. Metadata is information that is stored within...
scrappey-wrapper-python
An API wrapper for Scrappey.com written in Python (cloudflare, datadome bypass & solver)
maxun
🔥 Open Source No Code Web Data Extraction Platform. Turn Websites To APIs & Spreadsheets With No-Code Robots In Minutes 🔥
Scrapling
🕷️ An undetectable, powerful, flexible, high-performance Python library to make Web Scraping Easy and Effortless as it should be!
parsera
Lightweight library for scraping web-sites with LLMs