data-processing topic

List data-processing repositories

awesome-web-scraping

6.4k
Stars
772
Forks
Watchers

List of libraries, tools and APIs for web scraping and data processing.

distributed-dataset

115
Stars
5
Forks
Watchers

A distributed data processing framework in Haskell.

prosto

89
Stars
4
Forks
Watchers

Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby

dasel

4.9k
Stars
112
Forks
Watchers

Select, put and delete data from JSON, TOML, YAML, XML and CSV files with a single tool. Supports conversion between formats and can be used as a Go package.

cq

153
Stars
9
Forks
Watchers

Clojure Query: A Command-line Data Processor for JSON, YAML, EDN, XML and more

prairie

22
Stars
4
Forks
Watchers

A visual programming environment for Python

xidel

653
Stars
38
Forks
Watchers

Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON docu...

nonechucks

373
Stars
27
Forks
Watchers

Deal with bad samples in your dataset dynamically, use Transforms as Filters, and more!