data-transformation topic

List data-transformation repositories

glom

1.8k
Stars
60
Forks
Watchers

☄️ Python's nested data operator (and CLI), for all your declarative restructuring needs. Got data? Glom it! ☄️

cq

153
Stars
9
Forks
Watchers

Clojure Query: A Command-line Data Processor for JSON, YAML, EDN, XML and more

temme

272
Stars
13
Forks
Watchers

📄 Concise selector to extract JSON from HTML.

serializer

28
Stars
1
Forks
Watchers

A PHP serialization component focused on performance

Porter

610
Stars
28
Forks
Watchers

:lipstick: Durable and asynchronous data imports for consuming data at scale and publishing testable SDKs.

optimus

737
Stars
153
Forks
Watchers

Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.

zingg

889
Stars
109
Forks
Watchers

Scalable identity resolution, entity resolution, data mastering and deduplication using ML

optimus

1.4k
Stars
234
Forks
Watchers

:truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark

pglogical

941
Stars
151
Forks
Watchers

Logical Replication extension for PostgreSQL 15, 14, 13, 12, 11, 10, 9.6, 9.5, 9.4 (Postgres), providing much faster replication than Slony, Bucardo or Londiste, as well as cross-version upgrades.

sqawk

308
Stars
14
Forks
Watchers

Like awk but with SQL and table joins