dataquality topic
OpenMetadata
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column-level lineage, and seamless team colla...
data-diff
Compare tables within or across databases
zingg
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
chaos_genius
ML-powered analytics engine for outlier detection and root cause analysis.
great_expectations
Always know what to expect from your data.
deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
DataCleaner
The premier open-source Data Quality solution
lale
Library for Semi-Automated Data Science
re-data
re_data - fix data issues before your users and CEO discover them 😊
amora-data-build-tool
Amora Data Build Tool enables analysts and engineers to transform data on the data warehouse (BigQuery) by writing Amora Models that describe the data schema using Python's "PEP 484 - Type Hints" and s...