data-profiling topic

ydata-profiling

12.1k stars, 1.6k forks

One-line data quality profiling and exploratory data analysis for Pandas and Spark DataFrames.

OpenMetadata

5.4k stars, 1.0k forks

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team colla...

great_expectations

9.6k stars, 1.5k forks, 69 watchers

Always know what to expect from your data.

optimus

1.4k stars, 233 forks

🚚 Agile data preparation workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark

sweetviz

2.8k stars, 269 forks

Visualize and compare datasets, target values and associations, with one line of code.

odd-platform

1.1k stars, 93 forks

First open-source data discovery and observability platform. We make life easy for data practitioners so you can focus on your business.

bumblebee

137 stars, 35 forks

🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)

haupt

452 stars, 213 forks

Lineage metadata API, artifact streams, sandbox, and spaces for Polyaxon

traceml

493 stars, 43 forks

Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.