great-expectations topic

List great-expectations repositories

prefect-great-expectations

27
Stars
2
Forks
Watchers

Prefect integrations for interacting with Great Expectations

testing-ml

76
Stars
12
Forks
Watchers

Learn how to create reliable ML systems by testing code, data and models.

lakehouse-engine

188
Stars
35
Forks
Watchers

The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows and utilities for Data Prod...

data_engineering_best_practices

141
Stars
20
Forks
Watchers

Sample project to demonstrate data engineering best practices

covid-19-data-engineering-pipeline

22
Stars
5
Forks
Watchers

A Covid-19 data pipeline on AWS featuring PySpark/Glue, Docker, Great Expectations, Airflow, and Redshift, templated in CloudFormation and CDK, deployable via Github Actions.

How to evaluate the Quality of your Data with Great Expectations and Spark.

data-quality-gate

55
Stars
3
Forks
Watchers

Data Quality Gate based on AWS

energy-forecasting

799
Stars
181
Forks
Watchers

🌀 𝗧𝗵𝗲 𝗙𝘂𝗹𝗹 𝗦𝘁𝗮𝗰𝗸 𝟳-𝗦𝘁𝗲𝗽𝘀 𝗠𝗟𝗢𝗽𝘀 𝗙𝗿𝗮𝗺𝗲𝘄𝗼𝗿𝗸 | 𝗟𝗲𝗮𝗿𝗻 𝗠𝗟𝗘 & 𝗠𝗟𝗢𝗽𝘀 for free by designing, building and deploying an end-to-end ML batch system ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 2.5 𝘩𝘰𝘶𝘳𝘴 𝘰𝘧 𝘳𝘦𝘢𝘥𝘪𝘯𝘨 & 𝘷𝘪𝘥𝘦𝘰 𝘮𝘢𝘵𝘦𝘳𝘪𝘢𝘭𝘴

GreatEx

20
Stars
6
Forks
Watchers

A project for exploring how Great Expectations can be used to ensure data quality and validate batches within a data pipeline defined in Airflow.

data_validation

31
Stars
11
Forks
Watchers

Tutorial for implementing data validation in data science pipelines