etl-job topic
bulk-writer
Provides guidance for fast ETL jobs, an IDataReader implementation for SqlBulkCopy (or the MySql or Oracle equivalents) that wraps an IEnumerable, and libraries for mapping entites to table columns.
pyspark-example-project
Implementing best practices for PySpark ETL jobs and applications.
goodreads_etl_pipeline
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Etl.Net
Mass processing data with a complete ETL for .net developers
terraform-aws-kinesis-firehose
This code creates a Kinesis Firehose in AWS to send CloudWatch log data to S3.
analyst
A declarative, SQL-like DSL for data integration tasks.
terraform-aws-glue
Terraform modules for provisioning and managing AWS Glue resources
pyspark-template
A Python PySpark Projet with Poetry
DataModelling
This repo will guide you step-by-step method to create star schema dimensional model.