dataengineering topic
pyDag
Scheduling Big Data Workloads and Data Pipelines in the Cloud with pyDag
DataEngineeringPilipinas
Data Engineering Pilipinas is a community for data engineers, data analysts, data scientists, developers, AI / ML engineers, and users of closed and open source data tools and methods / techniques in...
ghcn-d
Data Pipeline from the Global Historical Climatology Network DataSet
reddit-data-engineering
An end-to-end data engineering pipeline to create a dashboard for the latest content on the r/Stocks subreddit
bridgefour
Bridge Four is a simple, functional, effectful, single-leader, multi worker, distributed compute system optimized for embarrassingly parallel workloads.
modern-polars
Code and data for the Modern Polars book
data-engineering-and-dataops
Duke MIDS: Data Engineering and DataOps Course
jupyter_pandas_cheat_sheet
Learn the basic commands to use Pandas in Jupyter-Notebook to accomplish the most important Data Enginnering tasks. Read the underlying article on Medium:
sqlmesh
Efficient data transformation and modeling framework that is backwards compatible with dbt.
orangutan-stem
An open-source project dedicated to constructing robust data pipelines and scalable software infrastructure. We leverage industry-standard tools favored by developers to enhance efficiency and reliabi...