data-engineer-handbook icon indicating copy to clipboard operation
data-engineer-handbook copied to clipboard

This is a repo with links to everything you'd ever want to learn about data engineering

Results 129 data-engineer-handbook issues
Sort by recently updated
recently updated
newest added

- Add loading of `devices.csv` to incorporate missing columns (`browser_type` and `os_type`). - Perform a left join between `events` and `devices` datasets using `device_id`. - Rename columns (`browser_type` -> `browser_family`,...

This document provides an overview of graph data modeling, explaining how vertices (nodes) and edges (relationships) represent interconnected data. It includes SQL scripts for setting up a graph database, inserting...

Also added the complete syntax, I hope it helps someone.

Hi there, I allow myself to open an issue to put some feedbacks on your great bootcamp content but feel free to close it if this is not welcomed or...

**bootcamp/materials/2-fact-data-modeling/lecture-lab/user_cumulated_populate.sql** conflicts with **bootcamp/materials/1-dimensional-data-modeling/docker-compose.yml** that specifies `image: postgres:14` reason: DATE_TRUNC function is not available in 14. Current pg version is 17. looks like DATE function delivers the same results as...

**Redash**: Connect and query your data sources, build dashboards to visualize data and share them with your company. **Lightdash**: Instantly turns your dbt project into a full stack BI platform....

Clean up Spark Dataset API and make the notebook work successfully.