data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
- Add loading of `devices.csv` to incorporate missing columns (`browser_type` and `os_type`).
- Perform a left join between `events` and `devices` datasets using `device_id`.
- Rename columns (`browser_type` -> `browser_family`,...
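The load, left join, and rename steps listed above can be sketched in plain Python. This is only an illustration using the standard library, not the Spark code from the change itself: the sample rows, the `url` column, and the helper names `read_rows` and `left_join` are assumptions, and only the `browser_type` rename is included because the rest of the mapping is cut off in the description.

```python
import csv
import io

# Hypothetical sample data; the real events and devices files are not shown.
EVENTS_CSV = """event_id,device_id,url
1,10,/home
2,11,/about
3,99,/signup
"""

DEVICES_CSV = """device_id,browser_type,os_type
10,Chrome,Windows
11,Safari,macOS
"""

# Only this rename is visible in the description; the others are elided there.
RENAMES = {"browser_type": "browser_family"}


def read_rows(text):
    """Parse CSV text into a list of dicts keyed by column name."""
    return list(csv.DictReader(io.StringIO(text)))


def left_join(events, devices, key="device_id"):
    """Keep every event; attach device columns on a key match, else None.

    Assumes device ids are unique in `devices`. With duplicate ids a real
    left join would emit one row per match, which this sketch does not do.
    """
    device_columns = [c for c in devices[0] if c != key] if devices else []
    by_key = {device[key]: device for device in devices}
    joined = []
    for event in events:
        device = by_key.get(event[key])
        row = dict(event)
        for column in device_columns:
            # Apply the rename while copying the device column across.
            row[RENAMES.get(column, column)] = device[column] if device else None
        joined.append(row)
    return joined


rows = left_join(read_rows(EVENTS_CSV), read_rows(DEVICES_CSV))
for row in rows:
    print(row)
```

The event with `device_id` 99 has no matching device, so it is kept with `None` in the device columns, which is the behavior that distinguishes a left join from an inner join.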
This document provides an overview of graph data modeling, explaining how vertices (nodes) and edges (relationships) represent interconnected data. It includes SQL scripts for setting up a graph database, inserting...
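As a rough illustration of the vertex and edge model described above, here is a minimal in-memory sketch in Python rather than SQL. The class names, field names, and the player/team sample data are all hypothetical and are not taken from the SQL scripts the document refers to.

```python
from dataclasses import dataclass, field


@dataclass
class Vertex:
    """A node: an identifier, a type, and free-form properties."""
    identifier: str
    vertex_type: str
    properties: dict = field(default_factory=dict)


@dataclass
class Edge:
    """A directed relationship from one vertex to another."""
    subject_id: str
    object_id: str
    edge_type: str
    properties: dict = field(default_factory=dict)


class Graph:
    def __init__(self):
        self.vertices = {}   # identifier -> Vertex
        self.outgoing = {}   # subject identifier -> list of Edge

    def add_vertex(self, vertex):
        self.vertices[vertex.identifier] = vertex

    def add_edge(self, edge):
        # Reject edges whose endpoints have not been added as vertices.
        if edge.subject_id not in self.vertices or edge.object_id not in self.vertices:
            raise KeyError("both endpoints must exist before adding an edge")
        self.outgoing.setdefault(edge.subject_id, []).append(edge)

    def neighbors(self, identifier, edge_type=None):
        """Return ids reachable over one outgoing edge, optionally filtered by type."""
        return [
            edge.object_id
            for edge in self.outgoing.get(identifier, [])
            if edge_type is None or edge.edge_type == edge_type
        ]


# Hypothetical sample data.
graph = Graph()
graph.add_vertex(Vertex("p1", "player", {"name": "Alice"}))
graph.add_vertex(Vertex("p2", "player", {"name": "Bob"}))
graph.add_vertex(Vertex("t1", "team", {"name": "Falcons"}))
graph.add_edge(Edge("p1", "t1", "plays_on"))
graph.add_edge(Edge("p2", "t1", "plays_on"))
graph.add_edge(Edge("p1", "p2", "plays_with", {"games": 12}))

print(graph.neighbors("p1"))
print(graph.neighbors("p1", edge_type="plays_on"))
```

In a relational setup the same shape would usually become two tables, one for vertices and one for edges, with the edge rows referencing vertex identifiers.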
I also added the complete syntax; I hope it helps someone.
Hi there, I'm opening this issue to share some feedback on your great bootcamp content, but feel free to close it if this is not welcome or...
**bootcamp/materials/2-fact-data-modeling/lecture-lab/user_cumulated_populate.sql** conflicts with **bootcamp/materials/1-dimensional-data-modeling/docker-compose.yml**, which specifies `image: postgres:14`. Reported reason: the `DATE_TRUNC` function is not available in 14. The current pg version is 17. It looks like the `DATE` function delivers the same results as...
- **Redash**: Connect and query your data sources, build dashboards to visualize data and share them with your company.
- **Lightdash**: Instantly turns your dbt project into a full stack BI platform....
Clean up Spark Dataset API and make the notebook work successfully.