data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
In the [Dimensional Data Modeling Day 2 Lab](https://www.youtube.com/watch?v=nyu-8Si21ec) it is always referred to as _scoring_class_, so running [load_players_table_day2.sql](../blob/main/bootcamp/materials/1-dimensional-data-modeling/sql/load_players_table_day2.sql) results in an error due to bad casting (in day 1 the TYPE created is named _scoring_class_...
…students are facing issues with it. Changed the order of `is_active` and `current_season`, as students get the error `"is_active" is of type boolean but expression is of type integer`.
When composing the Makefile with Docker, `make up` throws the error `docker-compose: command not found`. That is because the standalone `docker-compose` command isn't installed, while the `docker compose` subcommand is. Removing the dash fixed all...
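A minimal sketch of how a script could pick whichever Compose invocation is present, rather than hard-coding one form. The function name `compose_cmd` is hypothetical and not part of the repo; it assumes only that `docker compose version` succeeds when the Compose plugin is installed.

```shell
#!/bin/sh
# Hypothetical helper: print the Compose invocation available on this machine.
# Prefers the "docker compose" plugin form, falls back to the standalone
# "docker-compose" binary, and fails if neither is found.
compose_cmd() {
  if docker compose version >/dev/null 2>&1; then
    echo "docker compose"
  elif command -v docker-compose >/dev/null 2>&1; then
    echo "docker-compose"
  else
    echo "no Docker Compose installation found" >&2
    return 1
  fi
}
```

A script could then run `$(compose_cmd) up -d`, which works on machines with either form installed.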
Updated the Postgres `data.dump` import instructions. Also added a specific troubleshooting line about `error: invalid command \N`, which happens if you try to import the binary `data.dump` using psql.
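As a rough illustration of the troubleshooting note above, the sketch below checks whether a dump is in PostgreSQL's custom (binary) archive format, which starts with the bytes `PGDMP` and is meant for `pg_restore`, or is a plain SQL script that `psql` can read. The function name `dump_format` and the database name in the comments are assumptions, not taken from the repo's instructions.

```shell
#!/bin/sh
# Hypothetical helper: classify a PostgreSQL dump by its first five bytes.
# Custom-format archives written by pg_dump begin with the magic string "PGDMP".
dump_format() {
  magic=$(head -c 5 "$1" 2>/dev/null | tr -d '\0')
  if [ "$magic" = "PGDMP" ]; then
    echo "custom"   # restore with: pg_restore --no-owner -d mydb data.dump
  else
    echo "plain"    # load with:    psql -d mydb -f data.dump
  fi
}
```

Running `dump_format data.dump` before importing gives a quick hint about which tool to use; `mydb` is a placeholder for whatever database name your setup uses.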
**Amazing resources right here! I found this repo while looking for a Feature Engineering Repo**, which is crucial for working with data, especially when it comes to data quality. I...
Added Soumil Shah and Ananth Pakkildurai, both widely followed influencers in the data engineering space.
Added the table creation and INSERT logic of the Lab1 Fact Data.
Hi, the events table in the repo is different from the one in the lab video: it's missing the `browser_family` column.
This PR aims to fix the `pipeline_query.sql` in accordance with the lab work done by Zach. It adds the missing column, `years_since_last_active`, which was preventing the query from running. Additionally,...