data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
The following CTE block: https://github.com/DataExpert-io/data-engineer-handbook/blob/954ac02af1edcceaf56c0e784ec88c3904e73550/bootcamp/materials/1-dimensional-data-modeling/lecture-lab/incremental_scd_query.sql#L67-L75 can be simplified to (using `CROSS JOIN` to the `UNNEST`ed array):

```sql
unnested_changed_records AS (
    SELECT c.player_name, r.*
    FROM changed_records c, UNNEST(c.records) r
),
```
...
The original link (https://newsletter.interestinggigs.com) was not working and showed a DNS error. Updated the link to the working URL (https://www.interestinggigs.com).
Python for Data Analysis by Wes McKinney: another option similar to "Pandas Cookbook", recommended for intermediate software engineers. It explains the pandas library, which is one of the most used in...
I noticed running `pg_restore -U user -d postgres data.dump` repeatedly causes some tables to grow in size. I expected tables to be the same no matter how many restores are...
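One possible explanation, not confirmed in the report: without `--clean`, `pg_restore` does not drop existing objects, so restoring into a database that already holds the tables can load their rows a second time. The sketch below uses a hypothetical helper, `build_restore_cmd`, that only prints a command line adding the standard `--clean` and `--if-exists` options; the user, database, and file names are taken from the report.

```shell
#!/bin/sh
# Hypothetical helper (not part of the repo): builds a pg_restore command
# line without running it.
# --clean drops each database object before recreating it.
# --if-exists avoids errors for objects that do not exist yet on a first run.
build_restore_cmd() {
    db_user="$1"
    db_name="$2"
    dump_file="$3"
    printf 'pg_restore --clean --if-exists -U %s -d %s %s\n' \
        "$db_user" "$db_name" "$dump_file"
}

# Print the command for review before running it by hand.
build_restore_cmd user postgres data.dump
```

Printing the command rather than executing it keeps the sketch safe to run on a machine without PostgreSQL; note that `--clean` drops the existing objects, so it should only be used when the current table contents are disposable.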
- added gp to season_stats struct
- added years_since_last_active column to players table
- fixed players.sql and pipeline_query.sql column name mismatches
- added logic for new columns
- fixed logic...
```
Run this command after replacing with your computer's username:
psql -U postgres < data.dump
```
This command failed until I did `CREATE ROLE WITH LOGIN SUPERUSER PASSWORD ;` In...
Adjust ddl, season_stats type and insert query to make the script work.
In the lab video the column is named `scoring_class` : ```sql CREATE TABLE players ( player_name TEXT, height TEXT, college TEXT, country TEXT, draft_year TEXT, draft_round TEXT, draft_number TEXT, season_stats...