DS-python-data-analysis
DS-python-data-analysis copied to clipboard
Notes from December 2021 workshop
Slides:
- [ ] Update the IDEs page? Should we still mention Eclipse with PyDev?
Jupyter introduction:
- [x] I wouldn't explain "raw text" cell?
- [x] There is a "math processing error"
Pandas:
- [] handle the nuisance columns deprecation
Pandas basic operations:
- [ ] move numpy import to exercise
Pandas groupby notebook
- [ ] Exercise 3: use "mean" instead
Matplotlib:
- [x] Pandas interaction: use of
.format
-> better explain this, or show in extended solution, or put in a Python call out box
Seaborn:
- [x] mention estimator functionality after the exercises
First day:
- pandas notebooks 1, 2, 3a and 4 (timeseries)
- start with bike count case study
Second day:
Morning:
- Continue with bike count
- indexing (loc/iloc)
- groupby
Afternoon:
- visualization matplotlib
- observations processing case study
Third day:
Morning:
- (Start observations analysis until everyone is present)
- Seaborn
- Tidy data (+ melt + pivot)
- Exercise with case 3 bacterial
Afternoon:
- Continue case 3 + case observations
- Combining data as intermezzo
- Visualization landscape