advanced-pandas
Pandas is a powerful tool for data exploration and analysis, including time series.
Advanced data analysis techniques with Pandas
https://python.g-node.org Summer School 2017
Setup
$ conda env create -f environment.yml
$ source activate advanced-pandas
$ jupyter notebook
Outline
- Pandas objects
- Data exploration
- Time series operations
- Multi-indexing
- Testing (defensive programming)
- SQL-type operations
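As a rough preview of the topics in the outline, here is a minimal sketch that touches each one in a few lines. It is not taken from the course notebooks: the `readings` and `units` tables, the sensor names, and all values are made up for illustration.

```python
import pandas as pd

# Hypothetical data: daily readings from two made-up sensors.
days = pd.date_range("2017-09-01", periods=4, freq="D")
readings = pd.DataFrame(
    {
        "sensor": ["a", "b"] * 4,
        "day": days.repeat(2),
        "value": [1.0, 10.0, 2.0, 20.0, 3.0, 30.0, 4.0, 40.0],
    }
)

# Data exploration: summary statistic per group.
mean_by_sensor = readings.groupby("sensor")["value"].mean()

# Multi-indexing: index the rows by (sensor, day).
indexed = readings.set_index(["sensor", "day"]).sort_index()

# Time series operations: select one sensor, then average over 2-day bins.
sensor_a = indexed["value"].xs("a", level="sensor")
two_day_mean = sensor_a.resample("2D").mean()

# Testing (defensive programming): fail early if assumptions are violated.
assert readings["value"].notna().all(), "unexpected missing values"
assert not readings.duplicated(["sensor", "day"]).any(), "duplicate readings"

# SQL-type operations: a left join against a hypothetical lookup table.
# validate= makes pandas raise if `units` has more than one row per sensor.
units = pd.DataFrame({"sensor": ["a", "b"], "unit": ["mV", "uV"]})
merged = readings.merge(units, on="sensor", how="left", validate="many_to_one")

print(mean_by_sensor)
print(two_day_mean)
print(merged.head())
```

The `validate` argument on `merge` is one way to combine the defensive-programming and SQL-type themes: the join itself checks the expected key relationship instead of silently duplicating rows.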
References
- Python Data Science Handbook by Jake VanderPlas
- Tidy data by Hadley Wickham
- Feather format for sharing datasets, by Wes McKinney