Python-Data-Wrangling-Legacy
Python-Data-Wrangling-Legacy copied to clipboard
Suggested changes to pandas lesson
Possible things to consider for intermediate/advanced pandas class:
- [ ] reading/writing to formats other than csv
- [ ] reading data in by chunks
- [ ] multi-indexing and its methods
- [ ] sparse matrices and their methods
- [ ] ways to speed up mathy things (lots of %timeit)
- [ ] parallelized methods for pandas
We can both also look up other pandas materials and think of other things we might like to add.
Thanks! This looks like a great start.
I'll start a concurrent list of other potentially useful topics/methods:
- [ ]
.get_dummies()
- [ ] split-apply-combine
- [ ] datetime
@davclark it might be nice to have your input on this too. The context is that we are introducing pandas in the intro series, so we might want to update the pandas workshop.
To be sure I'm clear - students will have basic pandas in the intro series, so here it will be intermediate / advanced topics?
Correct. I'm okay with some topical overlap or review of the basics, but it seemed like a good opportunity to expand our offerings.
On Thu, Feb 11, 2016 at 8:10 AM Dav Clark [email protected] wrote:
To be sure I'm clear - students will have basic pandas in the intro series, so here it will be intermediate / advanced topics?
— Reply to this email directly or view it on GitHub https://github.com/dlab-berkeley/introduction-to-pandas/issues/5#issuecomment-182937718 .
Also, this issue was started here because it was a space that was already available. When we actually create the actual intermediate / advanced materials, it will be in another repository. (I mention this in case there was confusion about why the topics mentioned in this issue were in the introduction to pandas repo.)