data-science
Potential Tutorials and Guides
Dependency
- Re-evaluate once the tutorial issues are complete.
Overview
Produce guides and tutorials that help new data scientists and analysts complete the tasks assigned to them at Hack for LA.
Action Items
CoP Lead
- [x] Define our top three from the below list
- [ ] Send issue back to product team.
Product team
Make issues for the top three
Resources
- Tutorials Folder
- Docker/Selenium Tutorial
List of potential tutorials
- ETL/Data Cleaning: Pandas, statsmodels, scikit-learn #143
- Data Analysis: R #145
- Data Visualization: Pandas, Seaborn, Matplotlib, Tableau #144
- Data Engineering: SQL, NoSQL #147
- Docker: installation (potential standalone guide), building containers, running Python from within a container https://github.com/hackforla/ops/issues/13
- Web scraping: Python (Selenium, BeautifulSoup, Requests), using APIs #146
- Geospatial Data Analysis: GeoPandas, QGIS/ArcGIS #148
- Text Analysis: NLTK, spaCy #153
- Data Ops: EC2, Lambda, RDS, Athena/Hive, Flask #154
- Stats: Logistic/Linear Regression, Experimental Design, Significance Testing, Bayesian Analysis #155
- Machine Learning/Stats: XGBoost, Random Forest #156
- Deep Learning: PyTorch, Keras, HuggingFace #157
Created and assigned the following tickets:
- hackforla/ops#143
- hackforla/ops#144
- hackforla/ops#145
- hackforla/ops#146
- #147
@akhaleghi I added Ryan's tutorial list to the top section, but I didn't see "Create Data Analysis with R Tutorial" on it, even though he has made an issue for it. Can you talk with him to find out where it sits in the priority list above, and add it?