R-for-Data-Science
R-for-Data-Science copied to clipboard
D-Lab R-intensive teaching materials
title: Materials for D-Lab's R for Data Science author: Dillon Niederhut
This repository contains the instructor materials for the D-Lab's R intensive.
If you are a student:
You can download the contents of this repository with:
git clone https://github.com/dlab-berkeley/r-for-data-science.git
or, by clicking the "Download Zip" button and then extracting the .zip
file.
The instructor of this workshop series will lead you through the activities for each day.
If you are a D-Lab instructor
You'll see accumulated teaching notes and examples for each day's topics in the instructor folder. For your convenience, these are available as .Rmd, commented .R files, PDF documents, and HTML slides. The meta-document for this workshop series, which explains the logic behind the structure and topics, can be viewed at the D-Lab guides repository
For information on contributing to this repository, see CONTRIBUTING.md
If you are a D-Lab facilitator
The standard Drupal workshop descriptions and facetweet postings for this workshop series are in PUBLICITY.md
Description
-
data/
: data necessary for interactive coding examples -
examples/
-
save_console_output.R
: R code for saving console output to pdf
-
-
instructor/
: teaching notes -
scripts/
-
feedback_cleaner.R
: used to clean data for use in Day 3 -
regenrate_files.R
: for regenerating.R
and.pdf
files from.Rmd
-
Topics:
This workshop series covers:
- Interacting with R
- Datatypes
- Data structures
- Reading data
- Sanitizing data
- Missing data
- Reshaping data
- Summary statistics
- Plotting
- Linear models
- Non-parametric models
- Functions
- Loops
- Parallelization
- Packages
Libraries
This workshop uses the following packages:
- Amelia
- devtools
- dplyr
- foreign
- ggplot2
- parallelMap
- RCurl
- roxygen2
- stringr
- tidyr
- XML
D-Lab == Data Intensive Social Science, For All!