zmPDSwR icon indicating copy to clipboard operation
zmPDSwR copied to clipboard

Example R scripts and data for "Practical Data Science with R" 1st edition by Nina Zumel and John Mount (Manning Publications)

trafficstars

Example code and data for "Practical Data Science with R 1st Edition" by Nina Zumel and John Mount, Manning 2014.

Please consider upgrading to the in-progress "Practical Data Science with R 2nd Edition" by Nina Zumel and John Mount (Manning 2019) (code/data/examples here).

The book:

Additional materials

Zip file of code excerpts from the book:

  • CodeExamples.zip
  • CodeExamples directory
  • Notebooks running all examples

More from the authors:

Example data sets:

Includes works derived from others (data sets) remain controlled by them. We are distributing as these parties have allowed and not making any claim or grant of additional rights or license.

  • bioavailability Synthetic simple ADME data (source).
  • Bookdata Book ratings (source).
  • Buzz Discussion forum popularity (source).
  • CDC US CDC birth statistics (source).
  • Custdata Synthetic example data derived from Census PUMS data to demonstrate retail related plots.
  • KDD2009 Credit account prediction (source).
  • PUMS US Census PUMS (source).
  • Protein Dietary protein sources across multiple countries (source).
  • SQLExample Synthetic example data relating price to hotel reservation pickup.
  • Spambase Email spam classification (source).
  • Statlog German loans defaults (source).
  • UCICar Synthetic car ratings (source).

Download

You can download all of the examples and code by following the "git clone" or "download zip" instructions at our master repository: github.com/WinVector/zmPDSwR.

License for additional documentation, notes, code, and example data:

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

No guarantee, indemnification or claim of fitness is made regarding any of these items.

No claim of license on works of others or derived data.