Data
Data copied to clipboard
Data for the Foundations of Applied Mathematics lab curriculum.
These data files correspond to the Foundations of Applied Mathematics lab curriculum.
Instead of downloading or cloning this repository directly, use the download_data.sh
script included in the Student-Materials repository.
$ cd /path/to/folder
$ bash download_data.sh
The download requires git, which you can download at https://git-scm.com/downloads.
Below the data files are listed by volume (in the order given by each table of contents), then alphabetically by lab folder.
Labs by Volume
Lab Title | Folder/Data File | Source |
---|---|---|
Introduction to NumPy | NumpyIntro/grid.npy | https://projecteuler.net/problem=11 |
Introduction to Matplotlib | MatplotlibIntro/FARS.npy | Gathered from https://www.nhtsa.gov/FARS |
Exceptions and File Input/Output | Exceptions_FileIO/hello_world.txt | Written by hand |
Exceptions and File Input/Output | Exceptions_FileIO/cf_example1.txt | Written by hand |
Exceptions and File Input/Output | Exceptions_FileIO/cf_example2.txt | Written by hand |
Profiling | Profiling/names.txt | https://projecteuler.net/problem=22 |
Profiling | Profiling/triangle.txt | https://projecteuler.net/problem=18 |
Profiling | Profiling/triangle_large.txt | https://projecteuler.net/problem=67 |
Data Visualization | DataVisualization/anscombe.npy | https://en.wikipedia.org/wiki/Anscombe's_quartet. Original citation: Anscombe, F. J. (1973). "Graphs in Statistical Analysis". American Statistician. 27 (1): 17–21. JSTOR 2682899 |
Data Visualization | DataVisualization/MLB.npy | http://wiki.stat.ucla.edu/socr/index.php/SOCR_Data_MLB_HeightsWeights (modified) |
Data Visualization | DataVisualization/earthquakes.npy | Gathered from https://earthquake.usgs.gov/earthquakes/search/ |
Data Visualization | DataVisualization/countries.npy | Combined from https://en.wikipedia.org/wiki/List_of_countries_by_GDP_(nominal), http://www.averageheight.co/, and https://en.wikipedia.org/wiki/List_of_countries_and_dependencies_by_population |
Lab Title | Folder/Data File | Source |
---|---|---|
Regular Expressions | RegularExpressions/fake_contacts.txt | Generated by http://www.Generatedata.com/ |
SQL 1: Introduction | SQL1/student_info.csv | Written by hand |
SQL 1: Introduction | SQL1/student_grades.csv | Written by hand |
SQL 1: Introduction | SQL1/us_earthquakes.csv | |
SQL 2 (The Sequel) | SQL2/students.db | Combined from student_info.csv , student_grades.csv , and the other tables in SQL 1 |
Web Technologies | WebTechnologies/nyc_traffic.json | Modified from https://data.cityofnewyork.us/Public-Safety/NYPD-Motor-Vehicle-Collisions/h9gi-nx95, gathered August 2017 |
Introduction to Beautiful Soup | WebScraping1/example.html | https://www.example.com |
Introduction to Beautiful Soup | WebScraping1/san_diego_weather.html | |
Introduction to Beautiful Soup | WebScraping1/large_banks_index.html | |
Introduction to Beautiful Soup | WebScraping1/large_banks_data.html | |
Pandas 1: Introduction | Data Science Essentials | Pandas1/crime_data.csv |
Pandas 1: Introduction | Pandas1/final_accidents2.pickle | |
Pandas 1: Introduction | Pandas1/final_drivers.pickle | |
Pandas 2: Plotting | Pandas2/final_accidents2.pickle | |
Pandas 2: Plotting | Pandas2/final_drivers.pickle | |
Pandas 2: Plotting | Pandas2/new_york_crime_clean.csv | |
Pandas 3: Grouping | Pandas3/Ohio_1999.csv | |
Pandas 3: Grouping | Pandas3/time_usage.txt | |
Pandas 4: Time Series | Pandas4/DJIA.csv | |
Pandas 4: Time Series | Pandas4/finances.csv | |
Pandas 4: Time Series | Pandas4/paychecks.csv | |
Pandas 4: Time Series | Pandas4/website_traffic.csv | |
Pandas 5: GeoPandas | Pandas5/airports.csv | |
Data Augmentation | DataAugmentation/chopin.wav | |
Data Augmentation | DataAugmentation/creditcard.npy | https://www.kaggle.com/mlg-ulb/creditcardfraud |
Data Augmentation | DataAugmentation/lion.png | |
Data Augmentation | DataAugmentation/music.zip | U. Marchand, G. Peeters, "The Extended Ballroom Dataset", in ISMIR 2016 Late-Breaking Session, New-York, USA. (http://anasynth.ircam.fr/home/media/ExtendedBallroom) |
Data Augmentation | DataAugmentation/restaurant-ambience.wav | |
Data Augmentation | DataAugmentation/styles.npy | U. Marchand, G. Peeters, "The Extended Ballroom Dataset", in ISMIR 2016 Late-Breaking Session, New-York, USA. (http://anasynth.ircam.fr/home/media/ExtendedBallroom) |
Lab Title | Folder/Data File | Source |
---|---|---|
Linear Transformations | LinearTransformations/horse.npy | Generated |
Least Squares and Computing Eigenvalues | LeastSquares_Eigenvalues/circle.npy | Generated |
Least Squares and Computing Eigenvalues | LeastSquares_Eigenvalues/ellipse.npy | Generated |
Least Squares and Computing Eigenvalues | LeastSquares_Eigenvalues/housing.npy | Gathered from https://www.fhfa.gov/DataTools/Downloads/Pages/House-Price-Index.aspx |
Image Segmentation | ImageSegmentation/dream.png | |
Image Segmentation | ImageSegmentation/dream_gray.png | |
The SVD and Image Compression | SVD_ImageCompression/hubble.jpg | https://www.nasa.gov/multimedia/imagegallery/image_feature_2099.html |
The SVD and Image Compression | SVD_ImageCompression/hubble_gray.jpg | Modification of hubble.jpg |
Facial Recognition | FacialRecognition/faces94.zip | http://cswww.essex.ac.uk/mv/allfaces/faces94.html |
Differentiation | Differentiation/plane.npy | Generated |
Conditioning and Stability | Conditioning_Stability/stability_data.npy | Generated |
The PageRank Algorithm | PageRank/web_stanford.txt | Subset of web-Stanford.txt from http://snap.stanford.edu/data/web-Stanford.html |
The PageRank Algorithm | PageRank/ncaa2010.csv | Scraped from https://www.sports-reference.com |
The PageRank Algorithm | PageRank/ncaa2011.csv | Scraped from https://www.sports-reference.com |
The PageRank Algorithm | PageRank/ncaa2012.csv | Scraped from https://www.sports-reference.com |
The PageRank Algorithm | PageRank/ncaa2013.csv | Scraped from https://www.sports-reference.com |
The PageRank Algorithm | PageRank/ncaa2014.csv | Scraped from https://www.sports-reference.com |
The PageRank Algorithm | PageRank/ncaa2015.csv | Scraped from https://www.sports-reference.com |
The PageRank Algorithm | PageRank/ncaa2016.csv | Scraped from https://www.sports-reference.com |
The PageRank Algorithm | PageRank/ncaa2017.csv | Scraped from https://www.sports-reference.com |
The PageRank Algorithm | PageRank/top250movies.txt | Subset of movie_data.txt , scraped with imdbpy (https://imdbpy.sourceforge.io/) |
The Drazin Inverse | DrazinInverse/social_network.csv | Adapted from https://en.wikipedia.org/wiki/Zachary%27s_karate_club |
Lab Title | Folder/Data File | Source |
---|---|---|
Linked Lists | LinkedLists/english.txt | Generated |
Binary Search Trees | BinaryTrees/english.txt | Generated |
Nearest Neighbor Search | NearestNeighbor/mnist_subset.npz | Subset of the MNIST database from http://yann.lecun.com/exdb/mnist/ |
Breadth-first Search | BreadthFirstSearch/movie_data.txt | Scraped with imdbpy (https://imdbpy.sourceforge.io/) |
Breadth-first Search | BreadthFirstSearch/movie_data_small.txt | Subset of movie_data.txt . |
Markov Chains | MarkovChains/yoda.txt | Gathered from http://www.imsdb.com/scripts/Star-Wars-The-Empire-Strikes-Back.html, http://www.imsdb.com/scripts/Star-Wars-Return-of-the-Jedi.html, http://www.imsdb.com/scripts/Star-Wars-The-Phantom-Menace.html, http://www.imsdb.com/scripts/Star-Wars-Attack-of-the-Clones.html, and http://www.imsdb.com/scripts/Star-Wars-Revenge-of-the-Sith.html |
The Discrete Fourier Transform | FourierTransform/tada.wav | https://www.youtube.com/watch?v=bjxf-eQWKoo |
The Discrete Fourier Transform | FourierTransform/mystery_chord.wav | Generated |
The Discrete Fourier Transform | FourierTransform/CGC.wav | Generated |
The Discrete Fourier Transform | FourierTransform/GCG.wav | Generated |
The Discrete Fourier Transform | FourierTransform/balloon.wav | Recorded at BYU |
The Discrete Fourier Transform | FourierTransform/chopin.wav | |
The Discrete Fourier Transform | FourierTransform/noisy1.wav | |
The Discrete Fourier Transform | FourierTransform/noisy2.wav | |
The Discrete Fourier Transform | FourierTransform/vuvuzela.wav | Part of https://www.youtube.com/watch?v=g_0NoBKWCT8 |
The Discrete Fourier Transform | FourierTransform/noisy_face.png | Sample from faces94.zip |
The Discrete Fourier Transform | FourierTransform/license_plate.png | |
Introduction to Wavelets | Wavelets/mandrill.png | http://sipi.usc.edu/database/ |
Introduction to Wavelets | Wavelets/woman_darkhair.png | |
Introduction to Wavelets | Wavelets/noisy_darkhair.png | |
Introduction to Wavelets | Wavelets/uncompressed_finger.png | |
Polynomial Interpolation | PolynomialInterpolation/airdata.npy | |
Gradient Descent Methods | GradientMethods/linregression.txt | |
Gradient Descent Methods | GradientMethods/challenger.npy | |
Simplex | Simplex/productMix.npy | |
CVXOPT | CVXOPT_Intro/ForestData.npy | |
Interior Point 1: Linear Programs | InteriorPoint_Linear/simdata.txt | |
Interior Point 2: Quadratic Programs | InteriorPoint_Quadratic/portfolio.txt |
Labs by Folder Name
Folder/Data File | Lab Title | Volume | Source |
---|---|---|---|
BinaryTrees/english.txt | Binary Search Trees | Volume 2 | Generated |
BreadthFirstSearch/movieData.txt | Breadt-first Search | Volume 2 | Scraped with imdbpy (https://imdbpy.sourceforge.io/) |
CVXOPT_Intro/ForestData.npy | CVXOPT | Volume 2 | |
Conditioning_Stability/stability_data.npy | Conditioning and Stability | Volume 1 | Generated |
DataAugmentation/chopin.wav | Data Augmentation | Data Science Essentials | |
DataAugmentation/creditcard.npy | Data Augmentation | Data Science Essentials | (https://www.kaggle.com/mlg-ulb/creditcardfraud) |
DataAugmentation/lion.png | Data Augmentation | Data Science Essentials | |
DataAugmentation/music.zip | Data Augmentation | Data Science Essentials | U. Marchand, G. Peeters, "The Extended Ballroom Dataset", in ISMIR 2016 Late-Breaking Session, New-York, USA. (http://anasynth.ircam.fr/home/media/ExtendedBallroom) |
DataAugmentation/restaurant-ambience.wav | Data Augmentation | Data Science Essentials | |
DataAugmentation/styles.npy | Data Augmentation | Data Science Essentials | U. Marchand, G. Peeters, "The Extended Ballroom Dataset", in ISMIR 2016 Late-Breaking Session, New-York, USA. (http://anasynth.ircam.fr/home/media/ExtendedBallroom) |
DataVisualization/anscombe.npy | Data Visualization | Python Essentials | https://en.wikipedia.org/wiki/Anscombe's_quartet. Original citation: Anscombe, F. J. (1973). "Graphs in Statistical Analysis". American Statistician. 27 (1): 17–21. JSTOR 2682899 |
DataVisualization/MLB.npy | Data Visualization | Python Essentials | http://wiki.stat.ucla.edu/socr/index.php/SOCR_Data_MLB_HeightsWeights (modified) |
DataVisualization/earthquakes.npy | Data Visualization | Python Essentials | Gathered from https://earthquake.usgs.gov/earthquakes/search/ |
DataVisualization/countries.npy | Data Visualization | Python Essentials | Combined from https://en.wikipedia.org/wiki/List_of_countries_by_GDP_(nominal), http://www.averageheight.co/, and https://en.wikipedia.org/wiki/List_of_countries_and_dependencies_by_population |
Differentiation/plane.npy | Differentiation | Volume 1 | Generated |
DrazinInverse/social_network.csv | The Drazin Inverse | Volume 1 | Adapted from https://en.wikipedia.org/wiki/Zachary%27s_karate_club |
Exceptions_FileIO/hello_world.txt | Exceptions and File Input/Output | Python Essentials | Written by hand |
Exceptions_FileIO/cf_example1.txt | Exceptions and File Input/Output | Python Essentials | Written by hand |
Exceptions_FileIO/cf_example2.txt | Exceptions and File Input/Output | Python Essentials | Written by hand |
FacialRecognition/faces94.zip | Facial Recognition | Volume 1 | http://cswww.essex.ac.uk/mv/allfaces/faces94.html |
FourierTransform/tada.wav | The Discrete Fourier Transform | Volume 2 | https://www.youtube.com/watch?v=bjxf-eQWKoo |
FourierTransform/mystery_chord.wav | The Discrete Fourier Transform | Volume 2 | Generated |
FourierTransform/CGC.wav | The Discrete Fourier Transform | Volume 2 | Generated |
FourierTransform/GCG.wav | The Discrete Fourier Transform | Volume 2 | Generated |
FourierTransform/balloon.wav | The Discrete Fourier Transform | Volume 2 | Recorded at BYU |
FourierTransform/chopin.wav | The Discrete Fourier Transform | Volume 2 | |
FourierTransform/noisy1.wav | The Discrete Fourier Transform | Volume 2 | |
FourierTransform/noisy2.wav | The Discrete Fourier Transform | Volume 2 | |
FourierTransform/vuvuzela.wav | The Discrete Fourier Transform | Volume 2 | Part of https://www.youtube.com/watch?v=g_0NoBKWCT8 |
FourierTransform/noisy_face.png | The Discrete Fourier Transform | Volume 2 | Sample from faces94.zip |
FourierTransform/license_plate.png | The Discrete Fourier Transform | Volume 2 | |
GradientMethods/linregression.txt | Gradient Descent Methods | Volume 2 | |
GradientMethods/challenger.npy | Gradient Descent Methods | Volume 2 | |
ImageSegmentation/dream.png | Image Segmentation | Volume 1 | |
ImageSegmentation/dream_gray.png | Image Segmentation | Volume 1 | |
InteriorPoint_Linear/simdata.txt | Interior Point 1: Linear Programs | Volume 2 | |
InteriorPoint_Quadratic/portfolio.txt | Interior Point 2: Quadratic Programs | Volume 2 | |
LeastSquares_Eigenvalues/circle.npy | Least Squares and Computing Eigenvalues | Volume 1 | Generated |
LeastSquares_Eigenvalues/ellipse.npy | Least Squares and Computing Eigenvalues | Volume 1 | Generated |
LeastSquares_Eigenvalues/housing.npy | Least Squares and Computing Eigenvalues | Volume 1 | Gathered from https://www.fhfa.gov/DataTools/Downloads/Pages/House-Price-Index.aspx |
LinearTransformations/horse.npy | Linear Transformations | Volume 1 | Generated |
LinkedLists/english.txt | Linked Lists | Volume 2 | Generated |
MarkovChains/yoda.txt | Markov Chains | Volume 2 | Gathered from http://www.imsdb.com/scripts/Star-Wars-The-Empire-Strikes-Back.html, http://www.imsdb.com/scripts/Star-Wars-Return-of-the-Jedi.html, http://www.imsdb.com/scripts/Star-Wars-The-Phantom-Menace.html, http://www.imsdb.com/scripts/Star-Wars-Attack-of-the-Clones.html, and http://www.imsdb.com/scripts/Star-Wars-Revenge-of-the-Sith.html |
MatplotlibIntro/FARS.npy | Introduction to Matplotlib | Python Essentials | Gathered from https://www.nhtsa.gov/FARS |
NearestNeighbor/mnist_subset.npz | Nearest Neighbor Search | Volume 2 | Subset of the MNIST database from http://yann.lecun.com/exdb/mnist/ |
NumpyIntro/grid.npy | Introduction to NumPy | Python Essentials | https://projecteuler.net/problem=11 |
PageRank/web_stanford.txt | The PageRank Algorithm | Volume 1 | Subset of web-Stanford.txt from http://snap.stanford.edu/data/web-Stanford.html |
PageRank/ncaa2010.csv | The PageRank Algorithm | Volume 1 | Scraped from https://www.sports-reference.com |
PageRank/ncaa2011.csv | The PageRank Algorithm | Volume 1 | Scraped from https://www.sports-reference.com |
PageRank/ncaa2012.csv | The PageRank Algorithm | Volume 1 | Scraped from https://www.sports-reference.com |
PageRank/ncaa2013.csv | The PageRank Algorithm | Volume 1 | Scraped from https://www.sports-reference.com |
PageRank/ncaa2014.csv | The PageRank Algorithm | Volume 1 | Scraped from https://www.sports-reference.com |
PageRank/ncaa2015.csv | The PageRank Algorithm | Volume 1 | Scraped from https://www.sports-reference.com |
PageRank/ncaa2016.csv | The PageRank Algorithm | Volume 1 | Scraped from https://www.sports-reference.com |
PageRank/ncaa2017.csv | The PageRank Algorithm | Volume 1 | Scraped from https://www.sports-reference.com |
PageRank/top250movies.txt | The PageRank Algorithm | Volume 1 | Subset of movie_data.txt , scraped with imdbpy (https://imdbpy.sourceforge.io/) |
Pandas1/crime_data.csv | Pandas 1: Introduction | Data Science Essentials | |
Pandas1/final_accidents2.pickle | Pandas 1: Introduction | Data Science Essentials | |
Pandas1/final_drivers.pickle | Pandas 1: Introduction | Data Science Essentials | |
Pandas2/final_accidents2.pickle | Pandas 2: Plotting | Data Science Essentials | |
Pandas2/final_drivers.pickle | Pandas 2: Plotting | Data Science Essentials | |
Pandas2/new_york_crime_clean.csv | Pandas 2: Plotting | Data Science Essentials | |
Pandas3/Ohio_1999.csv | Pandas 3: Grouping | Data Science Essentials | |
Pandas3/time_usage.txt | Pandas 3: Grouping | Data Science Essentials | |
Pandas4/DJIA.csv | Pandas 4: Time Series | Data Science Essentials | |
Pandas4/finances.csv | Pandas 4: Time Series | Data Science Essentials | |
Pandas4/paychecks.csv | Pandas 4: Time Series | Data Science Essentials | |
Pandas4/website_traffic.csv | Pandas 4: Time Series | Data Science Essentials | |
Pandas5/airports.csv | Pandas 5: GeoPandas | Data Science Essentials | |
PolynomialInterpolation/airdata.npy | Polynomial Interpolation | Volume 2 | |
Profiling/names.txt | Profiling | Python Essentials | https://projecteuler.net/problem=22 |
Profiling/triangle.txt | Profiling | Python Essentials | https://projecteuler.net/problem=18 |
Profiling/triangle_large.txt | Profiling | Python Essentials | https://projecteuler.net/problem=67 |
QuasiNewtonMethods/population.npy | Newton and Quasi-Newton Methods | Volume 2 | |
RegularExpressions/fake_contacts.txt | Regular Expressions | Data Science Essentials | Generated by http://www.Generatedata.com/ |
Simplex/productMix.npy | Simplex | Volume 2 | |
SQL1/student_info.csv | SQL 1: Introduction | Data Science Essentials | Written by hand |
SQL1/student_grades.csv | SQL 1: Introduction | Data Science Essentials | Written by hand |
SQL1/us_earthquakes.csv | SQL 1: Introduction | Data Science Essentials | |
SQL2/students.db | SQL 2 (The Sequel) | Data Science Essentials | Combined from student_info.csv , student_grades.csv , and the other tables in SQL 1 |
SVD_ImageCompression/hubble.jpg | The SVD and Image Compression | Volume 1 | https://www.nasa.gov/multimedia/imagegallery/image_feature_2099.html |
SVD_ImageCompression/hubble_gray.jpg | The SVD and Image Compression | Volume 1 | Modification of hubble.jpg |
Wavelets/mandrill.png | Introduction to Wavelets | Volume 2 | http://sipi.usc.edu/database/ |
Wavelets/woman_darkhair.png | Introduction to Wavelets | Volume 2 | |
Wavelets/noisy_darkhair.png | Introduction to Wavelets | Volume 2 | |
Wavelets/uncompressed_finger.png | Introduction to Wavelets | Volume 2 | |
WebScraping1/example.html | Introduction to Beautiful Soup | Data Science Essentials | https://www.example.com |
WebScraping1/san_diego_weather.html | Introduction to Beautiful Soup | Data Science Essentials | |
WebScraping1/large_banks_index.html | Introduction to Beautiful Soup | Data Science Essentials | |
WebScraping1/large_banks_data.html | Introduction to Beautiful Soup | Data Science Essentials | |
WebTechnologies/nyc_traffic.json | Web Technologies | Data Science Essentials | Modified from https://data.cityofnewyork.us/Public-Safety/NYPD-Motor-Vehicle-Collisions/h9gi-nx95, gathered August 2017 |