A list of datasets for applying statistics, machine learning, and technology to social good.

Datasets for Social Good Projects

I was inspired to create this after taking many project-based CS and AI classes at Stanford, where I would spend more time finding data for a problem I actually cared about than writing the baseline algorithm.

The list is divided by sector, and each link has a (D), (T), or (C) next to it. (D) marks a dataset; (T) marks a tutorial; (C) marks an online challenge that you can download data from and contribute to.

I am sure there are many great datasets I have missed. If you have datasets to add, please create a pull request!

Health

Education

Environment

Government

Public Good

Other Good Lists of Datasets

  • https://www.datasciencecentral.com/profiles/blogs/great-github-list-of-public-data-sets
  • https://ibmhadoop.devpost.com/details/data
  • http://kevinchai.net/datasets
  • https://www.kaggle.com/datasets
  • http://archive.ics.uci.edu/ml/datasets.html?sort=nameUp&view=list
  • https://github.com/rafalab/dslabs/tree/master/data