Add support for "Communities and Crime Data Set"

Open anupamamurthi opened this issue 2 years ago • 2 comments

http://archive.ics.uci.edu/ml/datasets/communities%2Band%2Bcrime

Potential Tasks:

  • [ ] Ensure the license permits open source use
  • [ ] Verify that this dataset is appropriate for fairness tasks and subset it accordingly (removing unnecessary columns etc.)
  • [ ] Ensure we have instance-level records with protected attributes and outcomes
  • [ ] First create an sklearn-compatible dataset (dataframe), then an appropriate "classic" dataset (second priority)
  • [ ] Create a simple notebook where the dataset is consumed and, at a minimum, simple fairness measures are computed.
  • [ ] Do NOT download and incorporate the data; instead, include a function that downloads it, since the data is not hosted in AIF360.
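
As a rough illustration of the last task, a download-on-demand helper might look something like the sketch below. It is not AIF360's API: `fetch_communities`, `load_rows`, and `DATA_URL` are hypothetical names, the URL is an assumed location of the raw file on the UCI server that should be verified before use, and the sketch assumes the file is comma-separated with `?` marking missing values.

```python
import csv
import os
import urllib.request

# Assumed location of the raw data file on the UCI server; verify before use.
DATA_URL = ("https://archive.ics.uci.edu/ml/machine-learning-databases/"
            "communities/communities.data")


def fetch_communities(data_dir, url=DATA_URL):
    """Download the raw file into data_dir unless a local copy already exists.

    Hypothetical helper: nothing is bundled with the library, the data is
    fetched on first use and cached on disk afterwards.
    """
    os.makedirs(data_dir, exist_ok=True)
    path = os.path.join(data_dir, "communities.data")
    if not os.path.exists(path):
        urllib.request.urlretrieve(url, path)
    return path


def load_rows(path, missing="?"):
    """Parse the comma-separated file, mapping the missing-value marker to None."""
    with open(path, newline="") as f:
        return [[None if cell == missing else cell for cell in row]
                for row in csv.reader(f) if row]
```

Calling `load_rows(fetch_communities("data/raw"))` would then download the file once and reuse the cached copy on later calls. Column names are deliberately left out here, since they would have to come from the dataset's own documentation.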

anupamamurthi avatar Aug 27 '22 03:08 anupamamurthi

This dataset does not have a categorical protected attribute (such as race); the corresponding attributes are numerical. We need to check carefully whether it will fit the existing set of fairness metrics/algorithms in AIF360.
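
One possible way to make a numerical attribute fit group-based metrics is to threshold it into two groups first. The sketch below is only an illustration in plain Python, not AIF360 code: the helper names are hypothetical, the toy values are made up, and both thresholds (0.5 for the group split, 0.4 for the outcome) are arbitrary assumptions. The metric follows the usual definition of statistical parity difference, P(favorable | unprivileged) - P(favorable | privileged).

```python
def binarize(values, threshold):
    """Return 1 where the value is at or above the threshold, else 0."""
    return [1 if v >= threshold else 0 for v in values]


def statistical_parity_difference(favorable, privileged):
    """P(favorable | unprivileged) - P(favorable | privileged) for 0/1 lists."""
    unpriv = [f for f, p in zip(favorable, privileged) if p == 0]
    priv = [f for f, p in zip(favorable, privileged) if p == 1]
    if not unpriv or not priv:
        raise ValueError("both groups must be non-empty")
    return sum(unpriv) / len(unpriv) - sum(priv) / len(priv)


# Made-up toy values standing in for a numerical protected attribute and a
# numerical outcome; they are NOT taken from the real dataset.
protected_share = [0.02, 0.10, 0.55, 0.70, 0.05, 0.80]
crime_rate = [0.10, 0.20, 0.60, 0.30, 0.15, 0.75]

# Assumption: communities below the 0.5 threshold form the privileged group.
privileged = [1 - b for b in binarize(protected_share, 0.5)]
# Assumption: a crime rate below 0.4 counts as the favorable outcome.
favorable = [1 - b for b in binarize(crime_rate, 0.4)]

spd = statistical_parity_difference(favorable, privileged)
print(f"statistical parity difference: {spd:.3f}")
```

Any result depends heavily on where the thresholds are placed, so a real integration would need to justify those choices or use metrics that accept continuous attributes.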

nrkarthikeyan avatar Sep 15 '22 19:09 nrkarthikeyan

Hi, I would like to work on this issue.

SathvickN avatar Sep 22 '23 18:09 SathvickN