AIF360
AIF360 copied to clipboard
Add support for "Communities and Crime Data Set"
http://archive.ics.uci.edu/ml/datasets/communities%2Band%2Bcrime
Potential Tasks:
- [ ] Ensure the license permits open source us
- [ ] Verify that this dataset is appropriate for fairness tasks and subset it accordingly (removing un-necessary columns etc.)
- [ ] Ensure we have instance level records with protected attributes and outcomes
- [ ] First create sklearn-compatible dataset (dataframe) and an appropriate "classic" dataset (second priority)
- [ ] Create a simple notebook where the dataset is consumed and simple fairness measures and computed at least.
- [ ] DO NOT download and incorporate the data, rather include a function that will do this since data is not hosted in AIF360.
This dataset does not have a categorical value for protected attribute (like race) rather it has numerical value, so need to check carefully if this will fit in the existing set of fairness metrics/algorithms in AIF360.
Hi, I would like to work on this issue.