AIF360
AIF360 copied to clipboard
Add support for Dutch Census dataset
https://github.com/tailequy/fairness_dataset/tree/main/Dutch_census
Please get the data from the IPUMS website linked to the above repository. The person file here https://microdata.worldbank.org/index.php/catalog/2102/data-dictionary/F2?file_name=NLD2001-P-H seems to be best suited for fairness
Potential Tasks:
- [ ] Ensure the license permits open source us
- [ ] Verify that this dataset is appropriate for fairness tasks and subset it accordingly (removing un-necessary columns etc.)
- [ ] Ensure we have instance level records with protected attributes and outcomes
- [ ] First create sklearn-compatible dataset (dataframe) and an appropriate "classic" dataset (second priority)
- [ ] Create a simple notebook where the dataset is consumed and simple fairness measures and computed at least.
- [ ] DO NOT download and incorporate the data, rather include a function that will do this since data is not hosted in AIF360.
Hi, I would like to work on this issue.