AIF360 icon indicating copy to clipboard operation
AIF360 copied to clipboard

Add support for Dutch Census dataset

Open anupamamurthi opened this issue 2 years ago • 1 comments

https://github.com/tailequy/fairness_dataset/tree/main/Dutch_census

Please get the data from the IPUMS website linked to the above repository. The person file here https://microdata.worldbank.org/index.php/catalog/2102/data-dictionary/F2?file_name=NLD2001-P-H seems to be best suited for fairness

Potential Tasks:

  • [ ] Ensure the license permits open source us
  • [ ] Verify that this dataset is appropriate for fairness tasks and subset it accordingly (removing un-necessary columns etc.)
  • [ ] Ensure we have instance level records with protected attributes and outcomes
  • [ ] First create sklearn-compatible dataset (dataframe) and an appropriate "classic" dataset (second priority)
  • [ ] Create a simple notebook where the dataset is consumed and simple fairness measures and computed at least.
  • [ ] DO NOT download and incorporate the data, rather include a function that will do this since data is not hosted in AIF360.

anupamamurthi avatar Aug 27 '22 03:08 anupamamurthi

Hi, I would like to work on this issue.

MiKueen avatar Sep 22 '23 18:09 MiKueen