InterpretableMachineLearning2020 Dataset: COMPAS Recidivism Algorithm

Dataset: COMPAS Recidivism Algorithm

Open pbiecek opened this issue 5 years ago • 1 comments

Problem

This is a regression problem. On the basis of historical data, models (of varying degrees of complexity) should be developed to predict the COMPAS scores. The best models should be explained using XAI tools at the instance level and at the data set level.

Data

COMPAS Recidivism Risk Scores are suspected of being biased https://www.propublica.org/article/how-we-analyzed-the-compas-recidivism-algorithm. Let's check this. The data can be downloaded from the propublica website. https://github.com/propublica/compas-analysis

Example solution

An interesting description of performed analysis can be found here: https://github.com/propublica/compas-analysis/blob/master/Compas%20Analysis.ipynb and here https://www.propublica.org/article/how-we-analyzed-the-compas-recidivism-algorithm

Feb 24 '20 17:02 pbiecek

So much can be done using this flagship data set for ML fairness! If anyone's interested in this project - ping me on Slack, happy to support :)

Mar 09 '20 16:03 marta-generation

InterpretableMachineLearning2020 InterpretableMachineLearning2020 copied to clipboard

Dataset: COMPAS Recidivism Algorithm

Problem

Data

Example solution

InterpretableMachineLearning2020
InterpretableMachineLearning2020 copied to clipboard