Achieving Fairness through Adversarial Learning: an Application to Recidivism Prediction

Chris Piech; Christina Wadsworth; Francesca Vera

arxiv: 1807.00199 · v1 · pith:GDO5ENNBnew · submitted 2018-06-30 · 💻 cs.LG · stat.ML

Achieving Fairness through Adversarial Learning: an Application to Recidivism Prediction

Christina Wadsworth , Francesca Vera , Chris Piech This is my paper

classification 💻 cs.LG stat.ML

keywords predictionrecidivismfairnessachievingapplicationbiascompasinmates

0 comments

read the original abstract

Recidivism prediction scores are used across the USA to determine sentencing and supervision for hundreds of thousands of inmates. One such generator of recidivism prediction scores is Northpointe's Correctional Offender Management Profiling for Alternative Sanctions (COMPAS) score, used in states like California and Florida, which past research has shown to be biased against black inmates according to certain measures of fairness. To counteract this racial bias, we present an adversarially-trained neural network that predicts recidivism and is trained to remove racial bias. When comparing the results of our model to COMPAS, we gain predictive accuracy and get closer to achieving two out of three measures of fairness: parity and equality of odds. Our model can be generalized to any prediction and demographic. This piece of research contributes an example of scientific replication and simplification in a high-stakes real-world application like recidivism prediction.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Worst-Group Equalized Odds Regularization for Multi-Attribute Fair Medical Image Classification
cs.LG 2026-05 unverdicted novelty 5.0

A worst-group equalized odds regularizer targets extreme subgroup deviations in true and false positive rates to improve multi-attribute fairness in medical imaging while preserving AUC.