pith. sign in

arxiv: 1903.07609 · v1 · pith:TILCNRL2new · submitted 2019-03-18 · 💻 cs.LG · stat.ML

Multi-Differential Fairness Auditor for Black Box Classifiers

classification 💻 cs.LG stat.ML
keywords classifierindividualsfairnessmulti-differentialsensitiveafrican-americanattributesblack
0
0 comments X
read the original abstract

Machine learning algorithms are increasingly involved in sensitive decision-making process with adversarial implications on individuals. This paper presents mdfa, an approach that identifies the characteristics of the victims of a classifier's discrimination. We measure discrimination as a violation of multi-differential fairness. Multi-differential fairness is a guarantee that a black box classifier's outcomes do not leak information on the sensitive attributes of a small group of individuals. We reduce the problem of identifying worst-case violations to matching distributions and predicting where sensitive attributes and classifier's outcomes coincide. We apply mdfa to a recidivism risk assessment classifier and demonstrate that individuals identified as African-American with little criminal history are three-times more likely to be considered at high risk of violent recidivism than similar individuals but not African-American.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.