Debiasing representations by removing unwanted variation due to protected attributes

Alexander Vargo; Amanda Bower; Laura Niss; Yuekai Sun

arxiv: 1807.00461 · v1 · pith:IBKNMSDXnew · submitted 2018-07-02 · 💻 cs.CY

Debiasing representations by removing unwanted variation due to protected attributes

Amanda Bower , Laura Niss , Yuekai Sun , Alexander Vargo This is my paper

classification 💻 cs.CY

keywords approachrepresentationsprotectedremovingapproachesapproximationattributeattributes

0 comments

read the original abstract

We propose a regression-based approach to removing implicit biases in representations. On tasks where the protected attribute is observed, the method is statistically more efficient than known approaches. Further, we show that this approach leads to debiased representations that satisfy a first order approximation of conditional parity. Finally, we demonstrate the efficacy of the proposed approach by reducing racial bias in recidivism risk scores.

This paper has not been read by Pith yet.

Debiasing representations by removing unwanted variation due to protected attributes

discussion (0)