Iterative Orthogonal Feature Projection for Diagnosing Bias in Black-Box Models

Julius Adebayo; Lalana Kagal

arxiv: 1611.04967 · v1 · pith:DMWPJ77Ynew · submitted 2016-11-15 · 💻 cs.LG · stat.ML

Iterative Orthogonal Feature Projection for Diagnosing Bias in Black-Box Models

Julius Adebayo , Lalana Kagal This is my paper

classification 💻 cs.LG stat.ML

keywords black-boxiterativemodelmodelspotentialpredictiveattributesinput

0 comments

read the original abstract

Predictive models are increasingly deployed for the purpose of determining access to services such as credit, insurance, and employment. Despite potential gains in productivity and efficiency, several potential problems have yet to be addressed, particularly the potential for unintentional discrimination. We present an iterative procedure, based on orthogonal projection of input attributes, for enabling interpretability of black-box predictive models. Through our iterative procedure, one can quantify the relative dependence of a black-box model on its input attributes.The relative significance of the inputs to a predictive model can then be used to assess the fairness (or discriminatory extent) of such a model.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

The Statistical Cost of Adaptation in Multi-Source Transfer Learning
math.ST 2026-05 unverdicted novelty 8.0

Multi-source transfer learning incurs an intrinsic adaptation cost that can exceed one, with phase transitions separating regimes where bias-agnostic estimators match oracle performance from those where they cannot.
Metamorphic Testing of a Deep Learning based Forecaster
cs.LG 2019-07 unverdicted novelty 5.0

Developed 19 metamorphic relations to test correlation detection and LSTM forecasting in an outage prediction application, uncovering 8 unknown issues in the live system and detecting 65.9% of injected bugs via mutati...