Equality of Opportunity in Supervised Learning

Moritz Hardt, Eric Price, Nathan Srebro · 2016 · cs.LG · arXiv 1610.02413

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

open full Pith review browse 6 citing papers arXiv PDF

abstract

We propose a criterion for discrimination against a specified sensitive attribute in supervised learning, where the goal is to predict some target based on available features. Assuming data about the predictor, target, and membership in the protected group are available, we show how to optimally adjust any learned predictor so as to remove discrimination according to our definition. Our framework also improves incentives by shifting the cost of poor classification from disadvantaged groups to the decision maker, who can respond by improving the classification accuracy. In line with other studies, our notion is oblivious: it depends only on the joint statistics of the predictor, the target and the protected attribute, but not on interpretation of individualfeatures. We study the inherent limits of defining and identifying biases based on such oblivious measures, outlining what can and cannot be inferred from different oblivious tests. We illustrate our notion using a case study of FICO credit scores.

citation-role summary

extension 1

citation-polarity summary

extend 1

representative citing papers

Fairness vs Performance: Characterizing the Pareto Frontier of Algorithmic Decision Systems

cs.LG · 2026-05-11 · unverdicted · novelty 6.0

The Pareto frontier of fair algorithmic decisions consists of deterministic group-specific threshold rules on predicted success probabilities, which can include upper bounds for some fairness metrics and holds independently of model training approach.

Revisiting Fairness Impossibility with Endogenous Behavior

cs.GT · 2026-04-07 · unverdicted · novelty 6.0

Error-rate balance and predictive parity become compatible under endogenous behavior by adjusting stakes differently across groups, introducing a new form of unequal treatment in consequences.

Ethical and social risks of harm from Language Models

cs.CL · 2021-12-08 · accept · novelty 6.0

The authors provide a detailed taxonomy of 21 risks associated with language models, covering discrimination, information leaks, misinformation, malicious applications, interaction harms, and societal impacts like job loss and environmental costs.

Online Learning with Multiple Fairness Regularizers via Graph-Structured Feedback

cs.LG · 2025-08-19 · unverdicted · novelty 5.0

Develops a bandit algorithm with graph feedback that learns weights for multiple fairness constraints adaptively over sequential interactions.

FAIR_XAI: Improving Multimodal Foundation Model Fairness via Explainability for Wellbeing Assessment

cs.AI · 2026-04-26 · unverdicted · novelty 4.0

Vision-language models for wellbeing assessment exhibit dataset-dependent performance and demographic biases, with explainability interventions providing inconsistent fairness gains at potential accuracy costs.

Man and machine: artificial intelligence and judicial decision making

cs.AI · 2026-03-19 · unverdicted · novelty 2.0

A synthetic review across multiple fields concludes that AI decision aids have modest or nonexistent effects on judicial outcomes while identifying gaps in understanding human-AI interactions.

citing papers explorer

Showing 6 of 6 citing papers.

Fairness vs Performance: Characterizing the Pareto Frontier of Algorithmic Decision Systems cs.LG · 2026-05-11 · unverdicted · none · ref 18
The Pareto frontier of fair algorithmic decisions consists of deterministic group-specific threshold rules on predicted success probabilities, which can include upper bounds for some fairness metrics and holds independently of model training approach.
Revisiting Fairness Impossibility with Endogenous Behavior cs.GT · 2026-04-07 · unverdicted · none · ref 3
Error-rate balance and predictive parity become compatible under endogenous behavior by adjusting stakes differently across groups, introducing a new form of unequal treatment in consequences.
Ethical and social risks of harm from Language Models cs.CL · 2021-12-08 · accept · none · ref 107
The authors provide a detailed taxonomy of 21 risks associated with language models, covering discrimination, information leaks, misinformation, malicious applications, interaction harms, and societal impacts like job loss and environmental costs.
Online Learning with Multiple Fairness Regularizers via Graph-Structured Feedback cs.LG · 2025-08-19 · unverdicted · none · ref 22 · internal anchor
Develops a bandit algorithm with graph feedback that learns weights for multiple fairness constraints adaptively over sequential interactions.
FAIR_XAI: Improving Multimodal Foundation Model Fairness via Explainability for Wellbeing Assessment cs.AI · 2026-04-26 · unverdicted · none · ref 35
Vision-language models for wellbeing assessment exhibit dataset-dependent performance and demographic biases, with explainability interventions providing inconsistent fairness gains at potential accuracy costs.
Man and machine: artificial intelligence and judicial decision making cs.AI · 2026-03-19 · unverdicted · none · ref 13 · internal anchor
A synthetic review across multiple fields concludes that AI decision aids have modest or nonexistent effects on judicial outcomes while identifying gaps in understanding human-AI interactions.

Equality of Opportunity in Supervised Learning

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer