Face-Feature Tuning is a label-free logit remapping method that reduces FPR/TPR gaps across groups in deepfake detection while preserving overall accuracy.
Equality of Opportunity in Supervised Learning
10 Pith papers cite this work. Polarity classification is still indexing.
abstract
We propose a criterion for discrimination against a specified sensitive attribute in supervised learning, where the goal is to predict some target based on available features. Assuming data about the predictor, target, and membership in the protected group are available, we show how to optimally adjust any learned predictor so as to remove discrimination according to our definition. Our framework also improves incentives by shifting the cost of poor classification from disadvantaged groups to the decision maker, who can respond by improving the classification accuracy. In line with other studies, our notion is oblivious: it depends only on the joint statistics of the predictor, the target and the protected attribute, but not on interpretation of individual features. We study the inherent limits of defining and identifying biases based on such oblivious measures, outlining what can and cannot be inferred from different oblivious tests. We illustrate our notion using a case study of FICO credit scores.
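The idea of adjusting a learned predictor using only the joint statistics of score, target, and protected attribute can be illustrated with a small sketch. It assumes the equal-opportunity reading of the criterion (equal true positive rates across groups) and uses deterministic group-specific thresholds. This is a simplification for illustration, not the paper's optimal procedure, and the function names, the toy data, and the `target_tpr` parameter are all hypothetical.

```python
from typing import Dict, Hashable, Sequence


def true_positive_rate(scores: Sequence[float], labels: Sequence[int],
                       threshold: float) -> float:
    """Fraction of positive examples (label == 1) with score >= threshold."""
    positives = [s for s, y in zip(scores, labels) if y == 1]
    if not positives:
        raise ValueError("group has no positive examples")
    return sum(s >= threshold for s in positives) / len(positives)


def equal_opportunity_thresholds(scores: Sequence[float],
                                 labels: Sequence[int],
                                 groups: Sequence[Hashable],
                                 target_tpr: float) -> Dict[Hashable, float]:
    """For each group, pick the highest threshold whose TPR reaches target_tpr.

    Oblivious in the abstract's sense: only scores, labels, and group
    membership are used, never the underlying features.
    """
    thresholds: Dict[Hashable, float] = {}
    for g in sorted(set(groups)):
        g_scores = [s for s, a in zip(scores, groups) if a == g]
        g_labels = [y for y, a in zip(labels, groups) if a == g]
        # Scan candidate thresholds from strictest to most lenient.
        for t in sorted(set(g_scores), reverse=True):
            if true_positive_rate(g_scores, g_labels, t) >= target_tpr:
                thresholds[g] = t
                break
    return thresholds


# Toy data (hypothetical): group "B" receives systematically lower scores.
scores = [0.9, 0.8, 0.6, 0.3, 0.7, 0.5, 0.4, 0.2]
labels = [1, 1, 1, 0, 1, 1, 1, 0]
groups = ["A", "A", "A", "A", "B", "B", "B", "B"]

per_group = equal_opportunity_thresholds(scores, labels, groups, target_tpr=0.6)
for g, t in per_group.items():
    g_scores = [s for s, a in zip(scores, groups) if a == g]
    g_labels = [y for y, a in zip(labels, groups) if a == g]
    print(g, t, true_positive_rate(g_scores, g_labels, t))
```

On this toy data a single shared threshold of 0.6 accepts every qualified member of group A but only one of three in group B, whereas the group-specific thresholds give both groups the same true positive rate. With real data the rates rarely match exactly at deterministic thresholds, which is one reason a full treatment may need randomization between thresholds.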
citation-role summary
citation-polarity summary
roles
extension 1
polarities
extend 1
representative citing papers
Facial recognition enacts computational epistemicide by progressively reducing faces to standardized numerical vectors, rendering reformist ethical AI insufficient and requiring abolition of vectorized identity as a basis for rights.
Landseer offers a containerized modular system to integrate and evaluate combinations of machine learning defenses, with an initial analysis of 35 defenses highlighting replicability challenges.
The Pareto frontier of fair algorithmic decisions consists of deterministic group-specific threshold rules on predicted success probabilities, which can include upper bounds for some fairness metrics and holds independently of model training approach.
Error-rate balance and predictive parity become compatible under endogenous behavior by adjusting stakes differently across groups, introducing a new form of unequal treatment in consequences.
The authors provide a detailed taxonomy of 21 risks associated with language models, covering discrimination, information leaks, misinformation, malicious applications, interaction harms, and societal impacts like job loss and environmental costs.
A mixed-methods investigation of the research-practice gap in algorithmic fairness for public health identifies barriers and proposes the Fairness-to-Action framework integrating methodological, organizational, and systemic dimensions.
Develops a bandit algorithm with graph feedback that learns weights for multiple fairness constraints adaptively over sequential interactions.
Vision-language models for wellbeing assessment exhibit dataset-dependent performance and demographic biases, with explainability interventions providing inconsistent fairness gains at potential accuracy costs.
A synthetic review across multiple fields concludes that AI decision aids have modest or nonexistent effects on judicial outcomes while identifying gaps in understanding human-AI interactions.
citing papers explorer
- Ethical and social risks of harm from Language Models
The authors provide a detailed taxonomy of 21 risks associated with language models, covering discrimination, information leaks, misinformation, malicious applications, interaction harms, and societal impacts like job loss and environmental costs.