When the Majority is Wrong: Modeling Annotator Disagreement for Subjective Tasks

Eve Fleisig, Rediet Abebe, Dan Klein · 2023 · DOI 10.18653/v1/2023.emnlp-main.415

3 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Who and What? Using Linguistic Features and Annotator Characteristics to Analyze Annotation Variation

cs.CL · 2026-05-07 · unverdicted · novelty 6.0

Large-scale statistical analysis of four harmful language datasets reveals that interactions between annotator characteristics and linguistic cues drive annotation variation, with lexical features and attitudes prominent but patterns varying by dataset.

STABLEVAL: Disagreement-Aware and Stable Evaluation of AI Systems

cs.LG · 2026-05-04 · unverdicted · novelty 6.0

STABLEVAL models latent correctness and annotator confusion to deliver more stable and uncertainty-aware AI system rankings than majority-vote aggregation.

IYKYK (But AI Doesn't): Automated Content Moderation Does Not Capture Communities' Heterogeneous Attitudes Towards Reclaimed Language

cs.CL · 2026-04-17 · unverdicted · novelty 5.0

Automated hate speech detectors show poor alignment with heterogeneous in-group judgments on reclaimed slur usage, driven by low inter-annotator agreement and contextual features like derogatory intent.

citing papers explorer

Showing 3 of 3 citing papers.

Who and What? Using Linguistic Features and Annotator Characteristics to Analyze Annotation Variation

cs.CL · 2026-05-07 · unverdicted · none · ref 67

STABLEVAL: Disagreement-Aware and Stable Evaluation of AI Systems

cs.LG · 2026-05-04 · unverdicted · none · ref 10

IYKYK (But AI Doesn't): Automated Content Moderation Does Not Capture Communities' Heterogeneous Attitudes Towards Reclaimed Language

cs.CL · 2026-04-17 · unverdicted · none · ref 30
