C onv A buse: Data, Analysis, and Benchmarks for Nuanced Abuse Detection in Conversational AI

Cercas Curry, Amanda, Abercrombie, Gavin, Rieser, Verena · 2021 · DOI 10.18653/v1/2021.emnlp-main.587

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open at publisher browse 2 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Beyond Majority Voting: Agreement-Based Clustering to Model Annotator Perspectives in Subjective NLP Tasks

cs.CL · 2026-05-11 · unverdicted · novelty 6.0

Agreement-based clustering of annotators improves performance on subjective NLP tasks by capturing diverse perspectives better than majority voting or per-annotator modeling.

STABLEVAL: Disagreement-Aware and Stable Evaluation of AI Systems

cs.LG · 2026-05-04 · unverdicted · novelty 5.0

STABLEVAL produces stable AI system rankings by modeling latent correctness and annotator confusion rather than majority vote aggregation.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Beyond Majority Voting: Agreement-Based Clustering to Model Annotator Perspectives in Subjective NLP Tasks cs.CL · 2026-05-11 · unverdicted · none · ref 22
Agreement-based clustering of annotators improves performance on subjective NLP tasks by capturing diverse perspectives better than majority voting or per-annotator modeling.
STABLEVAL: Disagreement-Aware and Stable Evaluation of AI Systems cs.LG · 2026-05-04 · unverdicted · none · ref 21
STABLEVAL produces stable AI system rankings by modeling latent correctness and annotator confusion rather than majority vote aggregation.

C onv A buse: Data, Analysis, and Benchmarks for Nuanced Abuse Detection in Conversational AI

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer