Inherent disagreements in human textual inferences. Transactions of the Association for Computational Linguistics, 7:677–694

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

fields

cs.AI: 1, cs.CL: 1

years

2026: 2

verdicts

UNVERDICTED: 2

representative citing papers

Escaping the Agreement Trap: Defensibility Signals for Evaluating Rule-Governed AI

cs.AI · 2026-04-22 · unverdicted · novelty 7.0

Introduces Defensibility Index, Ambiguity Index, and Probabilistic Defensibility Signal to evaluate AI moderation decisions by logical derivability from explicit rules rather than agreement with historical labels, with validation on 193k+ Reddit cases showing 33-46.6 pp metric gaps and a Governance
