Introduces Defensibility Index, Ambiguity Index, and Probabilistic Defensibility Signal to evaluate AI moderation decisions by logical derivability from explicit rules rather than agreement with historical labels, with validation on 193k+ Reddit cases showing 33-46.6 pp metric gaps and a Governance
Title resolution pending
3 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 3roles
background 1polarities
background 1representative citing papers
Admins in India used Meta AI to help create WhatsApp group rules, appreciating reduced workload but remaining cautious about privacy, relational trust, and contextual tone.
Qualitative analysis of Reddit discussions reveals four tensions users face with AI-generated fitness feedback, showing resistance to AI that limits personal interpretations of lived experiences.
citing papers explorer
-
Escaping the Agreement Trap: Defensibility Signals for Evaluating Rule-Governed AI
Introduces Defensibility Index, Ambiguity Index, and Probabilistic Defensibility Signal to evaluate AI moderation decisions by logical derivability from explicit rules rather than agreement with historical labels, with validation on 193k+ Reddit cases showing 33-46.6 pp metric gaps and a Governance
-
Creating Group Rules with AI: Human-AI Collaboration in WhatsApp Moderation
Admins in India used Meta AI to help create WhatsApp group rules, appreciating reduced workload but remaining cautious about privacy, relational trust, and contextual tone.
-
Who Gets to Interpret the Workout? User Tensions with AI-Generated Fitness Feedback
Qualitative analysis of Reddit discussions reveals four tensions users face with AI-generated fitness feedback, showing resistance to AI that limits personal interpretations of lived experiences.