Automated Rationale Generation: A Technique for Explainable AI and its Effects on Human Perceptions

Brent Harrison; Larry Chan; Mark Riedl; Pradyumna Tambwekar; Upol Ehsan

arxiv: 1901.03729 · v1 · pith:QOB6OEMEnew · submitted 2019-01-11 · 💻 cs.AI · cs.HC

Upol Ehsan, Pradyumna Tambwekar, Larry Chan, Brent Harrison, Mark Riedl

keywords: rationales, agent, rationale, behavior, generated, generation, user, automated

Automated rationale generation is an approach for real-time explanation generation whereby a computational model learns to translate an autonomous agent's internal state and action data representations into natural language. Training on human explanation data can enable agents to learn to generate human-like explanations for their behavior. In this paper, using the context of an agent that plays Frogger, we describe (a) how to collect a corpus of explanations, (b) how to train a neural rationale generator to produce different styles of rationales, and (c) how people perceive these rationales. We conducted two user studies. The first study establishes the plausibility of each type of generated rationale and situates their user perceptions along the dimensions of confidence, humanlike-ness, adequate justification, and understandability. The second study further explores user preferences between the generated rationales with regard to confidence in the autonomous agent, communicating failure and unexpected behavior. Overall, we find alignment between the intended differences in features of the generated rationales and the perceived differences by users. Moreover, context permitting, participants preferred detailed rationales to form a stable mental model of the agent's behavior.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Unexplainability and Incomprehensibility of Artificial Intelligence
cs.CY · 2019-06 · unverdicted · novelty 3.0

Advanced AI systems are unexplainable in full and produce explanations that humans cannot comprehend.