Explanation in Artificial Intelligence: Insights from the Social Sciences

Tim Miller

arxiv: 1706.07269 · v3 · pith:H7XPTKANnew · submitted 2017-06-22 · 💻 cs.AI

Explanation in Artificial Intelligence: Insights from the Social Sciences

Tim Miller This is my paper

classification 💻 cs.AI

keywords artificialintelligenceexplainableexplanationcognitivepsychologyresearchsocial

0 comments

read the original abstract

There has been a recent resurgence in the area of explainable artificial intelligence as researchers and practitioners seek to make their algorithms more understandable. Much of this research is focused on explicitly explaining decisions or actions to a human observer, and it should not be controversial to say that looking at how humans explain to each other can serve as a useful starting point for explanation in artificial intelligence. However, it is fair to say that most work in explainable artificial intelligence uses only the researchers' intuition of what constitutes a `good' explanation. There exists vast and valuable bodies of research in philosophy, psychology, and cognitive science of how people define, generate, select, evaluate, and present explanations, which argues that people employ certain cognitive biases and social expectations towards the explanation process. This paper argues that the field of explainable artificial intelligence should build on this existing research, and reviews relevant papers from philosophy, cognitive psychology/science, and social psychology, which study these topics. It draws out some important findings, and discusses ways that these can be infused with work on explainable artificial intelligence.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Measuring User's Mental Models of Speech Translation in Human-AI Collaboration
cs.CL 2026-06 unverdicted novelty 6.0

A cross-lingual QA framework shows users build stronger mental models of MT systems through practice and source language knowledge mainly by spotting surface-level errors, with transcriptions helping further.
Ethical and social risks of harm from Language Models
cs.CL 2021-12 accept novelty 6.0

The authors provide a detailed taxonomy of 21 risks associated with language models, covering discrimination, information leaks, misinformation, malicious applications, interaction harms, and societal impacts like job...