Explanations as Dialogues: Toward Human-Centered Conversational Explainable AI

Niharika Mathur; Smit Desai

arxiv: 2605.27666 · v1 · pith:L6MVHBQRnew · submitted 2026-05-26 · 💻 cs.HC

Explanations as Dialogues: Toward Human-Centered Conversational Explainable AI

Niharika Mathur , Smit Desai This is my paper

Pith reviewed 2026-06-29 15:23 UTC · model grok-4.3

classification 💻 cs.HC

keywords conversational explainable AIhuman-centered AIdialogue systemsexplainabilityhuman-AI interactionXAI

0 comments

The pith

The conversational layer around an AI explanation is a critical part of its effectiveness, not an optional extra.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper identifies a mismatch: explanations are usually researched as fixed outputs, yet users encounter them through ongoing dialogue with the system. It claims that features such as timing, tone, persona, and prior exchanges directly determine whether an explanation succeeds in building understanding or trust. Three example scenarios illustrate how these elements operate in practice. If the claim holds, then effective explanations require design for interactive exchange rather than one-time delivery of facts. The authors present this as a vision for studying explanations as shaped by their conversational context.

Core claim

Explanations are experienced as interactive exchanges whose effectiveness depends on timing, tone, persona, and conversational history, so the conversational layer must be treated as a core constituent rather than an incidental wrapper around static content.

What carries the argument

The conversational layer, consisting of timing, tone, persona, and history that shape an explanation during interactive exchanges.

If this is right

XAI systems must be built to handle and adapt to multiple turns of user input rather than delivering a single response.
Evaluation of explanations needs to include measures of how well the exchange flows over time.
Research focus should move from producing correct facts to modeling full conversational sequences.
Domains that rely on user trust, such as medical or financial decisions, would require explanations that respond to follow-up questions in context.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same emphasis on dialogue could apply to other AI outputs like recommendations, where back-and-forth might improve acceptance.
Testing whether static-only explanations reduce user satisfaction in real deployments would provide a direct check on the claim.
Linking this view with existing conversational AI tools could produce more natural ongoing collaboration between humans and systems.

Load-bearing premise

The gap between studying explanations as static items and experiencing them as dialogue means the conversational features are essential to success, shown mainly through example cases rather than measured results.

What would settle it

A study that gives users the same information once as a static explanation and once through a multi-turn dialogue, then measures no difference in understanding, trust, or decision quality.

Figures

Figures reproduced from arXiv: 2605.27666 by Niharika Mathur, Smit Desai.

**Figure 1.** Figure 1: A hospital readmission risk score rendered as a static SHAP-style feature importance chart (Fig. 1A) versus as a conversational exchange (Fig. 1B). a framework or a starting point that takes the “conversational” seriously, not just a presentation layer but a site of conversational sensemaking. Addressing this gap will likely require closer alignment between communities that have largely progressed in par… view at source ↗

**Figure 2.** Figure 2: Scenario A: An Older Adult Interacting with an AI-enabled Health Management App. HC2XAI, we must bring these threads together and focus them on explanations as a conversational act. 3 Scenarios: Explanations in the Wild To demonstrate what we mean by explanations as conversational acts, we present the following scenarios. These scenarios are not entirely speculative. They are composites of interactions alr… view at source ↗

**Figure 3.** Figure 3: Scenario B: A college student interacting with an AI tutor. [PITH_FULL_IMAGE:figures/full_fig_p004_3.png] view at source ↗

**Figure 4.** Figure 4: Scenario C: A user interacting with an AI Travel Planner. [PITH_FULL_IMAGE:figures/full_fig_p005_4.png] view at source ↗

read the original abstract

As AI systems become increasingly conversational, a gap emerges wherein explanations are studied as static artifacts, yet in practice, are experienced as dialogue. In this provocation, we argue that the conversational layer around an explanation is not incidental to its effectiveness, but a critical constituent. Drawing on three illustrative scenarios, we invite the CUI community to study explanations as interactive, conversational exchanges shaped by timing, tone, persona and conversational history, and introduce our vision for Human-Centered Conversational XAI (HC2XAI).

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Position paper that flags a real mismatch between static XAI studies and conversational use but offers only scenarios to support its main claim.

read the letter

The paper's core move is to treat explanations as dialogues rather than fixed artifacts, and it names timing, tone, persona, and history as elements that matter. That framing is not entirely new to CUI work, but applying it directly to XAI and labeling the result HC2XAI gives the community a short label to rally around.

What the paper does cleanly is lay out the gap: most XAI papers still evaluate explanations as one-shot outputs, while real users interact with them over turns. The three scenarios make that point concrete without overclaiming.

The soft spot is that the central assertion—that the conversational layer is a critical constituent rather than incidental—rests on those scenarios alone. No comparison, no user data, and no derivation shows the conversational pieces are load-bearing rather than nice-to-have. The authors present the piece as a provocation, so this is not a hidden flaw, but it does limit how far the argument travels.

The writing is direct and the citations stay within the relevant CUI and XAI literature. No circular math or invented entities appear.

This is for people already working at the intersection of conversational interfaces and explainability who want a prompt for the next design study. A reader looking for measured results or a formal framework will find little to cite. It deserves a serious referee round because position pieces like this can usefully shape what the field tries next, even if the current version stays at the level of a call to action.

Referee Report

0 major / 2 minor

Summary. The manuscript is a provocation arguing that explanations in AI are studied as static artifacts yet experienced as dialogues in conversational systems. It claims that the conversational layer around an explanation is not incidental but a critical constituent of effectiveness. The argument is advanced through three illustrative scenarios and culminates in a call for the CUI community to treat explanations as interactive exchanges shaped by timing, tone, persona, and conversational history, while introducing the vision of Human-Centered Conversational Explainable AI (HC2XAI).

Significance. If adopted, the perspective could usefully redirect XAI research toward dynamic, dialogue-based explanations that better match how users actually interact with conversational AI. The paper's value lies in its explicit framing as an invitation to new work rather than an empirical demonstration; its acknowledgment of the illustrative basis is a strength that keeps the contribution proportionate to the evidence supplied.

minor comments (2)

[Abstract] Abstract: the phrase 'critical constituent' is used without a short operational gloss; adding one sentence on what would count as evidence that conversation is load-bearing (versus merely present) would help readers evaluate the provocation.
[Introduction] The three illustrative scenarios are referenced but not summarized; a one-sentence capsule of each in the introduction would make the central claim easier to assess without requiring the reader to reach later sections.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for their positive and accurate summary of our provocation paper, which correctly identifies its core argument that explanations in conversational AI must be studied as interactive dialogues rather than static artifacts, along with the illustrative scenarios and the vision for HC2XAI. We appreciate the recognition that the paper's value lies in its framing as an invitation to new work and that its illustrative basis is proportionate to the contribution. The recommendation for minor revision is noted, though no specific major comments were provided in the report.

Circularity Check

0 steps flagged

No significant circularity

full rationale

The paper is framed as a provocation advancing a vision for HC2XAI, supported only by three illustrative scenarios rather than any derivation, model, equations, or fitted parameters. No load-bearing steps reduce claims to self-citations, self-definitions, or renamed inputs; the central position is explicitly presented as exploratory rather than derived from prior results by the same authors. The argument remains self-contained against external benchmarks with no internal reduction.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the domain assumption that conversational properties materially determine explanation effectiveness; no free parameters or invented entities are introduced.

axioms (1)

domain assumption Explanations are experienced as dialogue in practice while studied as static artifacts
Stated directly in the abstract as the motivating gap.

pith-pipeline@v0.9.1-grok · 5604 in / 1054 out tokens · 26086 ms · 2026-06-29T15:23:20.617104+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

67 extracted references · 16 canonical work pages · 2 internal anchors

[1]

Alafate Abulimiti, Paola R. Peña, Fatemeh Alizadeh, Shashank Ahire, Heloisa Candello, Smit Desai, Justin Edwards, Yuan He, Darragh Higgins, Alberto Jo- vane, Matthias Kraus, Guy Laban, Rachel McDonnell, Jairo Pérez-Osorio, Tanja Schneeberger, Jaisie Sin, Tobias Thejll-Madsen, Nima Zargham, and Benjamin R. Cowan. 2025. DEBP-PVA: Designing and Evaluating Be...

work page doi:10.1145/3742886.3758118 2025
[2]

Malihe Alikhani and Matthew Stone. 2020. Achieving common ground in multi- modal dialogue. InProceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts. 10–15

2020
[3]

Theo Araujo and Nadine Bol. 2024. From speaking like a person to being personal: The effects of personalized, regular interactions with conversational agents. Computers in Human Behavior: Artificial Humans2, 1 (2024), 100030

2024
[4]

Shoshana Blum-Kulka, Michal Hamo, and Talia Habib. 2010. Explanations in naturally occurring peer talk: Conversational emergence and function, thematic scope, and contribution to the development of discursive skills.First language 30, 3-4 (2010), 440–460

2010
[5]

Suman Chahar, Kuldeep Singh Kaswan, Meenakshi Sharma, and Jagjit Singh Dhatterwal. 2025. Research Exploration of Artificial Intelligence: The Black Box. In2025 International Conference on Intelligent and Secure Engineering Solutions (CISES). IEEE, 282–286

2025
[6]

Harmon Lee Bruce Chia. 2023. The emergence and need for explainable AI. Advances in Engineering Innovation3 (2023), 1–4

2023
[7]

Herbert H Clark and Edward F Schaefer. 1989. Contributing to discourse.Cogni- tive science13, 2 (1989), 259–294

1989
[8]

What can i help you with?

Benjamin R. Cowan, Nadia Pantidi, David Coyle, Kellie Morrissey, Peter Clarke, Sara Al-Shehri, David Earley, and Natasha Bandeira. 2017. “What can i help you with?”: infrequent users’ experiences of intelligent personal assistants. In Proceedings of the 19th International Conference on Human-Computer Interaction with Mobile Devices and Services (MobileHCI...

work page doi:10.1145/3098279.3098539 2017
[9]

Samuel Rhys Cox, Helena Bøjer Djernæs, and Niels van Berkel. 2025. Reflecting human values in XAI: Emotional and reflective benefits in creativity support tools.arXiv preprint arXiv:2506.17116(2025)

work page arXiv 2025
[10]

Samuel Rhys Cox, Joel Wester, and Niels van Berkel. 2026. Polite But Bor- ing? Trade-offs Between Engagement and Psychological Reactance to Chatbot Feedback Styles. arXiv:2601.20683 (Jan. 2026). doi:10.48550/arXiv.2601.20683 arXiv:2601.20683 [cs]

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2601.20683 2026
[11]

Smit Desai, Jessie Chin, Dakuo Wang, Benjamin Cowan, and Michael Twidale
[12]

arXiv:2502.11554 (Feb

Toward Metaphor-Fluid Conversation Design for Voice User Interfaces. arXiv:2502.11554 (Feb. 2025). doi:10.48550/arXiv.2502.11554 arXiv:2502.11554 [cs]

work page doi:10.48550/arxiv.2502.11554 2025
[13]

Smit Desai, Mateusz Dubiel, and Luis A Leiva. 2024. Examining humanness as a metaphor to design voice user interfaces. InProceedings of the 6th ACM Conference on Conversational User Interfaces. 1–15

2024
[14]

Smit Desai, Mateusz Dubiel, Nima Zargham, Thomas Mildner, and Laura Spillner
[15]

InProceedings of the 7th ACM Conference on Conversational User Interfaces

Personas evolved: Designing ethical LLM-based conversational agent personalities. InProceedings of the 7th ACM Conference on Conversational User Interfaces. 1–4
[16]

Smit Desai and Michael Twidale. 2023. Metaphors in voice user interfaces: a slippery fish.ACM Transactions on Computer-Human Interaction30, 6 (2023), 1–37

2023
[17]

Philip R Doyle, Leigh Clark, and Benjamin R. Cowan. 2021. What Do We See in Them? Identifying Dimensions of Partner Models for Speech Interfaces Using a Psycholexical Approach. InProceedings of the 2021 CHI Conference on Human Factors in Computing Systems (CHI ’21). Association for Computing Machinery, New York, NY, USA, 1–14. doi:10.1145/3411764.3445206

work page doi:10.1145/3411764.3445206 2021
[18]

Mateusz Dubiel, Sylvain Daronnat, and Luis A. Leiva. 2022. Conversational Agents Trust Calibration: A User-Centred Perspective to Design. InProceedings of the 4th Conference on Conversational User Interfaces (CUI ’22). Association for Computing Machinery, New York, NY, USA, 1–6. doi:10.1145/3543829.3544518

work page doi:10.1145/3543829.3544518 2022
[19]

Upol Ehsan, Brent Harrison, Larry Chan, and Mark O Riedl. 2018. Rationalization: A neural machine translation approach to generating natural language explana- tions. InProceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society. 81–87

2018
[20]

Upol Ehsan, Q Vera Liao, Michael Muller, Mark O Riedl, and Justin D Weisz
[21]

In Proceedings of the 2021 CHI conference on human factors in computing systems

Expanding explainability: Towards social transparency in ai systems. In Proceedings of the 2021 CHI conference on human factors in computing systems. 1–19

2021
[22]

Upol Ehsan, Samir Passi, Q Vera Liao, Larry Chan, I-Hsiang Lee, Michael Muller, and Mark O Riedl. 2024. The who in XAI: how AI background shapes perceptions of AI explanations. InProceedings of the 2024 CHI Conference on Human Factors in Computing Systems. 1–32

2024
[23]

Upol Ehsan, Pradyumna Tambwekar, Larry Chan, Brent Harrison, and Mark O Riedl. 2019. Automated rationale generation: a technique for explainable AI and its effects on human perceptions. InProceedings of the 24th international conference on intelligent user interfaces. 263–274

2019
[24]

Sharon Ferguson, Paula Akemi Aoyagui, Rimsha Rizvi, Young-Ho Kim, and Anastasia Kuzminykh. 2024. The explanation that hits home: the characteristics of verbal explanations that affect human perception in subjective decision-making. Proceedings of the ACM on Human-Computer Interaction8, CSCW2 (2024), 1–37

2024
[25]

Raymond Fok and Daniel S Weld. 2024. In search of verifiability: Explanations rarely enable complementary performance in AI-advised decision making.AI Magazine45, 3 (2024), 317–332

2024
[26]

Anna Viktorovna Gavrilova and Carlo Galli. 2026. Conversing with machines: How AI is changing the way scientists think.Quantitative Biology14, 2 (2026)

2026
[27]

Julie Gerlings, Millie Søndergaard Jensen, and Arisa Shollo. 2021. Explainable AI, but explainable to whom? An exploratory case study of xAI in healthcare. In Handbook of Artificial Intelligence in Healthcare: Vol 2: Practicalities and Prospects. Springer, 169–198

2021
[28]

Martin Gjoreski, Matias Laporte, Marc Langheinrich, and Tim Miller. 2024. How to Validate XAI in Longitudinal Studies?. InCompanion of the 2024 on ACM international joint conference on pervasive and ubiquitous computing. 866–869

2024
[29]

Shirley Gregor and Izak Benbasat. 1999. Explanations From Intelligent Systems: Theoretical Foundations and Implications for Practice1.MIS quarterly23, 4 (1999), 497–530

1999
[30]

David Gunning and David Aha. 2019. DARPA’s explainable artificial intelligence (XAI) program.AI magazine40, 2 (2019), 44–58

2019
[31]

Jyoti Gupta and KR Seeja. 2024. A comparative study and systematic analysis of XAI models and their applications in healthcare.Archives of Computational Methods in Engineering31, 7 (2024), 3977–4002

2024
[32]

Gaole He, Nilay Aishwarya, and Ujwal Gadiraju. 2025. Is conversational XAI all you need? Human-AI decision making with a conversational XAI assistant. InProceedings of the 30th international conference on intelligent user interfaces. 907–924

2025
[33]

Sophie F Jentzsch, Sviatlana Höhn, and Nico Hochgeschwender. 2019. Conversa- tional interfaces for explainable AI: a human-centred approach. InInternational workshop on explainable, transparent autonomous agents and multi-agent systems. Springer, 77–92

2019
[34]

M Kedar. 2024. Exploring the Effectiveness of SHAP over other Explainable AI Methods.Int. J. Sci. Res. Eng. Manag8 (2024)

2024
[35]

help me help the ai

Sunnie SY Kim, Elizabeth Anne Watkins, Olga Russakovsky, Ruth Fong, and Andrés Monroy-Hernández. 2023. " help me help the ai": Understanding how explainability can support human-ai interaction. Inproceedings of the 2023 CHI conference on human factors in computing systems. 1–17

2023
[36]

Patrick Knab, Sascha Marton, Udo Schlegel, and Christian Bartelt. 2025. Which lime should i trust? concepts, challenges, and solutions. InWorld Conference on Explainable Artificial Intelligence. Springer, 28–52

2025
[37]

Todd Kulesza, Simone Stumpf, Margaret Burnett, Sherry Yang, Irwin Kwan, and Weng-Keen Wong. 2013. Too much, too little, or just right? Ways explanations impact end users’ mental models. In2013 IEEE Symposium on visual languages and human centric computing. IEEE, 3–10

2013
[38]

Jiachen Li, Bingrui Zong, Tingyu Cheng, Yunzhi Li, Elizabeth D Mynatt, and Ashutosh Dhekne. 2023. Privacy vs. awareness: Relieving the tension between older adults and adult children when sharing in-home activity data.Proceedings of the ACM on Human-Computer Interaction7, CSCW2 (2023), 1–30

2023
[39]

Q Vera Liao, Daniel Gruen, and Sarah Miller. 2020. Questioning the AI: informing design practices for explainable AI user experiences. InProceedings of the 2020 CHI conference on human factors in computing systems. 1–15. Toward Human-Centered Conversational XAI CUI ’26, July 21–24, 2026, Bremen, Germany

2020
[40]

Duri Long, Jessica Roberts, Brian Magerko, Kenneth Holstein, Daniella DiPaola, and Fred Martin. 2023. AI literacy: Finding common threads between education, design, policy, and explainability. InExtended Abstracts of the 2023 CHI Conference on Human Factors in Computing Systems. 1–6

2023
[41]

Tao Long, Sitong Wang, Émilie Fabre, Tony Wang, Anup Sathya, Jason Wu, Savvas Dimitrios Petridis, Ding Li, Tuhin Chakrabarty, Yue Jiang, et al . 2025. Facilitating Longitudinal Interaction Studies of AI Systems. InAdjunct Proceedings of the 38th Annual ACM Symposium on User Interface Software and Technology. 1–5

2025
[42]

Like Having a Really Bad PA

Ewa Luger and Abigail Sellen. 2016. “Like Having a Really Bad PA”: The Gulf Between User Expectation and Experience of Conversational Agents. InProceed- ings of the 2016 CHI Conference on Human Factors in Computing Systems (CHI ’16). ACM, New York, NY, USA, 5286–5297. doi:10.1145/2858036.2858288 event-place: San Jose, California, USA

work page doi:10.1145/2858036.2858288 2016
[43]

Who wants to be nagged by AI?

Niharika Mathur, Hasibur Rahman, and Smit Desai. 2026. " Who wants to be nagged by AI?": Investigating the Effects of Agreeableness on Older Adults’ Perception of LLM-Based Voice Assistants’ Explanations.arXiv preprint arXiv:2603.09012(2026)

work page arXiv 2026
[44]

Sometimes You Need Facts, and Sometimes a Hug

Niharika Mathur, Tamara Zubatiy, Agata Rozga, Jodi Forlizzi, and Elizabeth Mynatt. 2025. " Sometimes You Need Facts, and Sometimes a Hug": Understanding Older Adults’ Preferences for Explanations in LLM-Based Conversational AI Systems.arXiv preprint arXiv:2510.06697(2025)

work page arXiv 2025
[45]

Why Did You Say That?

Niharika Mathur, Tamara Zubatiy, Agata Rozga, and Elizabeth Mynatt. 2023. “Why Did You Say That?”: Understanding Explainability in Conversational AI Systems for Older Adults with Mild Cognitive Impairment (MCI). InInternational Conference on Ubiquitous Computing and Ambient Intelligence. Springer, 208–214

2023
[46]

Tim Miller. 2019. Explanation in artificial intelligence: Insights from the social sciences.Artificial intelligence267 (2019), 1–38

2019
[47]

Tim Miller. 2023. Explainable ai is dead, long live explainable ai! hypothesis- driven decision support using evaluative ai. InProceedings of the 2023 ACM conference on fairness, accountability, and transparency. 333–342

2023
[48]

Mohammad Namvarpour and Afsaneh Razi. 2025. The Art of Talking Machines: A Comprehensive Literature Review of Conversational User Interfaces. InPro- ceedings of the 7th ACM Conference on Conversational User Interfaces. 1–18

2025
[49]

Animesh Nighojkar, Bekhzodbek Moydinboyev, My Duong, and John Licato
[50]

Giving ai personalities leads to more human-like reasoning.arXiv preprint arXiv:2502.14155(2025)

work page arXiv 2025
[51]

Hasibur Rahman and Smit Desai. 2025. Vibe Check: Understanding the Effects of LLM-Based Conversational Agents’ Personality and Alignment on User Percep- tions in Goal-Oriented Tasks. arXiv:2509.09870 (Sept. 2025). doi:10.48550/arXiv. 2509.09870 arXiv:2509.09870 [cs]

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv 2025
[52]

Minjin Rheu, Ji Youn Shin, Wei Peng, and Jina Huh-Yoo. 2021. Systematic re- view: Trust-building factors and implications for conversational agent design. International Journal of Human–Computer Interaction37, 1 (2021), 81–96

2021
[53]

Yao Rong, Tobias Leemann, Thai-Trang Nguyen, Lisa Fiedler, Peizhu Qian, Vaib- hav Unhelkar, Tina Seidel, Gjergji Kasneci, and Enkelejda Kasneci. 2023. Towards human-centered explainable ai: A survey of user studies for model explana- tions.IEEE transactions on pattern analysis and machine intelligence46, 4 (2023), 2104–2122

2023
[54]

Rikard Rosenbacke, Åsa Melhus, Martin McKee, and David Stuckler. 2024. How explainable artificial intelligence can increase or decrease clinicians’ trust in AI applications in health care: systematic review.Jmir Ai3 (2024), e53207

2024
[55]

Jae-Eun Russell, Anna Marie Smith, Salim George, Jonah Pratt, Brian Fodale, Cassandra Monk, and Adam Brummett. 2025. Unlocking Insights: Investigating Student AI Tutor Interactions in a Large Introductory STEM Course. InPro- ceedings of the 15th International Learning Analytics and Knowledge Conference (LAK ’25). Association for Computing Machinery, New Y...

work page doi:10.1145/3706468.3706524 2025
[56]

Ute Schmid and Britta Wrede. 2022. What is missing in XAI so far? An interdis- ciplinary perspective.KI-Künstliche Intelligenz36, 3 (2022), 303–315

2022
[57]

Hua Shen, Chieh-Yang Huang, Tongshuang Wu, and Ting-Hao Kenneth Huang
[58]

InCompanion publication of the 2023 conference on computer supported cooperative work and social computing

ConvXAI: Delivering heterogeneous AI explanations via conversations to support human-AI scientific writing. InCompanion publication of the 2023 conference on computer supported cooperative work and social computing. 384–387

2023
[59]

Eleanor Palo Stoller. 1993. Interpretations of symptoms by older people: A health diary study of illness behavior.Journal of Aging and Health5, 1 (1993), 58–81

1993
[60]

Nipuna Thalpage. 2023. Unlocking the black box: Explainable artificial intelli- gence (XAI) for trust and transparency in ai systems.J. Digit. Art Humanit4, 1 (2023), 31–36

2023
[61]

Qiaosi Wang, Koustuv Saha, Eric Gregori, David Joyner, and Ashok Goel. 2021. Towards Mutual Theory of Mind in Human-AI Interaction: How Language Reflects What Students Perceive About a Virtual Teaching Assistant. InPro- ceedings of the 2021 CHI Conference on Human Factors in Computing Systems (CHI ’21). Association for Computing Machinery, New York, NY, U...

work page doi:10.1145/3411764.3445645 2021
[62]

Christina Ziying Wei, Young-Ho Kim, and Anastasia Kuzminykh. 2023. The Bot on Speaking Terms: The Effects of Conversation Architecture on Perceptions of Conversational Agents. InProceedings of the 5th International Conference on Conversational User Interfaces (CUI ’23). Association for Computing Machinery, New York, NY, USA, 1–16. doi:10.1145/3571884.3597139

work page doi:10.1145/3571884.3597139 2023
[63]

Miaoxiang Yi. 2024. Revolutionizing interaction: the role of artificial intelligent conversation agents in human-computer interaction. InFourth International Con- ference on Signal Processing and Machine Learning (CONF-SPML 2024), Vol. 13077. SPIE, 192–201

2024
[64]

Setareh Zafari, Jesse de Pagter, Guglielmo Papagni, Alischa Rosenstein, Michael Filzmoser, and Sabine T Koeszegi. 2024. Trust development and explainability: A longitudinal study with a personalized assistive system.Multimodal Technologies and Interaction8, 3 (2024), 20

2024
[65]

Nima Zargham, Leon Reicherts, Michael Bonfert, Sarah Theres Voelkel, Johannes Schoening, Rainer Malaka, and Yvonne Rogers. 2022. Understanding Circum- stances for Desirable Proactive Behaviour of Voice Assistants: The Proactivity Dilemma. InProceedings of the 4th Conference on Conversational User Interfaces (CUI ’22). Association for Computing Machinery, ...

work page doi:10.1145/3543829.3543834 2022
[66]

John Zerilli. 2022. Explaining machine learning decisions.Philosophy of Science 89, 1 (2022), 1–19

2022
[67]

Tong Zhang, Mengao Zhang, Wei Yan Low, X Jessie Yang, and Boyang Albert Li. 2025. Conversational explanations: discussing explainable AI with non-AI experts. InProceedings of the 30th international conference on intelligent user interfaces. 409–424

2025

[1] [1]

Alafate Abulimiti, Paola R. Peña, Fatemeh Alizadeh, Shashank Ahire, Heloisa Candello, Smit Desai, Justin Edwards, Yuan He, Darragh Higgins, Alberto Jo- vane, Matthias Kraus, Guy Laban, Rachel McDonnell, Jairo Pérez-Osorio, Tanja Schneeberger, Jaisie Sin, Tobias Thejll-Madsen, Nima Zargham, and Benjamin R. Cowan. 2025. DEBP-PVA: Designing and Evaluating Be...

work page doi:10.1145/3742886.3758118 2025

[2] [2]

Malihe Alikhani and Matthew Stone. 2020. Achieving common ground in multi- modal dialogue. InProceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts. 10–15

2020

[3] [3]

Theo Araujo and Nadine Bol. 2024. From speaking like a person to being personal: The effects of personalized, regular interactions with conversational agents. Computers in Human Behavior: Artificial Humans2, 1 (2024), 100030

2024

[4] [4]

Shoshana Blum-Kulka, Michal Hamo, and Talia Habib. 2010. Explanations in naturally occurring peer talk: Conversational emergence and function, thematic scope, and contribution to the development of discursive skills.First language 30, 3-4 (2010), 440–460

2010

[5] [5]

Suman Chahar, Kuldeep Singh Kaswan, Meenakshi Sharma, and Jagjit Singh Dhatterwal. 2025. Research Exploration of Artificial Intelligence: The Black Box. In2025 International Conference on Intelligent and Secure Engineering Solutions (CISES). IEEE, 282–286

2025

[6] [6]

Harmon Lee Bruce Chia. 2023. The emergence and need for explainable AI. Advances in Engineering Innovation3 (2023), 1–4

2023

[7] [7]

Herbert H Clark and Edward F Schaefer. 1989. Contributing to discourse.Cogni- tive science13, 2 (1989), 259–294

1989

[8] [8]

What can i help you with?

Benjamin R. Cowan, Nadia Pantidi, David Coyle, Kellie Morrissey, Peter Clarke, Sara Al-Shehri, David Earley, and Natasha Bandeira. 2017. “What can i help you with?”: infrequent users’ experiences of intelligent personal assistants. In Proceedings of the 19th International Conference on Human-Computer Interaction with Mobile Devices and Services (MobileHCI...

work page doi:10.1145/3098279.3098539 2017

[9] [9]

Samuel Rhys Cox, Helena Bøjer Djernæs, and Niels van Berkel. 2025. Reflecting human values in XAI: Emotional and reflective benefits in creativity support tools.arXiv preprint arXiv:2506.17116(2025)

work page arXiv 2025

[10] [10]

Samuel Rhys Cox, Joel Wester, and Niels van Berkel. 2026. Polite But Bor- ing? Trade-offs Between Engagement and Psychological Reactance to Chatbot Feedback Styles. arXiv:2601.20683 (Jan. 2026). doi:10.48550/arXiv.2601.20683 arXiv:2601.20683 [cs]

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2601.20683 2026

[11] [11]

Smit Desai, Jessie Chin, Dakuo Wang, Benjamin Cowan, and Michael Twidale

[12] [12]

arXiv:2502.11554 (Feb

Toward Metaphor-Fluid Conversation Design for Voice User Interfaces. arXiv:2502.11554 (Feb. 2025). doi:10.48550/arXiv.2502.11554 arXiv:2502.11554 [cs]

work page doi:10.48550/arxiv.2502.11554 2025

[13] [13]

Smit Desai, Mateusz Dubiel, and Luis A Leiva. 2024. Examining humanness as a metaphor to design voice user interfaces. InProceedings of the 6th ACM Conference on Conversational User Interfaces. 1–15

2024

[14] [14]

Smit Desai, Mateusz Dubiel, Nima Zargham, Thomas Mildner, and Laura Spillner

[15] [15]

InProceedings of the 7th ACM Conference on Conversational User Interfaces

Personas evolved: Designing ethical LLM-based conversational agent personalities. InProceedings of the 7th ACM Conference on Conversational User Interfaces. 1–4

[16] [16]

Smit Desai and Michael Twidale. 2023. Metaphors in voice user interfaces: a slippery fish.ACM Transactions on Computer-Human Interaction30, 6 (2023), 1–37

2023

[17] [17]

Philip R Doyle, Leigh Clark, and Benjamin R. Cowan. 2021. What Do We See in Them? Identifying Dimensions of Partner Models for Speech Interfaces Using a Psycholexical Approach. InProceedings of the 2021 CHI Conference on Human Factors in Computing Systems (CHI ’21). Association for Computing Machinery, New York, NY, USA, 1–14. doi:10.1145/3411764.3445206

work page doi:10.1145/3411764.3445206 2021

[18] [18]

Mateusz Dubiel, Sylvain Daronnat, and Luis A. Leiva. 2022. Conversational Agents Trust Calibration: A User-Centred Perspective to Design. InProceedings of the 4th Conference on Conversational User Interfaces (CUI ’22). Association for Computing Machinery, New York, NY, USA, 1–6. doi:10.1145/3543829.3544518

work page doi:10.1145/3543829.3544518 2022

[19] [19]

Upol Ehsan, Brent Harrison, Larry Chan, and Mark O Riedl. 2018. Rationalization: A neural machine translation approach to generating natural language explana- tions. InProceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society. 81–87

2018

[20] [20]

Upol Ehsan, Q Vera Liao, Michael Muller, Mark O Riedl, and Justin D Weisz

[21] [21]

In Proceedings of the 2021 CHI conference on human factors in computing systems

Expanding explainability: Towards social transparency in ai systems. In Proceedings of the 2021 CHI conference on human factors in computing systems. 1–19

2021

[22] [22]

Upol Ehsan, Samir Passi, Q Vera Liao, Larry Chan, I-Hsiang Lee, Michael Muller, and Mark O Riedl. 2024. The who in XAI: how AI background shapes perceptions of AI explanations. InProceedings of the 2024 CHI Conference on Human Factors in Computing Systems. 1–32

2024

[23] [23]

Upol Ehsan, Pradyumna Tambwekar, Larry Chan, Brent Harrison, and Mark O Riedl. 2019. Automated rationale generation: a technique for explainable AI and its effects on human perceptions. InProceedings of the 24th international conference on intelligent user interfaces. 263–274

2019

[24] [24]

Sharon Ferguson, Paula Akemi Aoyagui, Rimsha Rizvi, Young-Ho Kim, and Anastasia Kuzminykh. 2024. The explanation that hits home: the characteristics of verbal explanations that affect human perception in subjective decision-making. Proceedings of the ACM on Human-Computer Interaction8, CSCW2 (2024), 1–37

2024

[25] [25]

Raymond Fok and Daniel S Weld. 2024. In search of verifiability: Explanations rarely enable complementary performance in AI-advised decision making.AI Magazine45, 3 (2024), 317–332

2024

[26] [26]

Anna Viktorovna Gavrilova and Carlo Galli. 2026. Conversing with machines: How AI is changing the way scientists think.Quantitative Biology14, 2 (2026)

2026

[27] [27]

Julie Gerlings, Millie Søndergaard Jensen, and Arisa Shollo. 2021. Explainable AI, but explainable to whom? An exploratory case study of xAI in healthcare. In Handbook of Artificial Intelligence in Healthcare: Vol 2: Practicalities and Prospects. Springer, 169–198

2021

[28] [28]

Martin Gjoreski, Matias Laporte, Marc Langheinrich, and Tim Miller. 2024. How to Validate XAI in Longitudinal Studies?. InCompanion of the 2024 on ACM international joint conference on pervasive and ubiquitous computing. 866–869

2024

[29] [29]

Shirley Gregor and Izak Benbasat. 1999. Explanations From Intelligent Systems: Theoretical Foundations and Implications for Practice1.MIS quarterly23, 4 (1999), 497–530

1999

[30] [30]

David Gunning and David Aha. 2019. DARPA’s explainable artificial intelligence (XAI) program.AI magazine40, 2 (2019), 44–58

2019

[31] [31]

Jyoti Gupta and KR Seeja. 2024. A comparative study and systematic analysis of XAI models and their applications in healthcare.Archives of Computational Methods in Engineering31, 7 (2024), 3977–4002

2024

[32] [32]

Gaole He, Nilay Aishwarya, and Ujwal Gadiraju. 2025. Is conversational XAI all you need? Human-AI decision making with a conversational XAI assistant. InProceedings of the 30th international conference on intelligent user interfaces. 907–924

2025

[33] [33]

Sophie F Jentzsch, Sviatlana Höhn, and Nico Hochgeschwender. 2019. Conversa- tional interfaces for explainable AI: a human-centred approach. InInternational workshop on explainable, transparent autonomous agents and multi-agent systems. Springer, 77–92

2019

[34] [34]

M Kedar. 2024. Exploring the Effectiveness of SHAP over other Explainable AI Methods.Int. J. Sci. Res. Eng. Manag8 (2024)

2024

[35] [35]

help me help the ai

Sunnie SY Kim, Elizabeth Anne Watkins, Olga Russakovsky, Ruth Fong, and Andrés Monroy-Hernández. 2023. " help me help the ai": Understanding how explainability can support human-ai interaction. Inproceedings of the 2023 CHI conference on human factors in computing systems. 1–17

2023

[36] [36]

Patrick Knab, Sascha Marton, Udo Schlegel, and Christian Bartelt. 2025. Which lime should i trust? concepts, challenges, and solutions. InWorld Conference on Explainable Artificial Intelligence. Springer, 28–52

2025

[37] [37]

Todd Kulesza, Simone Stumpf, Margaret Burnett, Sherry Yang, Irwin Kwan, and Weng-Keen Wong. 2013. Too much, too little, or just right? Ways explanations impact end users’ mental models. In2013 IEEE Symposium on visual languages and human centric computing. IEEE, 3–10

2013

[38] [38]

Jiachen Li, Bingrui Zong, Tingyu Cheng, Yunzhi Li, Elizabeth D Mynatt, and Ashutosh Dhekne. 2023. Privacy vs. awareness: Relieving the tension between older adults and adult children when sharing in-home activity data.Proceedings of the ACM on Human-Computer Interaction7, CSCW2 (2023), 1–30

2023

[39] [39]

Q Vera Liao, Daniel Gruen, and Sarah Miller. 2020. Questioning the AI: informing design practices for explainable AI user experiences. InProceedings of the 2020 CHI conference on human factors in computing systems. 1–15. Toward Human-Centered Conversational XAI CUI ’26, July 21–24, 2026, Bremen, Germany

2020

[40] [40]

Duri Long, Jessica Roberts, Brian Magerko, Kenneth Holstein, Daniella DiPaola, and Fred Martin. 2023. AI literacy: Finding common threads between education, design, policy, and explainability. InExtended Abstracts of the 2023 CHI Conference on Human Factors in Computing Systems. 1–6

2023

[41] [41]

Tao Long, Sitong Wang, Émilie Fabre, Tony Wang, Anup Sathya, Jason Wu, Savvas Dimitrios Petridis, Ding Li, Tuhin Chakrabarty, Yue Jiang, et al . 2025. Facilitating Longitudinal Interaction Studies of AI Systems. InAdjunct Proceedings of the 38th Annual ACM Symposium on User Interface Software and Technology. 1–5

2025

[42] [42]

Like Having a Really Bad PA

Ewa Luger and Abigail Sellen. 2016. “Like Having a Really Bad PA”: The Gulf Between User Expectation and Experience of Conversational Agents. InProceed- ings of the 2016 CHI Conference on Human Factors in Computing Systems (CHI ’16). ACM, New York, NY, USA, 5286–5297. doi:10.1145/2858036.2858288 event-place: San Jose, California, USA

work page doi:10.1145/2858036.2858288 2016

[43] [43]

Who wants to be nagged by AI?

Niharika Mathur, Hasibur Rahman, and Smit Desai. 2026. " Who wants to be nagged by AI?": Investigating the Effects of Agreeableness on Older Adults’ Perception of LLM-Based Voice Assistants’ Explanations.arXiv preprint arXiv:2603.09012(2026)

work page arXiv 2026

[44] [44]

Sometimes You Need Facts, and Sometimes a Hug

Niharika Mathur, Tamara Zubatiy, Agata Rozga, Jodi Forlizzi, and Elizabeth Mynatt. 2025. " Sometimes You Need Facts, and Sometimes a Hug": Understanding Older Adults’ Preferences for Explanations in LLM-Based Conversational AI Systems.arXiv preprint arXiv:2510.06697(2025)

work page arXiv 2025

[45] [45]

Why Did You Say That?

Niharika Mathur, Tamara Zubatiy, Agata Rozga, and Elizabeth Mynatt. 2023. “Why Did You Say That?”: Understanding Explainability in Conversational AI Systems for Older Adults with Mild Cognitive Impairment (MCI). InInternational Conference on Ubiquitous Computing and Ambient Intelligence. Springer, 208–214

2023

[46] [46]

Tim Miller. 2019. Explanation in artificial intelligence: Insights from the social sciences.Artificial intelligence267 (2019), 1–38

2019

[47] [47]

Tim Miller. 2023. Explainable ai is dead, long live explainable ai! hypothesis- driven decision support using evaluative ai. InProceedings of the 2023 ACM conference on fairness, accountability, and transparency. 333–342

2023

[48] [48]

Mohammad Namvarpour and Afsaneh Razi. 2025. The Art of Talking Machines: A Comprehensive Literature Review of Conversational User Interfaces. InPro- ceedings of the 7th ACM Conference on Conversational User Interfaces. 1–18

2025

[49] [49]

Animesh Nighojkar, Bekhzodbek Moydinboyev, My Duong, and John Licato

[50] [50]

Giving ai personalities leads to more human-like reasoning.arXiv preprint arXiv:2502.14155(2025)

work page arXiv 2025

[51] [51]

Hasibur Rahman and Smit Desai. 2025. Vibe Check: Understanding the Effects of LLM-Based Conversational Agents’ Personality and Alignment on User Percep- tions in Goal-Oriented Tasks. arXiv:2509.09870 (Sept. 2025). doi:10.48550/arXiv. 2509.09870 arXiv:2509.09870 [cs]

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv 2025

[52] [52]

Minjin Rheu, Ji Youn Shin, Wei Peng, and Jina Huh-Yoo. 2021. Systematic re- view: Trust-building factors and implications for conversational agent design. International Journal of Human–Computer Interaction37, 1 (2021), 81–96

2021

[53] [53]

Yao Rong, Tobias Leemann, Thai-Trang Nguyen, Lisa Fiedler, Peizhu Qian, Vaib- hav Unhelkar, Tina Seidel, Gjergji Kasneci, and Enkelejda Kasneci. 2023. Towards human-centered explainable ai: A survey of user studies for model explana- tions.IEEE transactions on pattern analysis and machine intelligence46, 4 (2023), 2104–2122

2023

[54] [54]

Rikard Rosenbacke, Åsa Melhus, Martin McKee, and David Stuckler. 2024. How explainable artificial intelligence can increase or decrease clinicians’ trust in AI applications in health care: systematic review.Jmir Ai3 (2024), e53207

2024

[55] [55]

Jae-Eun Russell, Anna Marie Smith, Salim George, Jonah Pratt, Brian Fodale, Cassandra Monk, and Adam Brummett. 2025. Unlocking Insights: Investigating Student AI Tutor Interactions in a Large Introductory STEM Course. InPro- ceedings of the 15th International Learning Analytics and Knowledge Conference (LAK ’25). Association for Computing Machinery, New Y...

work page doi:10.1145/3706468.3706524 2025

[56] [56]

Ute Schmid and Britta Wrede. 2022. What is missing in XAI so far? An interdis- ciplinary perspective.KI-Künstliche Intelligenz36, 3 (2022), 303–315

2022

[57] [57]

Hua Shen, Chieh-Yang Huang, Tongshuang Wu, and Ting-Hao Kenneth Huang

[58] [58]

InCompanion publication of the 2023 conference on computer supported cooperative work and social computing

ConvXAI: Delivering heterogeneous AI explanations via conversations to support human-AI scientific writing. InCompanion publication of the 2023 conference on computer supported cooperative work and social computing. 384–387

2023

[59] [59]

Eleanor Palo Stoller. 1993. Interpretations of symptoms by older people: A health diary study of illness behavior.Journal of Aging and Health5, 1 (1993), 58–81

1993

[60] [60]

Nipuna Thalpage. 2023. Unlocking the black box: Explainable artificial intelli- gence (XAI) for trust and transparency in ai systems.J. Digit. Art Humanit4, 1 (2023), 31–36

2023

[61] [61]

Qiaosi Wang, Koustuv Saha, Eric Gregori, David Joyner, and Ashok Goel. 2021. Towards Mutual Theory of Mind in Human-AI Interaction: How Language Reflects What Students Perceive About a Virtual Teaching Assistant. InPro- ceedings of the 2021 CHI Conference on Human Factors in Computing Systems (CHI ’21). Association for Computing Machinery, New York, NY, U...

work page doi:10.1145/3411764.3445645 2021

[62] [62]

Christina Ziying Wei, Young-Ho Kim, and Anastasia Kuzminykh. 2023. The Bot on Speaking Terms: The Effects of Conversation Architecture on Perceptions of Conversational Agents. InProceedings of the 5th International Conference on Conversational User Interfaces (CUI ’23). Association for Computing Machinery, New York, NY, USA, 1–16. doi:10.1145/3571884.3597139

work page doi:10.1145/3571884.3597139 2023

[63] [63]

Miaoxiang Yi. 2024. Revolutionizing interaction: the role of artificial intelligent conversation agents in human-computer interaction. InFourth International Con- ference on Signal Processing and Machine Learning (CONF-SPML 2024), Vol. 13077. SPIE, 192–201

2024

[64] [64]

Setareh Zafari, Jesse de Pagter, Guglielmo Papagni, Alischa Rosenstein, Michael Filzmoser, and Sabine T Koeszegi. 2024. Trust development and explainability: A longitudinal study with a personalized assistive system.Multimodal Technologies and Interaction8, 3 (2024), 20

2024

[65] [65]

Nima Zargham, Leon Reicherts, Michael Bonfert, Sarah Theres Voelkel, Johannes Schoening, Rainer Malaka, and Yvonne Rogers. 2022. Understanding Circum- stances for Desirable Proactive Behaviour of Voice Assistants: The Proactivity Dilemma. InProceedings of the 4th Conference on Conversational User Interfaces (CUI ’22). Association for Computing Machinery, ...

work page doi:10.1145/3543829.3543834 2022

[66] [66]

John Zerilli. 2022. Explaining machine learning decisions.Philosophy of Science 89, 1 (2022), 1–19

2022

[67] [67]

Tong Zhang, Mengao Zhang, Wei Yan Low, X Jessie Yang, and Boyang Albert Li. 2025. Conversational explanations: discussing explainable AI with non-AI experts. InProceedings of the 30th international conference on intelligent user interfaces. 409–424

2025