Dialogue based Interactive Explanations for Safety Decisions in Human Robot Collaboration

Akilu Yunusa Kaltungo; Clara Cheung; Kota Fujimoto; Ming Shan Ng; Tsukasa Ishizawa; Xiao Zhan; Yifan Xu

arxiv: 2604.05896 · v2 · submitted 2026-04-07 · 💻 cs.RO · cs.HC

Dialogue based Interactive Explanations for Safety Decisions in Human Robot Collaboration

Yifan Xu , Xiao Zhan , Akilu Yunusa Kaltungo , Ming Shan Ng , Tsukasa Ishizawa , Kota Fujimoto , Clara Cheung This is my paper

Pith reviewed 2026-05-10 18:49 UTC · model grok-4.3

classification 💻 cs.RO cs.HC

keywords human-robot collaborationsafety explanationsinteractive dialoguedecision tracesconstraint-based safetycounterfactual queriesconstruction robotics

0 comments

The pith

A dialogue framework derives robot safety explanations directly from decision traces to support why, why-not, and bounded what-if queries without relaxing certified constraints.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces a dialogue-based framework that couples explanation generation with the constraint-based safety evaluation already used to select robot behaviors in human-robot collaboration. Explanations come straight from the recorded decision trace, letting nearby workers ask causal, contrastive, and counterfactual questions about interventions such as stops or mode switches. All counterfactual exploration stays bounded inside the same fixed, certified safety parameters that govern real operation, so interactive use cannot weaken guarantees. A reader would care because safety actions in shared workspaces often appear opaque to humans, and this method turns the internal safety logic into an operational interface that can support coordinated task recovery.

Core claim

The central claim is that explanations are derived directly from the recorded decision trace of constraint-based safety evaluation, enabling users to pose causal (Why?), contrastive (Why not?), and counterfactual (What if?) queries about safety interventions. Counterfactual reasoning is evaluated in a bounded manner under fixed, certified safety parameters, ensuring that interactive exploration does not relax operational guarantees. The framework is instantiated in a construction robotics scenario, with an operational trace showing how constraint-aware dialogue clarifies interventions and supports coordinated task recovery.

What carries the argument

The recorded decision trace from constraint-based safety evaluation, which grounds dialogue responses to causal, contrastive, and bounded counterfactual queries about interventions.

Load-bearing premise

Explanations generated from the constraint traces will be intelligible and actionable for human collaborators, and bounded counterfactual queries can be answered without inadvertently relaxing the certified safety parameters.

What would settle it

A user study in which participants cannot correctly interpret or act on the provided explanations, or in which a counterfactual query produces a safety parameter value outside the certified bounds.

Figures

Figures reproduced from arXiv: 2604.05896 by Akilu Yunusa Kaltungo, Clara Cheung, Kota Fujimoto, Ming Shan Ng, Tsukasa Ishizawa, Xiao Zhan, Yifan Xu.

**Figure 3.** Figure 3: Dialogue-based safety in the construction scenario. [PITH_FULL_IMAGE:figures/full_fig_p003_3.png] view at source ↗

read the original abstract

As robots increasingly operate in shared, safety critical environments, acting safely is no longer sufficient robots must also make their safety decisions intelligible to human collaborators. In human robot collaboration (HRC), behaviours such as stopping or switching modes are often triggered by internal safety constraints that remain opaque to nearby workers. We present a dialogue based framework for interactive explanation of safety decisions in HRC. The approach tightly couples explanation with constraint based safety evaluation, grounding dialogue in the same state and constraint representations that govern behaviour selection. Explanations are derived directly from the recorded decision trace, enabling users to pose causal ("Why?"), contrastive ("Why not?"), and counterfactual ("What if?") queries about safety interventions. Counterfactual reasoning is evaluated in a bounded manner under fixed, certified safety parameters, ensuring that interactive exploration does not relax operational guarantees. We instantiate the framework in a construction robotics scenario and provide a structured operational trace illustrating how constraint aware dialogue clarifies safety interventions and supports coordinated task recovery. By treating explanation as an operational interface to safety control, this work advances a design perspective for interactive, safety aware autonomy in HRC.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 1 minor

Summary. The manuscript presents a conceptual framework for dialogue-based interactive explanations of safety decisions in human-robot collaboration (HRC). Explanations are derived directly from recorded decision traces of constraint-based safety evaluations, enabling causal (Why?), contrastive (Why not?), and counterfactual (What if?) queries. Counterfactual reasoning is bounded under fixed, certified safety parameters to preserve operational guarantees. The framework is instantiated in a construction robotics scenario with a structured operational trace illustrating clarification of safety interventions and support for coordinated task recovery.

Significance. If the framework can be implemented with verifiable mechanisms for query processing and dialogue generation, it would advance safety-aware autonomy by treating explanation as an operational interface to constraint-based control. The grounding in decision traces and explicit bounding of counterfactuals distinguish this from post-hoc methods and address opacity in shared safety-critical environments. The design perspective is a strength, though its practical value depends on future validation of intelligibility and actionability.

major comments (1)

[Instantiation in construction robotics scenario] Instantiation in construction robotics scenario: the central claim that the framework clarifies safety interventions and supports coordinated task recovery rests on a single illustrative trace. No implementation details, query-processing algorithm, or evaluation (e.g., metrics on explanation accuracy or user comprehension) are supplied to verify that bounded counterfactuals preserve certified safety parameters in operation.

minor comments (1)

[Abstract and framework description] The abstract and framework description would benefit from explicit pseudocode or a diagram showing how a user query is mapped to the decision trace and how the safety-parameter bound is enforced during counterfactual evaluation.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the constructive review and for recognizing the potential of grounding explanations in constraint-based decision traces. We address the single major comment below and describe the revisions we will make.

read point-by-point responses

Referee: Instantiation in construction robotics scenario: the central claim that the framework clarifies safety interventions and supports coordinated task recovery rests on a single illustrative trace. No implementation details, query-processing algorithm, or evaluation (e.g., metrics on explanation accuracy or user comprehension) are supplied to verify that bounded counterfactuals preserve certified safety parameters in operation.

Authors: We agree that the manuscript presents the framework at a conceptual level and relies on a single structured operational trace to illustrate its use in a construction robotics scenario. This trace demonstrates how causal, contrastive, and counterfactual queries can be answered from the recorded decision trace while keeping counterfactuals within fixed, certified safety parameters, but it does not constitute a full software implementation, does not supply query-processing algorithms, and contains no quantitative evaluation of explanation accuracy or user comprehension. The paper's contribution is framed as a design perspective that couples explanation generation directly to the same constraint representations used for control; the trace is provided to make the abstract mechanism concrete rather than to serve as empirical validation. In the revised manuscript we will add pseudocode for the query-processing and dialogue-generation steps, a more formal description of the bounding mechanism that keeps counterfactual reasoning inside certified safety limits, and an explicit discussion of evaluation metrics and study designs that would be appropriate for future implementation work. These changes will directly address the concern while retaining the conceptual focus of the current submission. revision: yes

Circularity Check

0 steps flagged

No significant circularity in conceptual design framework

full rationale

The paper is a design proposal for a dialogue-based explanation framework in HRC that grounds explanations in recorded constraint traces and bounds counterfactual queries to fixed certified safety parameters. No equations, quantitative derivations, fitted parameters, or predictive models are present that could reduce to inputs by construction. The central claims are architectural (explanations derived directly from decision traces; bounded counterfactuals preserve guarantees) rather than derived results. No self-citation chains, uniqueness theorems, or ansatzes are invoked in a load-bearing manner. This is a self-contained conceptual contribution with no circular steps.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 1 invented entities

The central claim rests on the premise that safety constraints admit a representation from which causal, contrastive, and bounded counterfactual explanations can be mechanically extracted without compromising certified safety bounds.

axioms (1)

domain assumption Safety constraints and decision traces can be represented in a form that directly supports generation of intelligible causal, contrastive, and counterfactual explanations.
Invoked when the abstract states that explanations are derived directly from the recorded decision trace and that counterfactual reasoning remains bounded under fixed certified parameters.

invented entities (1)

Constraint-aware dialogue framework no independent evidence
purpose: To serve as an operational interface that makes safety decisions intelligible while preserving guarantees.
New framework introduced to couple explanation with safety evaluation; no independent evidence of effectiveness is supplied in the abstract.

pith-pipeline@v0.9.0 · 5512 in / 1285 out tokens · 36721 ms · 2026-05-10T18:49:05.855031+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

20 extracted references · 20 canonical work pages

[1]

System transparency in shared autonomy: A mini review,

V . Alonso and P. De La Puente, “System transparency in shared autonomy: A mini review,”Frontiers in neurorobotics, vol. 12, p. 83, 2018

work page 2018
[2]

Explainable robotics in human-robot interactions,

R. Setchi, M. B. Dehkordi, and J. S. Khan, “Explainable robotics in human-robot interactions,”Procedia Computer Science, vol. 176, pp. 3057–3066, 2020

work page 2020
[3]

Safe human–robot col- laboration for industrial settings: a survey,

W. Li, Y . Hu, Y . Zhou, and D. T. Pham, “Safe human–robot col- laboration for industrial settings: a survey,”Journal of Intelligent Manufacturing, vol. 35, no. 5, pp. 2235–2261, 2024

work page 2024
[4]

Sounding robots: Design and evaluation of auditory displays for unintentional human- robot interaction,

B. Orthmann, I. Leite, R. Bresin, and I. Torre, “Sounding robots: Design and evaluation of auditory displays for unintentional human- robot interaction,”ACM Transactions on Human-Robot Interaction, vol. 12, no. 4, pp. 1–26, 2023

work page 2023
[5]

The development and evaluation of robot light skin: A novel robot signalling system to improve communication in industrial human–robot collaboration,

G. Tang, P. Webb, and J. Thrower, “The development and evaluation of robot light skin: A novel robot signalling system to improve communication in industrial human–robot collaboration,”Robotics and Computer-Integrated Manufacturing, vol. 56, pp. 85–94, 2019

work page 2019
[6]

Bioluminescence-inspired human-robot interaction: Designing expressive lights that affect human’s willingness to interact with a robot,

S. Song and S. Yamada, “Bioluminescence-inspired human-robot interaction: Designing expressive lights that affect human’s willingness to interact with a robot,” inProceedings of the 2018 ACM/IEEE International Conference on Human-Robot Interaction, ser. HRI ’18. New York, NY , USA: Association for Computing Machinery, 2018, p. 224–232. [Online]. Available...

work page doi:10.1145/3171221 2018
[7]

Mixed reality representation of hazard zones while collaborating with a robot: sense of control over own safety,

A. San Martin, J. Kildal, and E. Lazkano, “Mixed reality representation of hazard zones while collaborating with a robot: sense of control over own safety,”Virtual Reality, vol. 29, no. 1, p. 43, 2025

work page 2025
[8]

The relevance of signal timing in human-robot collaborative manipulation,

F. Cini, T. Banfi, G. Ciuti, L. Craighero, and M. Controzzi, “The relevance of signal timing in human-robot collaborative manipulation,” Science Robotics, vol. 6, no. 58, p. eabg1308, 2021

work page 2021
[9]

Explainability for human-robot collaboration,

E. Yadollahi, M. Romeo, F. I. Dogan, W. Johal, M. De Graaf, S. Levy- Tzedek, and I. Leite, “Explainability for human-robot collaboration,” inCompanion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, 2024, pp. 1364–1366

work page 2024
[10]

Explanation in artificial intelligence: Insights from the social sciences,

T. Miller, “Explanation in artificial intelligence: Insights from the social sciences,”Artificial Intelligence, vol. 267, pp. 1–38, 2019

work page 2019
[11]

Improving robot controller transparency through autonomous policy explanation,

B. Hayes and J. A. Shah, “Improving robot controller transparency through autonomous policy explanation,” inProceedings of the 2017 ACM/IEEE international conference on human-robot interaction, 2017, pp. 303–312

work page 2017
[12]

Counterfactual explana- tions without opening the black box: Automated decisions and the gdpr,

S. Wachter, B. Mittelstadt, and C. Russell, “Counterfactual explana- tions without opening the black box: Automated decisions and the gdpr,”Harv. JL & Tech., vol. 31, p. 841, 2017

work page 2017
[13]

Decision control and explanations in human-ai collaboration: Improv- ing user perceptions and compliance,

M. Westphal, M. V ¨ossing, G. Satzger, G. B. Yom-Tov, and A. Rafaeli, “Decision control and explanations in human-ai collaboration: Improv- ing user perceptions and compliance,”Computers in Human Behavior, vol. 144, p. 107714, 2023

work page 2023
[14]

arXiv preprint arXiv:1701.08317 , year=

T. Chakraborti, S. Sreedharan, Y . Zhang, and S. Kambhampati, “Plan explanations as model reconciliation: Moving beyond explanation as soliloquy,”arXiv preprint arXiv:1701.08317, 2017

work page arXiv 2017
[15]

Why can’t you do that hal? explaining unsolvability of planning tasks

S. Sreedharan, S. Srivastava, D. E. Smith, and S. Kambhampati, “Why can’t you do that hal? explaining unsolvability of planning tasks.” in IJCAI, 2019, pp. 1422–1430

work page 2019
[16]

Colli- sion detection and safe reaction with the dlr-iii lightweight manipulator arm,

A. De Luca, A. Albu-Schaffer, S. Haddadin, and G. Hirzinger, “Colli- sion detection and safe reaction with the dlr-iii lightweight manipulator arm,” in2006 IEEE/RSJ international conference on intelligent robots and systems. IEEE, 2006, pp. 1623–1630

work page 2006
[17]

A meta-analysis of factors influencing the development of human-robot trust,

P. A. Hancock, D. R. Billings, K. E. Oleson, J. Y . Chen, E. De Visser, and R. Parasuraman, “A meta-analysis of factors influencing the development of human-robot trust,” 2011

work page 2011
[18]

Explanation through dialogue for rule-based reasoning ai systems,

Y . Xu, “Explanation through dialogue for rule-based reasoning ai systems,” Ph.D. dissertation, The University of Manchester (United Kingdom), 2024

work page 2024
[19]

Reactive and safety-aware path replanning for collaborative applications,

C. Tonola, M. Faroni, S. Abdolshah, M. Hamad, S. Haddadin, N. Pe- drocchi, and M. Beschi, “Reactive and safety-aware path replanning for collaborative applications,”IEEE Transactions on Automation Science and Engineering, 2025

work page 2025
[20]

Safety considerations in deployment of robotic systems–a systematic review,

A. D. Adesiji, S. E. Ibitoye, R. M. Mahamood, O. A. Olayemi, P. O. Omoniyi, T.-C. Jen, and E. T. Akinlabi, “Safety considerations in deployment of robotic systems–a systematic review,”Journal of field robotics, vol. 43, no. 1, pp. 5–33, 2026

work page 2026

[1] [1]

System transparency in shared autonomy: A mini review,

V . Alonso and P. De La Puente, “System transparency in shared autonomy: A mini review,”Frontiers in neurorobotics, vol. 12, p. 83, 2018

work page 2018

[2] [2]

Explainable robotics in human-robot interactions,

R. Setchi, M. B. Dehkordi, and J. S. Khan, “Explainable robotics in human-robot interactions,”Procedia Computer Science, vol. 176, pp. 3057–3066, 2020

work page 2020

[3] [3]

Safe human–robot col- laboration for industrial settings: a survey,

W. Li, Y . Hu, Y . Zhou, and D. T. Pham, “Safe human–robot col- laboration for industrial settings: a survey,”Journal of Intelligent Manufacturing, vol. 35, no. 5, pp. 2235–2261, 2024

work page 2024

[4] [4]

Sounding robots: Design and evaluation of auditory displays for unintentional human- robot interaction,

B. Orthmann, I. Leite, R. Bresin, and I. Torre, “Sounding robots: Design and evaluation of auditory displays for unintentional human- robot interaction,”ACM Transactions on Human-Robot Interaction, vol. 12, no. 4, pp. 1–26, 2023

work page 2023

[5] [5]

The development and evaluation of robot light skin: A novel robot signalling system to improve communication in industrial human–robot collaboration,

G. Tang, P. Webb, and J. Thrower, “The development and evaluation of robot light skin: A novel robot signalling system to improve communication in industrial human–robot collaboration,”Robotics and Computer-Integrated Manufacturing, vol. 56, pp. 85–94, 2019

work page 2019

[6] [6]

Bioluminescence-inspired human-robot interaction: Designing expressive lights that affect human’s willingness to interact with a robot,

S. Song and S. Yamada, “Bioluminescence-inspired human-robot interaction: Designing expressive lights that affect human’s willingness to interact with a robot,” inProceedings of the 2018 ACM/IEEE International Conference on Human-Robot Interaction, ser. HRI ’18. New York, NY , USA: Association for Computing Machinery, 2018, p. 224–232. [Online]. Available...

work page doi:10.1145/3171221 2018

[7] [7]

Mixed reality representation of hazard zones while collaborating with a robot: sense of control over own safety,

A. San Martin, J. Kildal, and E. Lazkano, “Mixed reality representation of hazard zones while collaborating with a robot: sense of control over own safety,”Virtual Reality, vol. 29, no. 1, p. 43, 2025

work page 2025

[8] [8]

The relevance of signal timing in human-robot collaborative manipulation,

F. Cini, T. Banfi, G. Ciuti, L. Craighero, and M. Controzzi, “The relevance of signal timing in human-robot collaborative manipulation,” Science Robotics, vol. 6, no. 58, p. eabg1308, 2021

work page 2021

[9] [9]

Explainability for human-robot collaboration,

E. Yadollahi, M. Romeo, F. I. Dogan, W. Johal, M. De Graaf, S. Levy- Tzedek, and I. Leite, “Explainability for human-robot collaboration,” inCompanion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, 2024, pp. 1364–1366

work page 2024

[10] [10]

Explanation in artificial intelligence: Insights from the social sciences,

T. Miller, “Explanation in artificial intelligence: Insights from the social sciences,”Artificial Intelligence, vol. 267, pp. 1–38, 2019

work page 2019

[11] [11]

Improving robot controller transparency through autonomous policy explanation,

B. Hayes and J. A. Shah, “Improving robot controller transparency through autonomous policy explanation,” inProceedings of the 2017 ACM/IEEE international conference on human-robot interaction, 2017, pp. 303–312

work page 2017

[12] [12]

Counterfactual explana- tions without opening the black box: Automated decisions and the gdpr,

S. Wachter, B. Mittelstadt, and C. Russell, “Counterfactual explana- tions without opening the black box: Automated decisions and the gdpr,”Harv. JL & Tech., vol. 31, p. 841, 2017

work page 2017

[13] [13]

Decision control and explanations in human-ai collaboration: Improv- ing user perceptions and compliance,

M. Westphal, M. V ¨ossing, G. Satzger, G. B. Yom-Tov, and A. Rafaeli, “Decision control and explanations in human-ai collaboration: Improv- ing user perceptions and compliance,”Computers in Human Behavior, vol. 144, p. 107714, 2023

work page 2023

[14] [14]

arXiv preprint arXiv:1701.08317 , year=

T. Chakraborti, S. Sreedharan, Y . Zhang, and S. Kambhampati, “Plan explanations as model reconciliation: Moving beyond explanation as soliloquy,”arXiv preprint arXiv:1701.08317, 2017

work page arXiv 2017

[15] [15]

Why can’t you do that hal? explaining unsolvability of planning tasks

S. Sreedharan, S. Srivastava, D. E. Smith, and S. Kambhampati, “Why can’t you do that hal? explaining unsolvability of planning tasks.” in IJCAI, 2019, pp. 1422–1430

work page 2019

[16] [16]

Colli- sion detection and safe reaction with the dlr-iii lightweight manipulator arm,

A. De Luca, A. Albu-Schaffer, S. Haddadin, and G. Hirzinger, “Colli- sion detection and safe reaction with the dlr-iii lightweight manipulator arm,” in2006 IEEE/RSJ international conference on intelligent robots and systems. IEEE, 2006, pp. 1623–1630

work page 2006

[17] [17]

A meta-analysis of factors influencing the development of human-robot trust,

P. A. Hancock, D. R. Billings, K. E. Oleson, J. Y . Chen, E. De Visser, and R. Parasuraman, “A meta-analysis of factors influencing the development of human-robot trust,” 2011

work page 2011

[18] [18]

Explanation through dialogue for rule-based reasoning ai systems,

Y . Xu, “Explanation through dialogue for rule-based reasoning ai systems,” Ph.D. dissertation, The University of Manchester (United Kingdom), 2024

work page 2024

[19] [19]

Reactive and safety-aware path replanning for collaborative applications,

C. Tonola, M. Faroni, S. Abdolshah, M. Hamad, S. Haddadin, N. Pe- drocchi, and M. Beschi, “Reactive and safety-aware path replanning for collaborative applications,”IEEE Transactions on Automation Science and Engineering, 2025

work page 2025

[20] [20]

Safety considerations in deployment of robotic systems–a systematic review,

A. D. Adesiji, S. E. Ibitoye, R. M. Mahamood, O. A. Olayemi, P. O. Omoniyi, T.-C. Jen, and E. T. Akinlabi, “Safety considerations in deployment of robotic systems–a systematic review,”Journal of field robotics, vol. 43, no. 1, pp. 5–33, 2026

work page 2026