Trust Through Transparency: Explainable Social Navigation for Autonomous Mobile Robots via Vision-Language Models

Aliasghar Arab; Devika Kodi; Oluwadamilola Sotomi

arxiv: 2504.05477 · v1 · submitted 2025-04-07 · 💻 cs.RO

Trust Through Transparency: Explainable Social Navigation for Autonomous Mobile Robots via Vision-Language Models

Oluwadamilola Sotomi , Devika Kodi , Aliasghar Arab This is my paper

Pith reviewed 2026-05-22 20:21 UTC · model grok-4.3

classification 💻 cs.RO

keywords explainable navigationvision-language modelssocial roboticsrobot transparencyuser trustmultimodal explanationautonomous mobile robots

0 comments

The pith

A multimodal module lets robots generate natural language explanations for navigation choices using vision-language models and heat maps.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops a system for service robots that combines vision-language models with visual heat maps so the robot can describe its observations and decisions in plain language during movement through shared spaces. This setup aims to make the robot's behavior more transparent to nearby humans. Controlled tests with thirty participants found that most favored receiving these real-time explanations, which the authors link to greater reported trust and comprehension. The work validates the outputs against human judgments using confusion-matrix checks and presents the approach as effective for social navigation tasks.

Core claim

The paper claims that integrating vision-language models with heat maps produces a multimodal explainability module that lets robots perceive their surroundings, analyze them, and output natural-language summaries of their navigation decisions, resulting in measurable user preference for the explanations and improved agreement with human expectations as shown by study data and matrix analysis.

What carries the argument

The multimodal explainability module that fuses vision-language model outputs with heat maps to create natural language summaries of the robot's observations and path choices.

If this is right

Robots equipped with the module can produce real-time natural language explanations while moving in dynamic environments shared with people.
Majority preference for explanations appears in user data and correlates with higher reported trust and understanding.
Confusion-matrix validation confirms that the generated explanations align with human expectations of the robot's behavior.
The overall results indicate that adding such explainability increases interpretability of autonomous navigation.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

If the explanations remain accurate outside the lab, the module could reduce misunderstandings that lead to avoidance or conflict in crowded areas.
Long-term field deployments would need separate checks to confirm whether initial study preferences persist after repeated exposure.
The same module structure might be tested with additional sensor inputs to handle edge cases the current vision pipeline misses.

Load-bearing premise

The vision-language model outputs are assumed to faithfully represent the robot's actual decision process and that user preference shown in a controlled study will translate to sustained trust during real-world social navigation.

What would settle it

A follow-up trial in which participants navigate alongside the robot in an uncontrolled public space and report trust levels when explanations are withheld or deliberately mismatched to the actual path taken.

Figures

Figures reproduced from arXiv: 2504.05477 by Aliasghar Arab, Devika Kodi, Oluwadamilola Sotomi.

**Figure 2.** Figure 2: A diagram showing the relationship between the nodes that make up the explainability module. [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗

**Figure 3.** Figure 3: Test 1: User survey results [PITH_FULL_IMAGE:figures/full_fig_p006_3.png] view at source ↗

**Figure 4.** Figure 4: Test 2: User survey results. Predicted Positive Predicted Negative Actual Positive TP: 82 FN: 15 Actual Negative FP: 20 TN: 79 TABLE III CONFUSION MATRIX SHOWING PERFORMANCE OF THE EXPLAINABILITY MODULE. TRUE POSITIVE (TP), FALSE POSITIVE (FP), FALSE NEGATIVE (FN), TRUE NEGATIVE (TN). improves performance and social acceptance in collaborative environments between humans and robots. The survey results and… view at source ↗

read the original abstract

Service and assistive robots are increasingly being deployed in dynamic social environments; however, ensuring transparent and explainable interactions remains a significant challenge. This paper presents a multimodal explainability module that integrates vision language models and heat maps to improve transparency during navigation. The proposed system enables robots to perceive, analyze, and articulate their observations through natural language summaries. User studies (n=30) showed a preference of majority for real-time explanations, indicating improved trust and understanding. Our experiments were validated through confusion matrix analysis to assess the level of agreement with human expectations. Our experimental and simulation results emphasize the effectiveness of explainability in autonomous navigation, enhancing trust and interpretability.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper gives a concrete way to add real-time natural-language explanations to robot social navigation using VLMs plus heat maps, backed by a small user study, but the confusion-matrix validation is described too vaguely to carry much weight.

read the letter

The core contribution is a working module that combines a vision-language model with heat maps so a mobile robot can describe what it sees and why it is moving a certain way around people. They report that in a study with 30 participants most preferred the explanations and felt they improved understanding and trust. That is a straightforward, applied step forward for service-robot transparency.

Referee Report

1 major / 1 minor

Summary. The paper proposes a multimodal explainability module integrating vision-language models and heat maps to enable autonomous mobile robots to perceive, analyze, and articulate observations via natural language summaries during social navigation. It reports a user study (n=30) in which a majority preferred real-time explanations, interpreted as evidence of improved trust and understanding, and states that experiments were validated via confusion matrix analysis measuring agreement with human expectations. The work concludes that explainability enhances trust and interpretability in autonomous navigation.

Significance. If the central claims are supported by detailed evidence, the integration of VLMs for real-time natural-language explanations in social navigation could meaningfully advance transparency in human-robot interaction, a key barrier to deploying service robots. The empirical user-study component provides a direct test of user preference, which is a strength when properly documented. However, the current presentation leaves the validation approach underspecified, limiting the ability to evaluate whether the reported preference translates to genuine fidelity with the robot's decision process.

major comments (1)

[Abstract] Abstract: the statement that 'experiments were validated through confusion matrix analysis to assess the level of agreement with human expectations' supplies no information on the matrix categories, the mapping from VLM text outputs to discrete classes, the collection of ground-truth labels, or the specific navigation actions or explanation qualities being classified. This analysis is presented as corroborating the user-study preference result and therefore underpins the central claim of improved understanding and trust; without these details the evidential support cannot be assessed.

minor comments (1)

The abstract refers to 'experimental and simulation results' without identifying the simulation environment, quantitative performance metrics, or baseline comparisons used to demonstrate effectiveness.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the constructive feedback on our manuscript. The point raised about the underspecification of the confusion matrix analysis is valid, and we will revise the abstract and relevant sections to provide the necessary details while preserving the core contributions of the work.

read point-by-point responses

Referee: [Abstract] Abstract: the statement that 'experiments were validated through confusion matrix analysis to assess the level of agreement with human expectations' supplies no information on the matrix categories, the mapping from VLM text outputs to discrete classes, the collection of ground-truth labels, or the specific navigation actions or explanation qualities being classified. This analysis is presented as corroborating the user-study preference result and therefore underpins the central claim of improved understanding and trust; without these details the evidential support cannot be assessed.

Authors: We agree that the abstract statement is too brief and does not supply the requested specifics, which limits evaluation of how the confusion matrix supports the claims of improved understanding and trust. In the revised manuscript we will expand the abstract to concisely describe the matrix categories, the procedure for mapping VLM text outputs to discrete classes, the collection of ground-truth labels, and the navigation actions and explanation qualities evaluated. Corresponding clarifications will also be added to the experimental validation section to make the full methodology transparent and to strengthen the link to the user-study results. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical validation relies on external human data

full rationale

The paper describes an empirical system for explainable robot navigation using VLMs and heat maps, with central claims resting on a user study (n=30) measuring preference for real-time explanations and a confusion matrix assessing agreement with human expectations. No equations, parameter fitting, self-citations, or derivation steps appear in the abstract or described content. The validation is framed against independent external human judgments rather than reducing to the system's own outputs by construction, satisfying the criteria for a self-contained empirical result with no load-bearing circular steps.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The work relies on the domain assumption that current vision-language models can reliably interpret social scenes and generate faithful explanations; no free parameters or new invented entities are introduced in the abstract.

axioms (1)

domain assumption Vision-language models can accurately perceive and describe social navigation scenarios in a way that matches human expectations.
The entire explainability module depends on this capability of the underlying VLM.

pith-pipeline@v0.9.0 · 5641 in / 1247 out tokens · 73003 ms · 2026-05-22T20:21:07.408871+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Foundation/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

multimodal explainability module that integrates vision language models and heat maps... confusion matrix analysis to assess the level of agreement with human expectations

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

21 extracted references · 21 canonical work pages

[1]

Efficient human-robot collaboration: when should a robot take initiative? The International Journal of Robotics Research , 36(5-7):563–579, 2017

Jimmy Baraglia, Maya Cakmak, Yukie Nagai, Rajesh PN Rao, and Minoru Asada. Efficient human-robot collaboration: when should a robot take initiative? The International Journal of Robotics Research , 36(5-7):563–579, 2017

work page 2017
[2]

Explainable autonomous mobile robots: Interface and socially aware learning

Kiruthiga C Shekar, Pranav Doma, Chinmay Prashanth, Vikram Subra- maniam, and Aliasghar Arab. Explainable autonomous mobile robots: Interface and socially aware learning. Authorea Preprints, 2024

work page 2024
[3]

Trust in automation: Designing for appropriate reliance

John D Lee and Katrina A See. Trust in automation: Designing for appropriate reliance. Human factors, 46(1):50–80, 2004

work page 2004
[4]

Opening up to social robots: how emotions drive self-disclosure behavior

Guy Laban, Arvid Kappas, Val Morrison, and Emily S Cross. Opening up to social robots: how emotions drive self-disclosure behavior. In 2023 32nd IEEE International Conference on Robot and Human In- teractive Communication (RO-MAN) , pages 1697–1704. IEEE, 2023

work page 2023
[5]

The situation awareness frame- work for explainable ai (safe-ai) and human factors considerations for xai systems

Lindsay Sanneman and Julie A Shah. The situation awareness frame- work for explainable ai (safe-ai) and human factors considerations for xai systems. International Journal of Human–Computer Interaction , 38(18-20):1772–1788, 2022

work page 2022
[6]

Enhancing robot explanation capabilities through vision-language models: a preliminary study by interpreting visual inputs for improved human-robot interaction

David Sobr ´ın-Hidalgo, Miguel ´Angel Gonz ´alez-Santamarta, ´Angel Manuel Guerrero-Higueras, Francisco Javier Rodr ´ıguez- Lera, and Vicente Matell ´an-Olivera. Enhancing robot explanation capabilities through vision-language models: a preliminary study by interpreting visual inputs for improved human-robot interaction. arXiv preprint arXiv:2404.09705 , 2024

work page arXiv 2024
[7]

Explain- able ai for robot failures: Generating explanations that improve user assistance in fault recovery

Devleena Das, Siddhartha Banerjee, and Sonia Chernova. Explain- able ai for robot failures: Generating explanations that improve user assistance in fault recovery. In Proceedings of the 2021 ACM/IEEE international conference on human-robot interaction , pages 351–360, 2021

work page 2021
[8]

A surrogate model framework for explainable autonomous behaviour

Konstantinos Gavriilidis, Andrea Munafo, Wei Pang, and Helen Hastie. A surrogate model framework for explainable autonomous behaviour. arXiv preprint arXiv:2305.19724 , 2023

work page arXiv 2023
[9]

Explainable reinforcement learning via model transforms

Mira Finkelstein, Lucy Liu, Yoav Kolumbus, David C Parkes, Jeffrey S Rosenschein, Sarah Keren, et al. Explainable reinforcement learning via model transforms. Advances in Neural Information Processing Systems, 35:34039–34051, 2022

work page 2022
[10]

Towards explain- able ai: Interpretable models for complex decision-making

Jaibir Singh, Suman Rani, and Garaga Srilakshmi. Towards explain- able ai: Interpretable models for complex decision-making. In 2024 International Conference on Knowledge Engineering and Communi- cation Systems (ICKECS) , volume 1, pages 1–5. IEEE, 2024

work page 2024
[11]

Evaluating human-like explanations for robot actions in re- inforcement learning scenarios

Francisco Cruz, Charlotte Young, Richard Dazeley, and Peter Vam- plew. Evaluating human-like explanations for robot actions in re- inforcement learning scenarios. In 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 894–901. IEEE, 2022

work page 2022
[12]

Talkwithmachines: Enhancing human-robot interaction through large/vision language models

Ammar N Abbas and Csaba Beleznai. Talkwithmachines: Enhancing human-robot interaction through large/vision language models. In 2024 Eighth IEEE International Conference on Robotic Computing (IRC), pages 253–258. IEEE, 2024

work page 2024
[13]

Social-llava: Enhancing robot navigation through human-language reasoning in social spaces,

Amirreza Payandeh, Daeun Song, Mohammad Nazeri, Jing Liang, Praneel Mukherjee, Amir Hossain Raj, Yangzhe Kong, Dinesh Manocha, and Xuesu Xiao. Social-llava: Enhancing robot navigation through human-language reasoning in social spaces. arXiv preprint arXiv:2501.09024, 2024

work page arXiv 2024
[14]

Motion planning method for car-like autonomous mobile robots in dynamic obstacle environments

Zhiwei Wang, Peiqing Li, Qipeng Li, Zhongshan Wang, and Zhuoran Li. Motion planning method for car-like autonomous mobile robots in dynamic obstacle environments. IEEE Access, 11:137387–137400, 2023

work page 2023
[15]

Comparison and improvement of local planners on ros for narrow passages

Huajun Yuan, Hanlin Li, Yuhan Zhang, Shuang Du, Limin Yu, and Xinheng Wang. Comparison and improvement of local planners on ros for narrow passages. In 2022 International Conference on High Performance Big Data and Intelligent Systems (HDIS), pages 125–130. IEEE, 2022

work page 2022
[16]

Vlm-social-nav: Socially aware robot navigation through scoring using vision-language models

Daeun Song, Jing Liang, Amirreza Payandeh, Amir Hossain Raj, Xuesu Xiao, and Dinesh Manocha. Vlm-social-nav: Socially aware robot navigation through scoring using vision-language models. IEEE Robotics and Automation Letters , 2024

work page 2024
[17]

Grad-cam: visual explanations from deep networks via gradient-based localization

Ramprasaath R Selvaraju, Michael Cogswell, Abhishek Das, Ramakr- ishna Vedantam, Devi Parikh, and Dhruv Batra. Grad-cam: visual explanations from deep networks via gradient-based localization. In- ternational journal of computer vision , 128:336–359, 2020

work page 2020
[18]

Vlfm: Vision-language frontier maps for zero- shot semantic navigation

Naoki Yokoyama, Sehoon Ha, Dhruv Batra, Jiuguang Wang, and Bernadette Bucher. Vlfm: Vision-language frontier maps for zero- shot semantic navigation. In 2024 IEEE International Conference on Robotics and Automation (ICRA) , pages 42–48. IEEE, 2024

work page 2024
[19]

Blip: Bootstrapping language-image pre-training for unified vision-language understanding and generation

Junnan Li, Dongxu Li, Caiming Xiong, and Steven Hoi. Blip: Bootstrapping language-image pre-training for unified vision-language understanding and generation. In International conference on machine learning, pages 12888–12900. PMLR, 2022

work page 2022
[20]

Autonomous navigation assurance with explainable ai and security monitoring

Denzel Hamilton, Kevin Kornegay, and Lanier Watkins. Autonomous navigation assurance with explainable ai and security monitoring. In 2020 IEEE Applied Imagery Pattern Recognition Workshop (AIPR) , pages 1–7. IEEE, 2020

work page 2020
[21]

Safe predictive control of four-wheel mobile robot with independent steering and drive

Aliasghar Arab, Ilija Had ˇzi´c, and Jingang Yi. Safe predictive control of four-wheel mobile robot with independent steering and drive. In 2021 American Control Conference (ACC) , pages 2962–2967. IEEE, 2021. APPENDIX Explainability Architecture in ROS Camera Node: The Camera Node captures images on demand, saving and publishing them to /camera/imageRaw ...

work page 2021

[1] [1]

Efficient human-robot collaboration: when should a robot take initiative? The International Journal of Robotics Research , 36(5-7):563–579, 2017

Jimmy Baraglia, Maya Cakmak, Yukie Nagai, Rajesh PN Rao, and Minoru Asada. Efficient human-robot collaboration: when should a robot take initiative? The International Journal of Robotics Research , 36(5-7):563–579, 2017

work page 2017

[2] [2]

Explainable autonomous mobile robots: Interface and socially aware learning

Kiruthiga C Shekar, Pranav Doma, Chinmay Prashanth, Vikram Subra- maniam, and Aliasghar Arab. Explainable autonomous mobile robots: Interface and socially aware learning. Authorea Preprints, 2024

work page 2024

[3] [3]

Trust in automation: Designing for appropriate reliance

John D Lee and Katrina A See. Trust in automation: Designing for appropriate reliance. Human factors, 46(1):50–80, 2004

work page 2004

[4] [4]

Opening up to social robots: how emotions drive self-disclosure behavior

Guy Laban, Arvid Kappas, Val Morrison, and Emily S Cross. Opening up to social robots: how emotions drive self-disclosure behavior. In 2023 32nd IEEE International Conference on Robot and Human In- teractive Communication (RO-MAN) , pages 1697–1704. IEEE, 2023

work page 2023

[5] [5]

The situation awareness frame- work for explainable ai (safe-ai) and human factors considerations for xai systems

Lindsay Sanneman and Julie A Shah. The situation awareness frame- work for explainable ai (safe-ai) and human factors considerations for xai systems. International Journal of Human–Computer Interaction , 38(18-20):1772–1788, 2022

work page 2022

[6] [6]

Enhancing robot explanation capabilities through vision-language models: a preliminary study by interpreting visual inputs for improved human-robot interaction

David Sobr ´ın-Hidalgo, Miguel ´Angel Gonz ´alez-Santamarta, ´Angel Manuel Guerrero-Higueras, Francisco Javier Rodr ´ıguez- Lera, and Vicente Matell ´an-Olivera. Enhancing robot explanation capabilities through vision-language models: a preliminary study by interpreting visual inputs for improved human-robot interaction. arXiv preprint arXiv:2404.09705 , 2024

work page arXiv 2024

[7] [7]

Explain- able ai for robot failures: Generating explanations that improve user assistance in fault recovery

Devleena Das, Siddhartha Banerjee, and Sonia Chernova. Explain- able ai for robot failures: Generating explanations that improve user assistance in fault recovery. In Proceedings of the 2021 ACM/IEEE international conference on human-robot interaction , pages 351–360, 2021

work page 2021

[8] [8]

A surrogate model framework for explainable autonomous behaviour

Konstantinos Gavriilidis, Andrea Munafo, Wei Pang, and Helen Hastie. A surrogate model framework for explainable autonomous behaviour. arXiv preprint arXiv:2305.19724 , 2023

work page arXiv 2023

[9] [9]

Explainable reinforcement learning via model transforms

Mira Finkelstein, Lucy Liu, Yoav Kolumbus, David C Parkes, Jeffrey S Rosenschein, Sarah Keren, et al. Explainable reinforcement learning via model transforms. Advances in Neural Information Processing Systems, 35:34039–34051, 2022

work page 2022

[10] [10]

Towards explain- able ai: Interpretable models for complex decision-making

Jaibir Singh, Suman Rani, and Garaga Srilakshmi. Towards explain- able ai: Interpretable models for complex decision-making. In 2024 International Conference on Knowledge Engineering and Communi- cation Systems (ICKECS) , volume 1, pages 1–5. IEEE, 2024

work page 2024

[11] [11]

Evaluating human-like explanations for robot actions in re- inforcement learning scenarios

Francisco Cruz, Charlotte Young, Richard Dazeley, and Peter Vam- plew. Evaluating human-like explanations for robot actions in re- inforcement learning scenarios. In 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 894–901. IEEE, 2022

work page 2022

[12] [12]

Talkwithmachines: Enhancing human-robot interaction through large/vision language models

Ammar N Abbas and Csaba Beleznai. Talkwithmachines: Enhancing human-robot interaction through large/vision language models. In 2024 Eighth IEEE International Conference on Robotic Computing (IRC), pages 253–258. IEEE, 2024

work page 2024

[13] [13]

Social-llava: Enhancing robot navigation through human-language reasoning in social spaces,

Amirreza Payandeh, Daeun Song, Mohammad Nazeri, Jing Liang, Praneel Mukherjee, Amir Hossain Raj, Yangzhe Kong, Dinesh Manocha, and Xuesu Xiao. Social-llava: Enhancing robot navigation through human-language reasoning in social spaces. arXiv preprint arXiv:2501.09024, 2024

work page arXiv 2024

[14] [14]

Motion planning method for car-like autonomous mobile robots in dynamic obstacle environments

Zhiwei Wang, Peiqing Li, Qipeng Li, Zhongshan Wang, and Zhuoran Li. Motion planning method for car-like autonomous mobile robots in dynamic obstacle environments. IEEE Access, 11:137387–137400, 2023

work page 2023

[15] [15]

Comparison and improvement of local planners on ros for narrow passages

Huajun Yuan, Hanlin Li, Yuhan Zhang, Shuang Du, Limin Yu, and Xinheng Wang. Comparison and improvement of local planners on ros for narrow passages. In 2022 International Conference on High Performance Big Data and Intelligent Systems (HDIS), pages 125–130. IEEE, 2022

work page 2022

[16] [16]

Vlm-social-nav: Socially aware robot navigation through scoring using vision-language models

Daeun Song, Jing Liang, Amirreza Payandeh, Amir Hossain Raj, Xuesu Xiao, and Dinesh Manocha. Vlm-social-nav: Socially aware robot navigation through scoring using vision-language models. IEEE Robotics and Automation Letters , 2024

work page 2024

[17] [17]

Grad-cam: visual explanations from deep networks via gradient-based localization

Ramprasaath R Selvaraju, Michael Cogswell, Abhishek Das, Ramakr- ishna Vedantam, Devi Parikh, and Dhruv Batra. Grad-cam: visual explanations from deep networks via gradient-based localization. In- ternational journal of computer vision , 128:336–359, 2020

work page 2020

[18] [18]

Vlfm: Vision-language frontier maps for zero- shot semantic navigation

Naoki Yokoyama, Sehoon Ha, Dhruv Batra, Jiuguang Wang, and Bernadette Bucher. Vlfm: Vision-language frontier maps for zero- shot semantic navigation. In 2024 IEEE International Conference on Robotics and Automation (ICRA) , pages 42–48. IEEE, 2024

work page 2024

[19] [19]

Blip: Bootstrapping language-image pre-training for unified vision-language understanding and generation

Junnan Li, Dongxu Li, Caiming Xiong, and Steven Hoi. Blip: Bootstrapping language-image pre-training for unified vision-language understanding and generation. In International conference on machine learning, pages 12888–12900. PMLR, 2022

work page 2022

[20] [20]

Autonomous navigation assurance with explainable ai and security monitoring

Denzel Hamilton, Kevin Kornegay, and Lanier Watkins. Autonomous navigation assurance with explainable ai and security monitoring. In 2020 IEEE Applied Imagery Pattern Recognition Workshop (AIPR) , pages 1–7. IEEE, 2020

work page 2020

[21] [21]

Safe predictive control of four-wheel mobile robot with independent steering and drive

Aliasghar Arab, Ilija Had ˇzi´c, and Jingang Yi. Safe predictive control of four-wheel mobile robot with independent steering and drive. In 2021 American Control Conference (ACC) , pages 2962–2967. IEEE, 2021. APPENDIX Explainability Architecture in ROS Camera Node: The Camera Node captures images on demand, saving and publishing them to /camera/imageRaw ...

work page 2021