Multimodal explainability module using vision-language models and heat maps enables robots to generate natural-language summaries of navigation observations, with n=30 user studies showing majority preference for real-time explanations and improved trust.
Vlm-social-nav: Socially aware robot navigation through scoring using vision-language models
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.RO 1years
2025 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Trust Through Transparency: Explainable Social Navigation for Autonomous Mobile Robots via Vision-Language Models
Multimodal explainability module using vision-language models and heat maps enables robots to generate natural-language summaries of navigation observations, with n=30 user studies showing majority preference for real-time explanations and improved trust.