HCSG combines geometric forecasting of human pose and trajectory with VLM-generated semantic descriptions of intentions, fused into a topological map with a social distance loss, yielding 14% higher success rate and 34% lower collision rate on the HA-VLNCE benchmark.
Habitat: A Platform for Embodied AI Research,
2 Pith papers cite this work. Polarity classification is still indexing.
years
2026 2verdicts
UNVERDICTED 2representative citing papers
DM³-Nav delivers decentralized multi-agent semantic navigation for multimodal open-vocabulary multi-object tasks that matches centralized baselines in simulation and succeeds in real-world robot deployments.
citing papers explorer
-
HCSG: Human-Centric Semantic-Geometric Reasoning for Vision-Language Navigation
HCSG combines geometric forecasting of human pose and trajectory with VLM-generated semantic descriptions of intentions, fused into a topological map with a social distance loss, yielding 14% higher success rate and 34% lower collision rate on the HA-VLNCE benchmark.
-
DM$^3$-Nav: Decentralized Multi-Agent Multimodal Multi-Object Semantic Navigation
DM³-Nav delivers decentralized multi-agent semantic navigation for multimodal open-vocabulary multi-object tasks that matches centralized baselines in simulation and succeeds in real-world robot deployments.