P3nav: End-to-end perception, prediction and planning for vision-and-language navigation

· 2026 · arXiv 2603.17459

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

read on arXiv browse 3 citing papers

citation-role summary

baseline 1

citation-polarity summary

baseline 1

representative citing papers

HCSG: Human-Centric Semantic-Geometric Reasoning for Vision-Language Navigation

cs.RO · 2026-05-13 · unverdicted · novelty 7.0

HCSG combines geometric forecasting of human pose and trajectory with VLM-generated semantic descriptions of intentions, fused into a topological map with a social distance loss, yielding 14% higher success rate and 34% lower collision rate on the HA-VLNCE benchmark.

SEDualVLN: A Spatially-Enhanced Dual-System for Vision-Language Navigation

cs.RO · 2026-05-17 · unverdicted · novelty 5.0 · 2 refs

SEDualVLN introduces a spatially-enhanced dual-system VLN architecture that achieves state-of-the-art results on VLN-CE benchmarks through coordinated VLM action generation and MLLM waypoint planning.

What Limits Vision-and-Language Navigation ?

cs.RO · 2026-05-13 · unverdicted · novelty 5.0

StereoNav reaches new benchmark highs on R2R-CE and RxR-CE and improves real-robot reliability by supplying persistent target-location priors and stereo-derived geometry that stay stable under lighting changes and blur.

citing papers explorer

Showing 3 of 3 citing papers after filters.

HCSG: Human-Centric Semantic-Geometric Reasoning for Vision-Language Navigation cs.RO · 2026-05-13 · unverdicted · none · ref 24
HCSG combines geometric forecasting of human pose and trajectory with VLM-generated semantic descriptions of intentions, fused into a topological map with a social distance loss, yielding 14% higher success rate and 34% lower collision rate on the HA-VLNCE benchmark.
SEDualVLN: A Spatially-Enhanced Dual-System for Vision-Language Navigation cs.RO · 2026-05-17 · unverdicted · none · ref 15 · 2 links
SEDualVLN introduces a spatially-enhanced dual-system VLN architecture that achieves state-of-the-art results on VLN-CE benchmarks through coordinated VLM action generation and MLLM waypoint planning.
What Limits Vision-and-Language Navigation ? cs.RO · 2026-05-13 · unverdicted · none · ref 45
StereoNav reaches new benchmark highs on R2R-CE and RxR-CE and improves real-robot reliability by supplying persistent target-location priors and stereo-derived geometry that stay stable under lighting changes and blur.

P3nav: End-to-end perception, prediction and planning for vision-and-language navigation

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer