Spatialnav: Leveraging spatial scene graphs for zero-shot vision-and-language navigation

2026 · arXiv 2601.06806

4 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 1 · dataset 1

citation-polarity summary

background 1 · use dataset 1

representative citing papers

QuadAgent: A Responsive Agent System for Vision-Language Guided Quadrotor Agile Flight

cs.RO · 2026-04-03 · unverdicted · novelty 7.0

QuadAgent uses an asynchronous multi-agent architecture with an Impression Graph for scene memory and vision-based avoidance to enable training-free vision-language guided agile quadrotor flight, outperforming baselines in simulations and achieving real-world speeds up to 5 m/s.

SEDualVLN: A Spatially-Enhanced Dual-System for Vision-Language Navigation

cs.RO · 2026-05-17 · unverdicted · novelty 6.0

SEDualVLN proposes a spatially-enhanced dual-system VLN framework that pairs a fast VLM action generator with a slow MLLM waypoint planner and reports state-of-the-art results on VLN-CE benchmarks.

SpaAct: Spatially-Activated Transition Learning with Curriculum Adaptation for Vision-Language Navigation

cs.CV · 2026-04-30 · unverdicted · novelty 6.0

SpaAct activates spatial awareness in VLMs using action retrospection, future frame prediction, and progressive curriculum learning to reach SOTA on VLN-CE benchmarks.

Explore Like Humans: Autonomous Exploration with Online SG-Memo Construction for Embodied Agents

cs.CV · 2026-04-21 · unverdicted · novelty 5.0

ABot-Explorer unifies online exploration and hierarchical semantic memory construction via VLM-distilled navigational affordances for improved embodied navigation efficiency.

citing papers explorer

Showing 4 of 4 citing papers.

QuadAgent: A Responsive Agent System for Vision-Language Guided Quadrotor Agile Flight
cs.RO · 2026-04-03 · unverdicted · none · ref 28

SEDualVLN: A Spatially-Enhanced Dual-System for Vision-Language Navigation
cs.RO · 2026-05-17 · unverdicted · none · ref 20

SpaAct: Spatially-Activated Transition Learning with Curriculum Adaptation for Vision-Language Navigation
cs.CV · 2026-04-30 · unverdicted · none · ref 82

Explore Like Humans: Autonomous Exploration with Online SG-Memo Construction for Embodied Agents
cs.CV · 2026-04-21 · unverdicted · none · ref 1
