Openfmnav: Towards open-set zero-shot object navigation via vision-language foundation models,

· 2024

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

browse 3 citing papers

representative citing papers

ReMemNav: A Rethinking and Memory-Augmented Framework for Zero-Shot Object Navigation

cs.RO · 2026-03-25 · conditional · novelty 6.0

ReMemNav improves zero-shot object navigation success and efficiency by integrating episodic memory and rethinking with VLMs, achieving SR/SPL gains of 1.7%/7.0% on HM3D v0.1, 18.2%/11.1% on HM3D v0.2, and 8.7%/7.9% on MP3D.

Memory Over Maps: 3D Object Localization Without Reconstruction

cs.RO · 2026-03-20 · unverdicted · novelty 6.0

A map-free localization method stores posed RGB-D keyframes, retrieves and re-ranks them with a VLM, then fuses sparse depth for on-demand 3D target estimates, matching reconstruction-based performance on navigation benchmarks with far lower build cost.

A Deployable Embodied Vision-Language Navigation System with Hierarchical Cognition and Context-Aware Exploration

cs.RO · 2026-04-23 · unverdicted · novelty 4.0 · 2 refs

Introduces a hierarchical VLN architecture with asynchronous layers, incremental memory graph, and WTRP-based exploration that improves success and efficiency on resource-constrained robots.

citing papers explorer

Showing 3 of 3 citing papers.

ReMemNav: A Rethinking and Memory-Augmented Framework for Zero-Shot Object Navigation cs.RO · 2026-03-25 · conditional · none · ref 16
ReMemNav improves zero-shot object navigation success and efficiency by integrating episodic memory and rethinking with VLMs, achieving SR/SPL gains of 1.7%/7.0% on HM3D v0.1, 18.2%/11.1% on HM3D v0.2, and 8.7%/7.9% on MP3D.
Memory Over Maps: 3D Object Localization Without Reconstruction cs.RO · 2026-03-20 · unverdicted · none · ref 43
A map-free localization method stores posed RGB-D keyframes, retrieves and re-ranks them with a VLM, then fuses sparse depth for on-demand 3D target estimates, matching reconstruction-based performance on navigation benchmarks with far lower build cost.
A Deployable Embodied Vision-Language Navigation System with Hierarchical Cognition and Context-Aware Exploration cs.RO · 2026-04-23 · unverdicted · none · ref 14 · 2 links
Introduces a hierarchical VLN architecture with asynchronous layers, incremental memory graph, and WTRP-based exploration that improves success and efficiency on resource-constrained robots.

Openfmnav: Towards open-set zero-shot object navigation via vision-language foundation models,

fields

years

verdicts

representative citing papers

citing papers explorer