ConsistNav is a new training-free framework that uses a semantic executive controller, persistent candidate memory, and stability-aware action control to close the action consistency gap in zero-shot object navigation, reporting SOTA results on HM3D and MP3D with 11.4% SR and 7.9% SPL gains on MP3D.
Advances in Neural Information Processing Systems (NeurIPS) , year =
2 Pith papers cite this work. Polarity classification is still indexing.
verdicts
UNVERDICTED 2representative citing papers
The paper defines Agent AI as interactive multimodal systems that perceive grounded data and generate embodied actions, arguing this approach can mitigate hallucinations in foundation models.
citing papers explorer
-
ConsistNav: Closing the Action Consistency Gap in Zero-Shot Object Navigation with Semantic Executive Control
ConsistNav is a new training-free framework that uses a semantic executive controller, persistent candidate memory, and stability-aware action control to close the action consistency gap in zero-shot object navigation, reporting SOTA results on HM3D and MP3D with 11.4% SR and 7.9% SPL gains on MP3D.
-
Agent AI: Surveying the Horizons of Multimodal Interaction
The paper defines Agent AI as interactive multimodal systems that perceive grounded data and generate embodied actions, arguing this approach can mitigate hallucinations in foundation models.