IEEE Robotics and Automation Letters 9(10), 8921–8928 (2024)

Maggio, D · 2024 · arXiv 2024.345139

4 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Think While You Map: Asynchronous Vision-Language Agents for Incremental 3D Scene Graphs

cs.CV · 2026-06-30 · unverdicted · novelty 7.0

An asynchronous architecture decouples incremental voxel-based mapping from VLM-based semantic enrichment to produce queryable open-vocabulary 3D scene graphs that match or exceed prior methods on segmentation and grounding benchmarks.

FARM: Find Anything using Relational Spatial Memory

cs.RO · 2026-06-13 · unverdicted · novelty 7.0

FARM creates an open-vocabulary relational spatial memory that improves object retrieval recall by 164-224% over prior methods on 44k language queries across 67 scenes while running at 5-10 Hz.

SemanticXR: Low Power and Real-time Queryable Semantic Mapping with an Object-Level Device-Cloud Architecture

cs.DC · 2026-06-11 · unverdicted · novelty 7.0

SemanticXR introduces the first device-cloud system for real-time open-vocabulary semantic mapping and querying that organizes work around semantically identifiable objects to meet XR power, bandwidth, and memory limits.

From Pixels to Concepts: Growing Rich 3D Semantic Scene Graph Forests utilizing Foundation Models

cs.RO · 2026-06-22 · unverdicted · novelty 6.0

Uses VLMs to detect instance concepts and LLMs to infer abstract relationships, assembling them into 3D scene graph forests that are evaluated on uHumans2 and ScanNet and tested in open-vocabulary retrieval on a Spot robot.

citing papers explorer

FARM: Find Anything using Relational Spatial Memory

cs.RO · 2026-06-13 · unverdicted · none · ref 21

From Pixels to Concepts: Growing Rich 3D Semantic Scene Graph Forests utilizing Foundation Models

cs.RO · 2026-06-22 · unverdicted · none · ref 3
