Spa- tialbot: Precise spatial understanding with vision lan- guage models

Wenxiao Cai, Iaroslav Ponomarenko, Jianhao Yuan, Xiaoqi Li, Wankou Yang, Hao Dong, Bo Zhao · 2025

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

AssemLM: Spatial Reasoning Multimodal Large Language Models for Robotic Assembly

cs.RO · 2026-04-10 · unverdicted · novelty 6.0

AssemLM uses a specialized point cloud encoder inside a multimodal LLM to reach state-of-the-art 6D pose prediction for assembly tasks, backed by a new 900K-sample benchmark called AssemBench.

citing papers explorer

Showing 1 of 1 citing paper.

AssemLM: Spatial Reasoning Multimodal Large Language Models for Robotic Assembly cs.RO · 2026-04-10 · unverdicted · none · ref 5
AssemLM uses a specialized point cloud encoder inside a multimodal LLM to reach state-of-the-art 6D pose prediction for assembly tasks, backed by a new 900K-sample benchmark called AssemBench.

Spa- tialbot: Precise spatial understanding with vision lan- guage models

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer