For modality ablation, we train and evaluate the model with one modality removed at a time, and report the results in the top part of Table V

Ablation Studies:To validate the contributions of different input modalities, architectural components, we conduct both modality, module ablations

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

AssemLM: Spatial Reasoning Multimodal Large Language Models for Robotic Assembly

cs.RO · 2026-04-10 · unverdicted · novelty 6.0

AssemLM uses a specialized point cloud encoder inside a multimodal LLM to reach state-of-the-art 6D pose prediction for assembly tasks, backed by a new 900K-sample benchmark called AssemBench.

citing papers explorer

Showing 1 of 1 citing paper.

AssemLM: Spatial Reasoning Multimodal Large Language Models for Robotic Assembly cs.RO · 2026-04-10 · unverdicted · none · ref 62
AssemLM uses a specialized point cloud encoder inside a multimodal LLM to reach state-of-the-art 6D pose prediction for assembly tasks, backed by a new 900K-sample benchmark called AssemBench.

For modality ablation, we train and evaluate the model with one modality removed at a time, and report the results in the top part of Table V

fields

years

verdicts

representative citing papers

citing papers explorer