On the generalization capacities of mllms for spatial intelligence.International Conference on Learning Representations, 2026

Gongjie Zhang, Wenhao Li, Quanhao Qian, Jiuniu Wang, Deli Zhao, Shijian Lu, Ran Xu · 2026

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Towards Camera-Robust 3D Localization: Equation-Anchored Tool-Use for MLLMs

cs.CV · 2026-05-19 · unverdicted · novelty 7.0

Proposes an equation-anchored tool-use method for MLLMs that writes the pinhole back-projection equation in Chain-of-Thought and substitutes retrieved camera intrinsics and depths to achieve robustness in 3D object detection and visual grounding under rescaled intrinsics.

citing papers explorer

Showing 1 of 1 citing paper.

Towards Camera-Robust 3D Localization: Equation-Anchored Tool-Use for MLLMs cs.CV · 2026-05-19 · unverdicted · none · ref 45
Proposes an equation-anchored tool-use method for MLLMs that writes the pinhole back-projection equation in Chain-of-Thought and substitutes retrieved camera intrinsics and depths to achieve robustness in 3D object detection and visual grounding under rescaled intrinsics.

On the generalization capacities of mllms for spatial intelligence.International Conference on Learning Representations, 2026

fields

years

verdicts

representative citing papers

citing papers explorer