Thinking with spatial code for physical-world video reasoning.arXiv preprint arXiv:2603.05591,

Jieneng Chen, Wenxin Ma, Ruisheng Yuan, Yunzhi Zhang, Jiajun Wu, Alan Yuille · 2026 · arXiv 2603.05591

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Ouroboros-Spatial: Closing the Data-Model Loop for Spatial Reasoning

cs.CV · 2026-06-10 · unverdicted · novelty 7.0

A closed-loop self-evolving training system for spatial reasoning in MLLMs that iteratively generates QA pairs matched to the model's current capabilities via confidence feedback, achieving gains with an order of magnitude less data.

OneCanvas: 3D Scene Understanding via Panoramic Reprojection

cs.CV · 2026-06-17 · unverdicted · novelty 6.0

OneCanvas aggregates multi-view 3D patches onto one panoramic canvas with continuous angular placement and 3D embeddings, enabling pretrained VLMs to achieve SOTA on SQA3D and VSI-Bench with an order of magnitude less compute via a new spatial pretraining curriculum.

citing papers explorer

Showing 2 of 2 citing papers.

Ouroboros-Spatial: Closing the Data-Model Loop for Spatial Reasoning cs.CV · 2026-06-10 · unverdicted · none · ref 10
A closed-loop self-evolving training system for spatial reasoning in MLLMs that iteratively generates QA pairs matched to the model's current capabilities via confidence feedback, achieving gains with an order of magnitude less data.
OneCanvas: 3D Scene Understanding via Panoramic Reprojection cs.CV · 2026-06-17 · unverdicted · none · ref 3
OneCanvas aggregates multi-view 3D patches onto one panoramic canvas with continuous angular placement and 3D embeddings, enabling pretrained VLMs to achieve SOTA on SQA3D and VSI-Bench with an order of magnitude less compute via a new spatial pretraining curriculum.

Thinking with spatial code for physical-world video reasoning.arXiv preprint arXiv:2603.05591,

fields

years

verdicts

representative citing papers

citing papers explorer