pith. sign in

SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it
abstract

Image spatial editing performs geometry-driven transformations, allowing precise control over object layout and camera viewpoints. Current models are insufficient for fine-grained spatial manipulations, motivating a dedicated assessment suite. Our contributions are listed: (i) We introduce SpatialEdit-Bench, a complete benchmark that evaluates spatial editing by jointly measuring perceptual plausibility and geometric fidelity via viewpoint reconstruction and framing analysis. (ii) To address the data bottleneck for scalable training, we construct SpatialEdit-500k, a synthetic dataset generated with a controllable Blender pipeline that renders objects across diverse backgrounds and systematic camera trajectories, providing precise ground-truth transformations for both object- and camera-centric operations. (iii) Building on this data, we develop SpatialEdit-16B, a baseline model for fine-grained spatial editing. Our method achieves competitive performance on general editing while substantially outperforming prior methods on spatial manipulation tasks. All resources will be made public at https://github.com/EasonXiao-888/SpatialEdit.

citation-role summary

extension 1

citation-polarity summary

fields

cs.CV 2 cs.GR 1

years

2026 3

verdicts

UNVERDICTED 3

roles

extension 1

polarities

extend 1

representative citing papers

citing papers explorer

Showing 3 of 3 citing papers.