DragScene: Interactive 3D Scene Editing with Single-view Drag Instructions

Chenghao Gu; Shuzhao Xie; Yunpeng Bai; Zhengqi Zhang; Zhenzhe Li; Zhi Wang

arxiv: 2412.13552 · v1 · pith:GO5ZZ5F2new · submitted 2024-12-18 · 💻 cs.CV · cs.GR

DragScene: Interactive 3D Scene Editing with Single-view Drag Instructions

Chenghao Gu , Zhenzhe Li , Zhengqi Zhang , Yunpeng Bai , Shuzhao Xie , Zhi Wang This is my paper

classification 💻 cs.CV cs.GR

keywords editingdrag-styledragsceneeditsinstructionslatentmulti-viewscenes

0 comments

read the original abstract

3D editing has shown remarkable capability in editing scenes based on various instructions. However, existing methods struggle with achieving intuitive, localized editing, such as selectively making flowers blossom. Drag-style editing has shown exceptional capability to edit images with direct manipulation instead of ambiguous text commands. Nevertheless, extending drag-based editing to 3D scenes presents substantial challenges due to multi-view inconsistency. To this end, we introduce DragScene, a framework that integrates drag-style editing with diverse 3D representations. First, latent optimization is performed on a reference view to generate 2D edits based on user instructions. Subsequently, coarse 3D clues are reconstructed from the reference view using a point-based representation to capture the geometric details of the edits. The latent representation of the edited view is then mapped to these 3D clues, guiding the latent optimization of other views. This process ensures that edits are propagated seamlessly across multiple views, maintaining multi-view consistency. Finally, the target 3D scene is reconstructed from the edited multi-view images. Extensive experiments demonstrate that DragScene facilitates precise and flexible drag-style editing of 3D scenes, supporting broad applicability across diverse 3D representations.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

MesonGS++: Post-training Compression of 3D Gaussian Splatting with Hyperparameter Searching
cs.CV 2026-04 unverdicted novelty 6.0

MesonGS++ achieves over 34x compression of 3D Gaussian Splatting models with preserved or improved PSNR by using size-aware joint optimization of pruning and quantization hyperparameters via discrete sampling and 0-1 ...
MesonGS++: Post-training Compression of 3D Gaussian Splatting with Hyperparameter Searching
cs.CV 2026-04 unverdicted novelty 5.0

MesonGS++ achieves over 34x compression of 3D Gaussian Splatting models post-training while preserving or exceeding original rendering quality through size-aware hyperparameter optimization.