DragScene: Interactive 3D Scene Editing with Single-view Drag Instructions
read the original abstract
3D editing has shown remarkable capability in editing scenes based on various instructions. However, existing methods struggle with achieving intuitive, localized editing, such as selectively making flowers blossom. Drag-style editing has shown exceptional capability to edit images with direct manipulation instead of ambiguous text commands. Nevertheless, extending drag-based editing to 3D scenes presents substantial challenges due to multi-view inconsistency. To this end, we introduce DragScene, a framework that integrates drag-style editing with diverse 3D representations. First, latent optimization is performed on a reference view to generate 2D edits based on user instructions. Subsequently, coarse 3D clues are reconstructed from the reference view using a point-based representation to capture the geometric details of the edits. The latent representation of the edited view is then mapped to these 3D clues, guiding the latent optimization of other views. This process ensures that edits are propagated seamlessly across multiple views, maintaining multi-view consistency. Finally, the target 3D scene is reconstructed from the edited multi-view images. Extensive experiments demonstrate that DragScene facilitates precise and flexible drag-style editing of 3D scenes, supporting broad applicability across diverse 3D representations.
This paper has not been read by Pith yet.
Forward citations
Cited by 2 Pith papers
-
MesonGS++: Post-training Compression of 3D Gaussian Splatting with Hyperparameter Searching
MesonGS++ achieves over 34x compression of 3D Gaussian Splatting models with preserved or improved PSNR by using size-aware joint optimization of pruning and quantization hyperparameters via discrete sampling and 0-1 ...
-
MesonGS++: Post-training Compression of 3D Gaussian Splatting with Hyperparameter Searching
MesonGS++ achieves over 34x compression of 3D Gaussian Splatting models post-training while preserving or exceeding original rendering quality through size-aware hyperparameter optimization.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.