arxiv: 2211.13227 · v1 · pith:DFQI3S7Unew · submitted 2022-11-23 · 💻 cs.CV

Paint by Example: Exemplar-based Image Editing with Diffusion Models

classification 💻 cs.CV
keywords image, editing, exemplar, diffusion, achieve, achieved, achieves, analyze

Language-guided image editing has achieved great success recently. In this paper, for the first time, we investigate exemplar-guided image editing for more precise control. We achieve this goal by leveraging self-supervised training to disentangle and re-organize the source image and the exemplar. However, the naive approach will cause obvious fusing artifacts. We carefully analyze it and propose an information bottleneck and strong augmentations to avoid the trivial solution of directly copying and pasting the exemplar image. Meanwhile, to ensure the controllability of the editing process, we design an arbitrary shape mask for the exemplar image and leverage the classifier-free guidance to increase the similarity to the exemplar image. The whole framework involves a single forward of the diffusion model without any iterative optimization. We demonstrate that our method achieves an impressive performance and enables controllable editing on in-the-wild images with high fidelity.
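
The abstract names classifier-free guidance as the mechanism for increasing similarity to the exemplar. As a minimal sketch of the generic guidance rule (not the paper's code), the guided noise estimate extrapolates from the unconditional prediction toward the conditional one. The function and argument names are hypothetical, and the condition is assumed here to be the exemplar image; plain Python lists stand in for the model's noise tensors.

```python
from typing import List, Sequence


def classifier_free_guidance(
    eps_uncond: Sequence[float],
    eps_cond: Sequence[float],
    scale: float,
) -> List[float]:
    """Combine two noise predictions with the standard guidance rule.

    eps_uncond: prediction with the condition dropped (hypothetical name).
    eps_cond:   prediction given the condition, assumed to be the exemplar.
    scale:      guidance scale; 0 returns eps_uncond, 1 returns eps_cond,
                and values above 1 push further toward the condition.
    """
    if len(eps_uncond) != len(eps_cond):
        raise ValueError("predictions must have the same length")
    # eps_guided = eps_uncond + scale * (eps_cond - eps_uncond)
    return [u + scale * (c - u) for u, c in zip(eps_uncond, eps_cond)]


# Toy values only; a real model would produce these from the noisy latent.
guided = classifier_free_guidance([0.0, 1.0], [1.0, 3.0], scale=2.0)
print(guided)
```

A scale above 1 trades sample diversity for closer agreement with the condition, which fits the abstract's stated purpose for the guidance.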

This paper has not been read by Pith yet.

Forward citations

Cited by 5 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. iTryOn: Mastering Interactive Video Virtual Try-On with Spatial-Semantic Guidance

    cs.CV 2026-05 unverdicted novelty 7.0

    iTryOn is a video diffusion Transformer that injects spatial 3D hand guidance and semantic action captions to enable interactive garment replacement in videos.

  2. iTryOn: Mastering Interactive Video Virtual Try-On with Spatial-Semantic Guidance

    cs.CV 2026-05 unverdicted novelty 7.0

    iTryOn is a diffusion-based framework that adds spatial 3D hand guidance and semantic action-aware embeddings to handle complex garment deformations during human-clothing interactions in videos.

  3. ChArtist: Generating Pictorial Charts with Unified Spatial and Subject Control

    cs.CV 2026-03 unverdicted novelty 7.0

    ChArtist generates pictorial charts via a Diffusion Transformer using skeleton-based spatial control and reference-image subject control, supported by a new 30,000-triplet dataset and data accuracy metric.

  4. PostureObjectStitch: Anomaly Image Generation Considering Assembly Relationships in Industrial Scenarios

    cs.CV 2026-04 unverdicted novelty 6.0

    PostureObjectStitch generates assembly-aware anomaly images by decoupling multi-view features into high-frequency, texture and RGB components, modulating them temporally in a diffusion model, and applying conditional ...

  5. HarmoniDiff-RS: Training-Free Diffusion Harmonization for Satellite Image Composition

    cs.CV 2026-04 unverdicted novelty 5.0

    HarmoniDiff-RS performs training-free harmonization of satellite image composites using diffusion latents with mean shift and timestep fusion, plus a new RSIC-H benchmark of 500 pairs.