Paint by Example: Exemplar-based Image Editing with Diffusion Models

Binxin Yang; Bo Zhang; Dong Chen; Fang Wen; Shuyang Gu; Ting Zhang; Xiaoyan Sun; Xuejin Chen

arxiv: 2211.13227 · v1 · pith:DFQI3S7Unew · submitted 2022-11-23 · 💻 cs.CV

Paint by Example: Exemplar-based Image Editing with Diffusion Models

Binxin Yang , Shuyang Gu , Bo Zhang , Ting Zhang , Xuejin Chen , Xiaoyan Sun , Dong Chen , Fang Wen This is my paper

classification 💻 cs.CV

keywords imageeditingexemplardiffusionachieveachievedachievesanalyze

0 comments

read the original abstract

Language-guided image editing has achieved great success recently. In this paper, for the first time, we investigate exemplar-guided image editing for more precise control. We achieve this goal by leveraging self-supervised training to disentangle and re-organize the source image and the exemplar. However, the naive approach will cause obvious fusing artifacts. We carefully analyze it and propose an information bottleneck and strong augmentations to avoid the trivial solution of directly copying and pasting the exemplar image. Meanwhile, to ensure the controllability of the editing process, we design an arbitrary shape mask for the exemplar image and leverage the classifier-free guidance to increase the similarity to the exemplar image. The whole framework involves a single forward of the diffusion model without any iterative optimization. We demonstrate that our method achieves an impressive performance and enables controllable editing on in-the-wild images with high fidelity.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 5 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

iTryOn: Mastering Interactive Video Virtual Try-On with Spatial-Semantic Guidance
cs.CV 2026-05 unverdicted novelty 7.0

iTryOn is a video diffusion Transformer that injects spatial 3D hand guidance and semantic action captions to enable interactive garment replacement in videos.
iTryOn: Mastering Interactive Video Virtual Try-On with Spatial-Semantic Guidance
cs.CV 2026-05 unverdicted novelty 7.0

iTryOn is a diffusion-based framework that adds spatial 3D hand guidance and semantic action-aware embeddings to handle complex garment deformations during human-clothing interactions in videos.
ChArtist: Generating Pictorial Charts with Unified Spatial and Subject Control
cs.CV 2026-03 unverdicted novelty 7.0

ChArtist generates pictorial charts via a Diffusion Transformer using skeleton-based spatial control and reference-image subject control, supported by a new 30,000-triplet dataset and data accuracy metric.
PostureObjectstitch: Anomaly Image Generation Considering Assembly Relationships in Industrial Scenarios
cs.CV 2026-04 unverdicted novelty 6.0

PostureObjectStitch generates assembly-aware anomaly images by decoupling multi-view features into high-frequency, texture and RGB components, modulating them temporally in a diffusion model, and applying conditional ...
HarmoniDiff-RS: Training-Free Diffusion Harmonization for Satellite Image Composition
cs.CV 2026-04 unverdicted novelty 5.0

HarmoniDiff-RS performs training-free harmonization of satellite image composites using diffusion latents with mean shift and timestep fusion, plus a new RSIC-H benchmark of 500 pairs.