Image-to-Image Translation with Conditional Adversarial Networks

Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, Alexei A. Efros

classification cs.CV

keywords loss, mapping, networks, adversarial, approach, conditional, functions, image

We investigate conditional adversarial networks as a general-purpose solution to image-to-image translation problems. These networks not only learn the mapping from input image to output image, but also learn a loss function to train this mapping. This makes it possible to apply the same generic approach to problems that traditionally would require very different loss formulations. We demonstrate that this approach is effective at synthesizing photos from label maps, reconstructing objects from edge maps, and colorizing images, among other tasks. Indeed, since the release of the pix2pix software associated with this paper, a large number of internet users (many of them artists) have posted their own experiments with our system, further demonstrating its wide applicability and ease of adoption without the need for parameter tweaking. As a community, we no longer hand-engineer our mapping functions, and this work suggests we can achieve reasonable results without hand-engineering our loss functions either.
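
The idea of learning a loss alongside the mapping can be illustrated with a small sketch. The abstract does not spell out the objective, so everything below is an assumption made for illustration: a discriminator scores (input, output) pairs, the generator is trained against that score, and a fixed L1 term is added with a hypothetical weight `l1_weight`. The function names are likewise hypothetical, and plain Python lists stand in for images.

```python
import math


def bce(prob, target):
    """Binary cross-entropy for a single probability in (0, 1)."""
    eps = 1e-12  # guards against log(0)
    prob = min(max(prob, eps), 1.0 - eps)
    return -(target * math.log(prob) + (1.0 - target) * math.log(1.0 - prob))


def l1_distance(a, b):
    """Mean absolute difference between two equal-length pixel lists."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def discriminator_loss(d_real, d_fake):
    # The discriminator should score (input, target) pairs as real (1)
    # and (input, generated) pairs as fake (0).
    return bce(d_real, 1.0) + bce(d_fake, 0.0)


def generator_loss(d_fake, generated, target, l1_weight=100.0):
    # Learned term: the generator is rewarded when the discriminator
    # scores its output as real. Fixed term: an L1 penalty keeps the
    # output close to the target. l1_weight is an assumed setting.
    return bce(d_fake, 1.0) + l1_weight * l1_distance(generated, target)


if __name__ == "__main__":
    target = [0.0, 0.5, 1.0]
    generated = [0.1, 0.5, 0.8]
    print(discriminator_loss(d_real=0.9, d_fake=0.2))
    print(generator_loss(d_fake=0.2, generated=generated, target=target))
```

In this sketch the adversarial term plays the role of the learned loss: it changes as the discriminator trains, so the same code applies whether the targets are photos, colorizations, or label maps.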

Forward citations

Cited by 8 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

A Paired Point-of-Care Ultrasound Dataset for Image Quality Enhancement and Benchmarking via a cGAN Baseline
eess.IV 2026-05 conditional novelty 7.0

The first paired POCUS-to-high-end ultrasound dataset is released and a cGAN baseline raises SSIM from 0.29 to 0.54 and PSNR from 19.16 dB to 22.41 dB on 1064 test pairs.
VitaminP: cross-modal learning enables whole-cell segmentation from routine histology
cs.CV 2026-04 unverdicted novelty 7.0

VitaminP uses paired H&E-mIF data to train a model that transfers molecular boundary information, enabling accurate whole-cell segmentation directly from routine H&E histology across 34 cancer types.
SyMTRS: Benchmark Multi-Task Synthetic Dataset for Depth, Domain Adaptation and Super-Resolution in Aerial Imagery
cs.CV 2026-04 unverdicted novelty 7.0

A new large-scale synthetic multi-task benchmark dataset supplying pixel-perfect depth, domain-shifted night imagery, and multi-scale low-resolution pairs for aerial remote sensing.
Style-Based Neural Architectures for Real-Time Weather Classification
cs.CV 2026-04 unverdicted novelty 5.0

Three style-based neural architectures are proposed for real-time weather classification from images, with two truncated ResNet variants claimed to outperform prior methods and generalize across public datasets.
Stylistic-STORM (ST-STORM): Perceiving the Semantic Nature of Appearance
cs.CV 2026-04 unverdicted novelty 5.0

ST-STORM introduces a dual-branch SSL framework that disentangles semantic content from stylistic appearance using gated latent streams, JEPA for content invariance, and adversarial constraints for style capture.
Heuristic Style Transfer for Real-Time, Efficient Weather Attribute Detection
cs.CV 2026-04 conditional novelty 5.0

Lightweight multi-task models using Gram matrices and PatchGAN-style architectures detect 53 weather classes from RGB images with F1 scores above 96% internally and 78% zero-shot externally, supported by a new 503k-im...
VCC-DSA: A Novel Vascular Consistency Constrained DSA Imaging Model for Motion Artifact Suppression
eess.IV 2026-04 unverdicted novelty 5.0

VCC-DSA uses a vascular consistency constraint and self-evolving training data to suppress motion artifacts in DSA, reporting 73.4% PSNR and 8.56% SSIM gains over other methods.
A Wasserstein GAN-based climate scenario generator for risk management and insurance: the case of soil subsidence
cs.LG 2026-04 unverdicted novelty 4.0

A conditional Wasserstein GAN generates plausible future SWI drought trajectories for French insurance risk management under climate change.