Colorful Image Colorization

Alexei A. Efros; Phillip Isola; Richard Zhang

arxiv: 1603.08511 · v5 · pith:FYWBAE63new · submitted 2016-03-28 · 💻 cs.CV

Colorful Image Colorization

Richard Zhang , Phillip Isola , Alexei A. Efros This is my paper

classification 💻 cs.CV

keywords colorcolorizationproblemapproachcolorizationsfeatureimagelearning

0 comments

read the original abstract

Given a grayscale photograph as input, this paper attacks the problem of hallucinating a plausible color version of the photograph. This problem is clearly underconstrained, so previous approaches have either relied on significant user interaction or resulted in desaturated colorizations. We propose a fully automatic approach that produces vibrant and realistic colorizations. We embrace the underlying uncertainty of the problem by posing it as a classification task and use class-rebalancing at training time to increase the diversity of colors in the result. The system is implemented as a feed-forward pass in a CNN at test time and is trained on over a million color images. We evaluate our algorithm using a "colorization Turing test," asking human participants to choose between a generated and ground truth color image. Our method successfully fools humans on 32% of the trials, significantly higher than previous methods. Moreover, we show that colorization can be a powerful pretext task for self-supervised feature learning, acting as a cross-channel encoder. This approach results in state-of-the-art performance on several feature learning benchmarks.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Density estimation using Real NVP
cs.LG 2016-05 accept novelty 8.0

Real NVP uses affine coupling layers to create invertible transformations that support exact density estimation, sampling, and latent inference without approximations.
Semantic Deep Intermodal Feature Transfer: Transferring Feature Descriptors Between Imaging Modalities
cs.CV 2019-07 unverdicted novelty 6.0

Se-DIFT predicts feature appearances across RGB and thermal modalities via an encoder-decoder plus global feature vector, cutting L1 error over 7% versus U-Net and enabling intermodal matching of SIFT, SURF, and ORB.