pith. machine review for the scientific record. sign in

arxiv: 1902.03938 · v1 · submitted 2019-02-11 · 💻 cs.CV

Recognition: unknown

MISO: Mutual Information Loss with Stochastic Style Representations for Multimodal Image-to-Image Translation

Authors on Pith no claims yet
classification 💻 cs.CV
keywords lossdomaininformationmultimodaltranslationimagemutualrepresentation
0
0 comments X
read the original abstract

Unpaired multimodal image-to-image translation is a task of translating a given image in a source domain into diverse images in the target domain, overcoming the limitation of one-to-one mapping. Existing multimodal translation models are mainly based on the disentangled representations with an image reconstruction loss. We propose two approaches to improve multimodal translation quality. First, we use a content representation from the source domain conditioned on a style representation from the target domain. Second, rather than using a typical image reconstruction loss, we design MILO (Mutual Information LOss), a new stochastically-defined loss function based on information theory. This loss function directly reflects the interpretation of latent variables as a random variable. We show that our proposed model Mutual Information with StOchastic Style Representation(MISO) achieves state-of-the-art performance through extensive experiments on various real-world datasets.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.