Conditional Adversarial Generative Flow for Controllable Image Synthesis

Hongsheng Li; Rui Liu; Xiaogang Wang; Xinyu Gong; Yu Liu

arxiv: 1904.01782 · v1 · pith:ZZUBGM5Qnew · submitted 2019-04-03 · 💻 cs.CV

Conditional Adversarial Generative Flow for Controllable Image Synthesis

Rui Liu , Yu Liu , Xinyu Gong , Xiaogang Wang , Hongsheng Li This is my paper

classification 💻 cs.CV

keywords imageconditionalcaglowconditionsgenerativesynthesisadversarialattributes

0 comments

read the original abstract

Flow-based generative models show great potential in image synthesis due to its reversible pipeline and exact log-likelihood target, yet it suffers from weak ability for conditional image synthesis, especially for multi-label or unaware conditions. This is because the potential distribution of image conditions is hard to measure precisely from its latent variable $z$. In this paper, based on modeling a joint probabilistic density of an image and its conditions, we propose a novel flow-based generative model named conditional adversarial generative flow (CAGlow). Instead of disentangling attributes from latent space, we blaze a new trail for learning an encoder to estimate the mapping from condition space to latent space in an adversarial manner. Given a specific condition $c$, CAGlow can encode it to a sampled $z$, and then enable robust conditional image synthesis in complex situations like combining person identity with multiple attributes. The proposed CAGlow can be implemented in both supervised and unsupervised manners, thus can synthesize images with conditional information like categories, attributes, and even some unknown properties. Extensive experiments show that CAGlow ensures the independence of different conditions and outperforms regular Glow to a significant extent.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Shaping Belief States with Generative Environment Models for RL
cs.LG 2019-06 unverdicted novelty 5.0

Multi-step predictive generative models form stable belief states capturing environment layout and agent pose, yielding higher data efficiency on RL tasks than model-free agents.