pith. sign in

arxiv: 1809.05964 · v1 · pith:GFQM7AHZnew · submitted 2018-09-16 · 💻 cs.LG · stat.ML

Latent Space Optimal Transport for Generative Models

classification 💻 cs.LG stat.ML
keywords distributionproblemdatasimplelatent-spacesolvingtransformdatasets
0
0 comments X
read the original abstract

Variational Auto-Encoders enforce their learned intermediate latent-space data distribution to be a simple distribution, such as an isotropic Gaussian. However, this causes the posterior collapse problem and loses manifold structure which can be important for datasets such as facial images. A GAN can transform a simple distribution to a latent-space data distribution and thus preserve the manifold structure, but optimizing a GAN involves solving a Min-Max optimization problem, which is difficult and not well understood so far. Therefore, we propose a GAN-like method to transform a simple distribution to a data distribution in the latent space by solving only a minimization problem. This minimization problem comes from training a discriminator between a simple distribution and a latent-space data distribution. Then, we can explicitly formulate an Optimal Transport (OT) problem that computes the desired mapping between the two distributions. This means that we can transform a distribution without solving the difficult Min-Max optimization problem. Experimental results on an eight-Gaussian dataset show that the proposed OT can handle multi-cluster distributions. Results on the MNIST and the CelebA datasets validate the effectiveness of the proposed method.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Mind the Residual Gap: Probabilistic Downscaling under Real-World Bias

    cs.LG 2026-06 unverdicted novelty 7.0

    ReMatch corrects train-test residual distribution mismatch in probabilistic downscaling via optimal transport in low-dimensional PCA space, reducing under-dispersion and improving SSR and CRPS on HRRR-ERA5 wind data.