pith. sign in

arxiv: 1705.07215 · v5 · pith:IEAAS65Knew · submitted 2017-05-19 · 💻 cs.AI · cs.CV· cs.GT· cs.LG· cs.NE

On Convergence and Stability of GANs

classification 💻 cs.AI cs.CVcs.GTcs.LGcs.NE
keywords equilibrialocalmodetrainingcollapseconvergencedraganminimization
0
0 comments X
read the original abstract

We propose studying GAN training dynamics as regret minimization, which is in contrast to the popular view that there is consistent minimization of a divergence between real and generated distributions. We analyze the convergence of GAN training from this new point of view to understand why mode collapse happens. We hypothesize the existence of undesirable local equilibria in this non-convex game to be responsible for mode collapse. We observe that these local equilibria often exhibit sharp gradients of the discriminator function around some real data points. We demonstrate that these degenerate local equilibria can be avoided with a gradient penalty scheme called DRAGAN. We show that DRAGAN enables faster training, achieves improved stability with fewer mode collapses, and leads to generator networks with better modeling performance across a variety of architectures and objective functions.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 8 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. MAGIC: Few-Shot Mask-Guided Anomaly Inpainting with Prompt Perturbation, Spatially Adaptive Guidance, and Context Awareness

    cs.CV 2025-07 unverdicted novelty 7.0

    MAGIC is a few-shot mask-guided anomaly inpainting framework using Gaussian prompt perturbation, spatially adaptive guidance, and context-aware mask alignment to produce high-fidelity, diverse anomalies that outperfor...

  2. Large Scale GAN Training for High Fidelity Natural Image Synthesis

    cs.LG 2018-09 accept novelty 7.0

    BigGANs achieve state-of-the-art class-conditional synthesis on ImageNet 128x128 with Inception Score 166.5 and FID 7.4 by scaling GANs and applying orthogonal regularization plus truncation.

  3. Progressive Growing of GANs for Improved Quality, Stability, and Variation

    cs.NE 2017-10 accept novelty 7.0

    Progressive growing stabilizes GAN training to produce high-resolution images of unprecedented quality and achieves a record unsupervised inception score of 8.80 on CIFAR10.

  4. Adversarial Computation of Optimal Transport Maps

    cs.LG 2019-06 unverdicted novelty 6.0

    A GAN with Wasserstein discriminator objective makes the generator follow the W2 geodesic to learn an optimal transport map.

  5. Learning to Emulate Chaos: Adversarial Optimal Transport Regularization

    stat.ML 2026-04 unverdicted novelty 5.0

    Adversarial optimal transport objectives train neural emulators with improved long-term statistical fidelity on chaotic systems.

  6. MSDformer: Multi-scale Discrete Transformer For Time Series Generation

    cs.LG 2025-05 unverdicted novelty 5.0

    MSDformer introduces a multi-scale discrete transformer that tokenizes time series at multiple scales and models them autoregressively in discrete space, claiming superior performance over prior DTM methods with rate-...

  7. A Wasserstein GAN-based climate scenario generator for risk management and insurance: the case of soil subsidence

    cs.LG 2026-04 unverdicted novelty 4.0

    A conditional Wasserstein GAN generates plausible future SWI drought trajectories for French insurance risk management under climate change.

  8. Survey of Deep Learning and Physics-Based Approaches in Computational Wave Imaging

    cs.LG 2024-10 unverdicted novelty 2.0

    A literature survey that organizes research on integrating deep neural networks with physics-based methods for computational wave imaging and identifies lessons and trends.