Training GANs with Optimism

Constantinos Daskalakis, Andrew Ilyas, Vasilis Syrgkanis, Haoyang Zeng · 2017 · cs.LG · arXiv 1711.00141

8 Pith papers cite this work. Polarity classification is still indexing.

8 Pith papers citing it

open full Pith review browse 8 citing papers arXiv PDF

abstract

We address the issue of limit cycling behavior in training Generative Adversarial Networks and propose the use of Optimistic Mirror Decent (OMD) for training Wasserstein GANs. Recent theoretical results have shown that optimistic mirror decent (OMD) can enjoy faster regret rates in the context of zero-sum games. WGANs is exactly a context of solving a zero-sum game with simultaneous no-regret dynamics. Moreover, we show that optimistic mirror decent addresses the limit cycling problem in training WGANs. We formally show that in the case of bi-linear zero-sum games the last iterate of OMD dynamics converges to an equilibrium, in contrast to GD dynamics which are bound to cycle. We also portray the huge qualitative difference between GD and OMD dynamics with toy examples, even when GD is modified with many adaptations proposed in the recent literature, such as gradient penalty or momentum. We apply OMD WGAN training to a bioinformatics problem of generating DNA sequences. We observe that models trained with OMD achieve consistently smaller KL divergence with respect to the true underlying distribution, than models trained with GD variants. Finally, we introduce a new algorithm, Optimistic Adam, which is an optimistic variant of Adam. We apply it to WGAN training on CIFAR10 and observe improved performance in terms of inception score as compared to Adam.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Metric-Gradient Projection for Stable Multi-Agent Policy Learning

cs.LG · 2026-05-12 · unverdicted · novelty 7.0

HPML projects multi-agent update fields onto the closest metric-gradient potential flow via Hodge decomposition, yielding Lyapunov potentials and equilibrium-gap bounds.

Higher-Order Uncoupled Learning Dynamics and Nash Equilibrium

cs.MA · 2025-06-12 · unverdicted · novelty 7.0

Higher-order uncoupled dynamics exist that locally learn isolated mixed-strategy Nash equilibria in finite games, but no universal dynamics can learn them across all such games.

Shuffling the Data, Stretching the Step-size: Sharper Bias in constant step-size SGD

math.OC · 2026-04-11 · unverdicted · novelty 7.0

Combining random reshuffling and Richardson-Romberg extrapolation yields cubic bias refinement and better MSE for constant-step SGD on structured non-monotone variational inequalities.

Addressing Over-Refusal in LLMs with Competing Rewards

cs.LG · 2026-06-30 · unverdicted · novelty 6.0

SEAR trains one LLM via adversarial process rewards to explore harmful reasoning paths but flip to safe outputs, reducing over-refusal while preserving safety.

SGD at the Edge of Stability: Stochastic Stabilization with Large Learning Rates

stat.ML · 2026-06-29 · unverdicted · novelty 6.0

SGD on multiclass cross-entropy loss alternates between curvature-driven oscillations and stable regimes but self-stabilizes to enable best-iterate convergence with large learning rates for linear and two-layer models.

A Unified Variational Design of Predictive Mirror Descent in Convex Games under Stochastic Feedback

math.OC · 2026-06-01 · unverdicted · novelty 6.0

A variational stochastic differential game with auxiliary memory produces two-channel predictive mirror dynamics and local finite-horizon last-iterate bounds under mirror regularity and bounded diffusion.

Why SGD is not Brownian Motion: A New Perspective on Stochastic Dynamics

cs.LG · 2026-05-21 · unverdicted · novelty 6.0

SGD is reformulated via a master equation from discrete updates, producing a discrete Fokker-Planck equation that predicts non-stationary variance growth proportional to learning rate in flat Hessian directions.

Mirror Descent-Type Algorithms for the Variational Inequality Problem with Functional Constraints

cs.LG · 2026-02-24 · unverdicted · novelty 6.0

Mirror descent algorithms with productive/non-productive step switching achieve optimal convergence rates for bounded monotone operators and Lipschitz convex functional constraints in variational inequalities.

citing papers explorer

Showing 0 of 0 citing papers after filters.

No citing papers match the current filters.

Training GANs with Optimism

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer