Recognition: unknown
Oscillating solutions to the mean-field Langevin descent-ascent flow
Pith reviewed 2026-05-10 15:43 UTC · model grok-4.3
The pith
The mean-field Langevin descent-ascent flow on R squared can oscillate indefinitely instead of converging.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We present a counterexample to the statement of convergence of the mean-field Langevin descent-ascent flow on R^2. We consider payoff functions that are shaped as a double well in each coordinate, and for which the deterministic dynamics admits a limit cycle. When the coupling between the two coordinates is sufficiently strong and the entropic regularization sufficiently small, we show that the mean-field dynamics remains close to this cyclic behavior, and in particular, does not converge.
What carries the argument
The mean-field Langevin descent-ascent flow, a system of stochastic differential equations governing the evolution of two interacting measures under regularized gradient descent and ascent.
If this is right
- The flow fails to converge and instead exhibits persistent oscillations near the deterministic limit cycle.
- This non-convergence holds when inter-coordinate coupling exceeds a sufficient threshold.
- The behavior appears when the entropic regularization parameter falls below a sufficient threshold.
- The construction serves as a concrete counterexample to any claim of unconditional convergence for the flow.
Where Pith is reading between the lines
- Convergence proofs for related mean-field game flows may need explicit bounds on coupling strength relative to regularization to exclude cycles.
- Similar sustained oscillations could appear in discrete-time or finite-player approximations of the same payoff structure.
- The counterexample raises the question of whether other regularizations or payoff shapes with deterministic cycles produce non-convergent mean-field behavior.
Load-bearing premise
The payoffs must be double wells in each coordinate so the deterministic dynamics has a limit cycle, and this cycle persists in the mean-field flow only for strong enough coupling and small enough regularization.
What would settle it
Run a numerical simulation of the mean-field dynamics with the double-well payoffs, strong coupling parameter, and small regularization parameter to check whether the measures sustain oscillations near the deterministic cycle or eventually converge to a fixed point.
Figures
read the original abstract
We present a counterexample to the statement of convergence of the mean-field Langevin descent-ascent flow on $\mathbb{R}^2$. We consider payoff functions that are shaped as a double well in each coordinate, and for which the deterministic dynamics admits a limit cycle. When the coupling between the two coordinates is sufficiently strong and the entropic regularization sufficiently small, we show that the mean-field dynamics remains close to this cyclic behavior, and in particular, does not converge.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper constructs explicit double-well payoff functions on R^2 for which the deterministic descent-ascent vector field admits a limit cycle. It claims that, for sufficiently strong inter-coordinate coupling and sufficiently small but positive entropic regularization, the associated mean-field Langevin descent-ascent flow remains close to this cycle for all time and therefore does not converge.
Significance. If the central claim holds, the explicit construction would provide a concrete counterexample to convergence statements for mean-field Langevin flows in min-max settings, showing that persistent oscillations are possible even with positive regularization. The use of standard SDE analysis on a parameter-free example is a strength.
major comments (2)
- [Abstract] Abstract and the statement of the main result: the assertion that the flow 'does not converge' for any positive regularization contradicts the standard well-posedness theory for the Fokker-Planck equation associated to the SDE dZ_t = F(Z_t) dt + sqrt(2 reg) dW_t. Under the dissipativity induced by the double-well potentials plus strong coupling, this linear parabolic PDE admits a unique invariant probability measure to which every solution converges in total variation (or Wasserstein distance) as t → ∞; no non-constant time-periodic orbit in the space of measures can exist. The manuscript must either (i) show that its definition of the mean-field flow differs from this SDE or (ii) clarify what 'does not converge' means in light of this convergence.
- [Main result / limiting argument] The limiting argument for closeness to the deterministic cycle (presumably in the section containing the main theorem): even if trajectories remain close to the cycle on finite-time horizons when reg is small, the long-time convergence of the law to the unique invariant measure (supported near the cycle) still occurs. The manuscript needs to reconcile the claimed perpetual closeness with this ergodicity; otherwise the non-convergence claim is not supported.
minor comments (2)
- Notation for the regularization parameter should be introduced once and used consistently; avoid switching between 'reg', 'ε', etc.
- If numerical illustrations of the oscillating behavior are included, add a brief description of the discretization scheme and time horizon used.
Simulated Author's Rebuttal
We thank the referee for the careful reading and for identifying the need for greater precision in our statements about non-convergence. We address each major comment below.
read point-by-point responses
-
Referee: [Abstract] Abstract and the statement of the main result: the assertion that the flow 'does not converge' for any positive regularization contradicts the standard well-posedness theory for the Fokker-Planck equation associated to the SDE dZ_t = F(Z_t) dt + sqrt(2 reg) dW_t. Under the dissipativity induced by the double-well potentials plus strong coupling, this linear parabolic PDE admits a unique invariant probability measure to which every solution converges in total variation (or Wasserstein distance) as t → ∞; no non-constant time-periodic orbit in the space of measures can exist. The manuscript must either (i) show that its definition of the mean-field flow differs from this SDE or (ii) clarify what 'does not converge' means in light of this convergence.
Authors: We agree that the law of the process converges to the unique invariant measure of the associated Fokker-Planck equation. Our use of 'does not converge' is meant to indicate that the dynamics do not converge to a stationary Nash equilibrium (i.e., a Dirac measure supported at an equilibrium point of the game). Instead, for small regularization and strong coupling, the invariant measure is supported near the deterministic limit cycle, so that sample paths exhibit persistent oscillations around the cycle. We will revise the abstract and main-result statement to make this distinction explicit, thereby addressing option (ii). We do not assert that the underlying SDE differs from the standard form. revision: yes
-
Referee: [Main result / limiting argument] The limiting argument for closeness to the deterministic cycle (presumably in the section containing the main theorem): even if trajectories remain close to the cycle on finite-time horizons when reg is small, the long-time convergence of the law to the unique invariant measure (supported near the cycle) still occurs. The manuscript needs to reconcile the claimed perpetual closeness with this ergodicity; otherwise the non-convergence claim is not supported.
Authors: We concur that the law converges to the invariant measure. Because the invariant measure concentrates near the deterministic limit cycle when regularization is sufficiently small, the process remains close to the cycle for all sufficiently large times (in the sense that its distance to the cycle is small with high probability). Ergodicity governs mixing inside the thin tube surrounding the cycle but does not contradict uniform-in-time closeness to the cycle itself. We will add a clarifying paragraph in the main-result section that reconciles the limiting argument with ergodicity and reiterates that non-convergence refers to equilibria, not to the cycle. revision: yes
Circularity Check
No circularity; explicit counterexample from concrete functions and standard analysis
full rationale
The paper constructs explicit double-well payoff functions on R^2 for which the deterministic (zero-regularization) descent-ascent dynamics admits a limit cycle. It then invokes standard SDE perturbation and Fokker-Planck estimates to show that, for sufficiently strong coupling and sufficiently small positive regularization, trajectories remain close to that cycle for all time. No step defines a quantity in terms of itself, renames a fitted parameter as a prediction, or relies on a load-bearing self-citation whose content reduces to the present claim. The derivation is self-contained against external benchmarks (classical SDE theory and parabolic PDE estimates) and does not reduce any asserted result to its own inputs by construction.
Axiom & Free-Parameter Ledger
axioms (1)
- standard math Existence and uniqueness of solutions to the mean-field stochastic differential equation
Reference graph
Works this paper leans on
-
[1]
A mean-field analysis of two-player zero-sum games.Advances in neural information processing systems, 33:20215–20226, 2020
Carles Domingo-Enrich, Samy Jelassi, Arthur Mensch, Grant Rotskoff, and Joan Bruna. A mean-field analysis of two-player zero-sum games.Advances in neural information processing systems, 33:20215–20226, 2020
2020
-
[2]
Zurich Lectures in Advanced Mathematics
Tomas Dominguez and Jean-Christophe Mourrat.Statistical mechanics of mean-field dis- ordered systems: a Hamilton-Jacobi approach. Zurich Lectures in Advanced Mathematics. European Mathematical Society (EMS), Zürich, 2024
2024
-
[3]
Freidlin-Wentzell LDP in path space for McKean-Vlasov equations and the functional iterated logarithm law.Ann
Gon¸calo dos Reis, William Salkeld, and Julian Tugaut. Freidlin-Wentzell LDP in path space for McKean-Vlasov equations and the functional iterated logarithm law.Ann. Appl. Probab., 29(3):1487–1540, 2019
2019
-
[4]
McKean-Vlasov SDE and SPDE with locally monotone coefficients.Ann
Wei Hong, Shanshan Hu, and Wei Liu. McKean-Vlasov SDE and SPDE with locally monotone coefficients.Ann. Appl. Probab., 34(2):2136–2189, 2024
2024
-
[5]
Symmetric mean-field Langevin dynamics for distributional minimax problems
Juno Kim, Kakei Yamamoto, Kazusato Oko, Zhuoran Yang, and Taiji Suzuki. Symmetric mean-field Langevin dynamics for distributional minimax problems. InInternational Conference on Learning Representations, 2024
2024
-
[6]
Two-scale gradient descent ascent dynamics finds mixed Nash equilibria of continuous games: a mean-field perspective
Yulong Lu. Two-scale gradient descent ascent dynamics finds mixed Nash equilibria of continuous games: a mean-field perspective. InInternational Conference on Machine Learning, pages 22790–22811. PMLR, 2023
2023
-
[7]
Provably convergent quasistatic dynamics for mean-field two-player zero-sum games
Chao Ma and Lexing Ying. Provably convergent quasistatic dynamics for mean-field two-player zero-sum games. InInternational Conference on Learning Representations, 2022
2022
-
[8]
PL conditions do not guarantee convergence of gradient descent-ascent dynamics
Jean-Christophe Mourrat. PL conditions do not guarantee convergence of gradient descent-ascent dynamics. Preprint, arXiv:2602.16517
work page internal anchor Pith review arXiv
-
[9]
Generalization of an inequality by Talagrand and links with the logarithmic Sobolev inequality.J
Felix Otto and Cédric Villani. Generalization of an inequality by Talagrand and links with the logarithmic Sobolev inequality.J. Funct. Anal., 173(2):361–400, 2000
2000
-
[10]
Springer-Verlag, New York, third edition, 2001
Lawrence Perko.Differential equations and dynamical systems, volume 7 ofTexts in Applied Mathematics. Springer-Verlag, New York, third edition, 2001
2001
-
[11]
Local exponen- tial stability of mean-field Langevin descent-ascent in Wasserstein space
Geuntaek Seo, Minseop Shin, Pierre Monmarché, and Beomjun Choi. Local expo- nential stability of mean-field langevin descent-ascent in Wasserstein space. Preprint, arXiv:2602.01564
-
[12]
Open problem: Convergence of single-timescale mean-field Langevin descent-ascent for two-player zero-sum games
Guillaume Wang and Lénaïc Chizat. Open problem: Convergence of single-timescale mean-field Langevin descent-ascent for two-player zero-sum games. InThe Thirty Seventh Annual Conference on Learning Theory, pages 5345–5350. PMLR, 2024
2024
-
[13]
Local convergence of mean-field Langevin dy- namics: from gradient flows to linearly monotone games
Guillaume Wang and Lénaïc Chizat. Local convergence of mean-field Langevin dynamics: from gradient flows to linearly monotone games. Preprint, arXiv:2602.11999. (Jean-Christophe Mourrat)Department of Mathematics, ENS Lyon and CNRS, Lyon, France (Loucas Pillaud-Vivien)CERMICS, CNRS, ENPC, Institut Polytechnique de Paris, Marne-la-V allée, France
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.