Accelerated Stochastic Min-Max Optimization Based on Bias-corrected Momentum

Ali H.Sayed; Haoyuan Cai; Sulaiman A. Alghunaim

arxiv: 2406.13041 · v3 · pith:YH5CIR35new · submitted 2024-06-18 · 💻 cs.LG · math.OC

Accelerated Stochastic Min-Max Optimization Based on Bias-corrected Momentum

Haoyuan Cai , Sulaiman A. Alghunaim , Ali H.Sayed This is my paper

classification 💻 cs.LG math.OC

keywords varepsilonalgorithmscomplexitymathcaloptimizationstochasticbias-correctedconditions

0 comments

read the original abstract

Lower-bound analyses for nonconvex strongly-concave minimax optimization problems have shown that stochastic first-order algorithms require at least $\mathcal{O}(\varepsilon^{-4})$ sample complexity to find an $\varepsilon$-stationary point. Some works indicate that this complexity can be improved to $\mathcal{O}(\varepsilon^{-3})$ when the stochastic loss gradient is Lipschitz continuous. The question of achieving enhanced convergence rates under distinct conditions, remains open. In this work, we address this question for optimization problems that are nonconvex in the minimization variable and strongly concave or Polyak-Lojasiewicz (PL) in the maximization variable. We introduce novel bias-corrected momentum algorithms utilizing efficient Hessian-vector products. We establish convergence conditions and demonstrate a lower iteration complexity of $\mathcal{O}(\varepsilon^{-3})$ for the proposed algorithms. The effectiveness of the proposed method is validated through applications to robust logistic regression and robust adaptive cruise control.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

A Unified Zeroth-Order Approach for Decentralized Minimax Optimization
math.OC 2026-06 unverdicted novelty 7.0

ZOMA unifies hybrid zeroth-order estimators, bias corrections (GT/ED/EXTRA), and accelerations (STORM/PAGE/L2S) for decentralized nonconvex PL minimax optimization, claiming convergence rates matching centralized meth...
Communication-Efficient Decentralized Stochastic Minimax Optimization
math.OC 2025-07 unverdicted novelty 6.0

A momentum-augmented local decentralized method achieves O(κ ε^{-1}) local updates per round and O(κ² ε^{-2}) communication complexity for nonconvex PL minimax problems, improving on prior local GDA.