Learning to optimize with guarantees: a complete characterization of linearly convergent algorithms

Andrea Martin; Ian R. Manchester; Luca Furieri

arxiv: 2508.00775 · v2 · pith:XEW5H3FDnew · submitted 2025-08-01 · 📡 eess.SY · cs.LG· cs.SY· math.OC

Learning to optimize with guarantees: a complete characterization of linearly convergent algorithms

Andrea Martin , Ian R. Manchester , Luca Furieri This is my paper

classification 📡 eess.SY cs.LGcs.SYmath.OC

keywords algorithmslinearlinearlyoptimizationalgorithmclassicalconvergentguarantees

0 comments

read the original abstract

The design of many classical optimization algorithms is driven by the certification of linear convergence rates over classes of optimization problems. In this paper, we consider the problem of improving the average-case performance of an algorithm over a specific distribution of problem instances. While this task can be tackled by embedding trainable components into the algorithm updates, a key challenge is to preserve worst-case guarantees across the entire problem class. For classes of composite optimization problems, we show that all linearly convergent algorithms can be parametrized in terms of a baseline linearly convergent algorithm, and a set of trainable, exponentially-decaying modifications to its update rule; crucially, this parametrization excludes all-and only-the algorithms that do not converge linearly. Our results apply to improving the average-case performance of classical algorithms such as gradient descent for nonconvex, gradient-dominated functions; Nesterov's accelerated method for smooth, strongly convex functions; and projected gradient methods for optimization over polyhedral feasible sets. We illustrate how our characterization can be used for learning to optimize with linear convergence and feasibility guarantees. Numerical results showcase benefits over classical optimizers when solving ill-conditioned systems of linear equations and running a model predictive control scheme on a linear dynamical system.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Learning Over-Relaxation Policies for ADMM with Convergence Guarantees
math.OC 2026-04 unverdicted novelty 6.0

Learned online policies for the ADMM relaxation parameter improve iteration count and runtime on benchmark quadratic programs while maintaining convergence guarantees for time-varying parameters under mild assumptions.