pith. machine review for the scientific record. sign in

arxiv: 2410.15368 · v2 · submitted 2024-10-20 · 🧮 math.OC · cs.LG· stat.ML

Recognition: unknown

Tighter Performance Theory of FedExProx

Authors on Pith no claims yet
classification 🧮 math.OC cs.LGstat.ML
keywords analysisfedexproxextrapolationconvergenceconvexgradientmethodoptimization
0
0 comments X
read the original abstract

We revisit FedExProx - a recently proposed distributed optimization method designed to enhance convergence properties of parallel proximal algorithms via extrapolation. In the process, we uncover a surprising flaw: its known theoretical guarantees on quadratic optimization tasks are no better than those offered by the vanilla Gradient Descent (GD) method. Motivated by this observation, we develop a novel analysis framework, establishing a tighter linear convergence rate for non-strongly convex quadratic problems. By incorporating both computation and communication costs, we demonstrate that FedExProx can indeed provably outperform GD, in stark contrast to the original analysis. Furthermore, we consider partial participation scenarios and analyze two adaptive extrapolation strategies - based on gradient diversity and Polyak stepsizes - again significantly outperforming previous results. Moving beyond quadratics, we extend the applicability of our analysis to general functions satisfying the Polyak-Lojasiewicz condition, outperforming the previous strongly convex analysis while operating under weaker assumptions. Backed by empirical results, our findings point to a new and stronger potential of FedExProx, paving the way for further exploration of the benefits of extrapolation in federated learning.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Stabilized Proximal Point Method via Trust Region Control

    math.OC 2026-04 unverdicted novelty 6.0

    A trust-region stabilized proximal point method enforces a displacement condition to achieve linear descent for general nonsmooth convex problems.