Self-tuned Regularized Federated Methods with Guarantees for Optimal Solution Selection

Farzad Yousefian; Mohammadjavad Ebrahimi; Shisheng Cui; Yuyang Qiu

arxiv: 2503.08634 · v3 · pith:UZ674PQ4new · submitted 2025-03-11 · 🧮 math.OC

Self-tuned Regularized Federated Methods with Guarantees for Optimal Solution Selection

Mohammadjavad Ebrahimi , Yuyang Qiu , Shisheng Cui , Farzad Yousefian This is my paper

classification 🧮 math.OC

keywords problemguaranteescomplexityconvexfederatedlearningobjectiveoptimal

0 comments

read the original abstract

We study a hierarchical federated learning (FL) problem, where clients cooperatively seek to select among multiple optimal solutions of a primary distributed learning problem, a solution that minimizes a secondary loss function. This problem arises from over-parameterized learning and ill-posed optimization problems. First, we consider the setting where the inner-level objective is convex and the outer-level objective is either convex or strongly convex. We propose a self-tuned regularized federated averaging (StR-FedAvg) method where the stepsize and regularization parameter are characterized by the number of communication rounds and problem parameters. We derive new complexity guarantees for addressing the optimal solution selection problem in FL. Second, when the outer-level objective is nonconvex, we propose a two-loop FL scheme in which the outer loop employs an inexact projected first-order method and the inner loop applies StR-FedAvg with an iteratively updated regularization parameter. We derive new communication complexity guarantees for computing a stationary point of the nonconvex solution-selection problem. To our knowledge, this is the first work to establish complexity guarantees for this class of problems in FL. Preliminary experiments validate our theoretical findings.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

On the Resolution of Stochastic MPECs over Networks: Distributed Implicit Zeroth-Order Gradient Tracking Methods
math.OC 2025-05 unverdicted novelty 6.0

Proposes single-stage and two-stage distributed implicit zeroth-order gradient tracking methods for stochastic MPECs over networks that achieve best-known complexity bounds for centralized nonsmooth nonconvex stochast...