Optimal Client Sampling for Federated Learning

Peter Richtarik; Samuel Horvath; Wenlin Chen

arxiv: 2010.13723 · v3 · pith:JFHO755Ynew · submitted 2020-10-26 · 💻 cs.LG · cs.DC

Optimal Client Sampling for Federated Learning

Wenlin Chen , Samuel Horvath , Peter Richtarik This is my paper

classification 💻 cs.LG cs.DC

keywords clientclientscommunicationfederatedformulamethodsonlyoptimal

0 comments

read the original abstract

It is well understood that client-master communication can be a primary bottleneck in Federated Learning. In this work, we address this issue with a novel client subsampling scheme, where we restrict the number of clients allowed to communicate their updates back to the master node. In each communication round, all participating clients compute their updates, but only the ones with "important" updates communicate back to the master. We show that importance can be measured using only the norm of the update and give a formula for optimal client participation. This formula minimizes the distance between the full update, where all clients participate, and our limited update, where the number of participating clients is restricted. In addition, we provide a simple algorithm that approximates the optimal formula for client participation, which only requires secure aggregation and thus does not compromise client privacy. We show both theoretically and empirically that for Distributed SGD (DSGD) and Federated Averaging (FedAvg), the performance of our approach can be close to full participation and superior to the baseline where participating clients are sampled uniformly. Moreover, our approach is orthogonal to and compatible with existing methods for reducing communication overhead, such as local methods and communication compression methods.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

FedSteer: Taming Extreme Gradient Staleness in Federated Learning with Corrective Projections and Caching
cs.LG 2026-06 unverdicted novelty 5.0

FedSteer constructs a gradient subspace from cached client updates, projects active gradients to obtain coordinates, and reuses those coordinates on the drifted subspace to correct extreme staleness in federated learning.
Optimizing Split Federated Learning with Unstable Client Participation
cs.NI 2025-09 unverdicted novelty 5.0

The paper derives the first convergence upper bound for split federated learning under activation upload, gradient download, and aggregation failures, then jointly optimizes client sampling and model splitting to mini...