Generalized Byzantine-tolerant SGD

Cong Xie; Indranil Gupta; Oluwasanmi Koyejo

arxiv: 1802.10116 · v3 · pith:ZAYTCIKCnew · submitted 2018-02-27 · 💻 cs.DC · stat.ML

Generalized Byzantine-tolerant SGD

Cong Xie , Oluwasanmi Koyejo , Indranil Gupta This is my paper

classification 💻 cs.DC stat.ML

keywords byzantineaggregationrulesanalysisapproachesarbitrarilyarchitectureattack

0 comments

read the original abstract

We propose three new robust aggregation rules for distributed synchronous Stochastic Gradient Descent~(SGD) under a general Byzantine failure model. The attackers can arbitrarily manipulate the data transferred between the servers and the workers in the parameter server~(PS) architecture. We prove the Byzantine resilience properties of these aggregation rules. Empirical analysis shows that the proposed techniques outperform current approaches for realistic use cases and Byzantine attack scenarios.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 4 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Practical Validity Conditions for Byzantine-Tolerant Federated Learning
cs.LG 2026-05 unverdicted novelty 7.0

Introduces MEB and c-MEB validity conditions for Byzantine-robust aggregation, proving achievability under majority honesty (n>2t) with an optimal MinMax-MEB rule at c<sqrt(2) and explicit guarantees for standard aggregators.
Byzantine-Robust Distributed SGD: A Unified Analysis and Tight Error Bounds
math.OC 2026-04 unverdicted novelty 7.0

Unified convergence rates and tight lower bounds for Byzantine-robust distributed SGD under stochasticity and general data heterogeneity, showing local momentum reduces stochastic error floors.
RESIST: Resilient Decentralized Learning Using Consensus Gradient Descent
cs.LG 2025-02 unverdicted novelty 6.0

RESIST achieves algorithmic and statistical convergence guarantees for strongly convex, PL, and nonconvex ERM under MITM attacks via multistep consensus gradient descent plus robust screening.
Generalized Rank Regression
stat.ME 2026-05 unverdicted novelty 5.0

Generalized Rank Regression extends rank methods to non-monotonic scores, derives Bahadur representation and asymptotic normality, proposes a two-stage sub-gradient algorithm, and shows variance equivalence to composi...