DFedReweighting: A Unified Framework for Objective-Oriented Reweighting in Decentralized Federated Learning

arxiv: 2512.12022 · v2 · submitted 2025-12-12 · 💻 cs.LG

DFedReweighting: A Unified Framework for Objective-Oriented Reweighting in Decentralized Federated Learning

Kaichuang Zhang , Wei Yin , Jinghao Yang , Ping Xu This is my paper

Pith reviewed 2026-05-16 22:30 UTC · model grok-4.3

classification 💻 cs.LG

keywords decentralized federated learningreweightingfairnessByzantine robustnessconvergence analysisaggregationmulti-objective optimization

0 comments p. Extension

The pith

DFedReweighting reweights client contributions in each round of decentralized federated learning by first scoring them on a target performance metric evaluated on local auxiliary data and then applying a customized refinement strategy.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces DFedReweighting as a single aggregation framework that lets clients in a decentralized setup pursue different objectives such as fairness or attack resistance without a central server. For each client it builds a compact auxiliary dataset from local data, measures a chosen target performance metric to produce preliminary weights, and then applies a customized reweighting strategy to obtain the final weights used for aggregation. The authors prove that suitable choices of the metric and strategy together yield linear convergence on L-smooth strongly convex objectives. Experiments show consistent gains in fairness across clients and in resilience to Byzantine attackers. The same mechanism also supports multi-objective cases both within a single client and across clients by appropriate design of the metric and strategy.

Core claim

DFedReweighting evaluates a target performance metric on a compact auxiliary dataset constructed from each client's local data to obtain preliminary aggregation weights, then refines those weights with a customized reweighting strategy to produce the final weights; an appropriate combination of the metric and strategy guarantees linear convergence for general L-smooth and strongly convex functions while delivering improved fairness and Byzantine robustness.

What carries the argument

The TPM-CRS pair: a target performance metric (TPM) computed on a compact local auxiliary dataset that yields preliminary weights, followed by a customized reweighting strategy (CRS) that adjusts them into the final aggregation weights.

If this is right

Fairness across clients improves by down-weighting those whose local models perform poorly on the chosen metric.
Robustness to Byzantine clients increases because malicious updates receive low weights after the refinement step.
A wide range of objectives can be realized simply by selecting different target metrics and reweighting rules.
Linear convergence holds for any L-smooth strongly convex loss once the metric and strategy satisfy the stated compatibility condition.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same auxiliary-dataset approach might allow reweighting rules to be adapted on the fly when client data distributions shift.
The framework could be combined with existing decentralized averaging methods to add objective-oriented control without redesigning the communication pattern.
If the auxiliary dataset is kept private, the method may preserve more data locality than approaches that require sharing validation sets.

Load-bearing premise

Each client can form a compact auxiliary dataset from its local data that is representative enough for the chosen performance metric to generate useful preliminary weights.

What would settle it

Demonstration that the proposed TPM-CRS pair fails to produce linear convergence on an L-smooth strongly convex problem, or that fairness and robustness gains disappear when the auxiliary dataset is replaced by a non-representative sample.

Figures

Figures reproduced from arXiv: 2512.12022 by Jinghao Yang, Kaichuang Zhang, Ping Xu, Wei Yin.

**Figure 1.** Figure 1: Objective-oriented reweighting aggregation. Each client assigns aggregation weights based on its specific objective. For instance, to improve model fairness, a client may assign a higher weight to update U1, which yields superior fairness. Similarly, clients can allocate distinct aggregation weights to received updates (U1, U2, U3) based on their individual objective by reweighting aggregation function A… view at source ↗

**Figure 2.** Figure 2: Overview of DFedReweighting in a DFL system. In a 4-client DFL system where each client has 2 neighbors, each client (i) constructs an auxiliary dataset Ak, (ii) computes a preliminary weight mk using the target performance metric (TPM), (iii) derives the final aggregation weight vk via a customized reweighting strategy (CRS), and (iv) aggregates models using this final weight. DFedReweighting heavily depe… view at source ↗

**Figure 3.** Figure 3: Different TPM-CRS combinations lead to different final performance on Byzantine attack robustness (same problem). The first combination (TPM: Loss, CRS: Equation (17)) clearly outperforms the second combination (TPM: Accuracy, CRS: Equation (18)). [4] Peva Blanchard, El Mahdi El Mhamdi, Rachid Guerraoui, and Julien Stainer. Machine learning with adversaries: Byzantine tolerant gradient descent. Advances in… view at source ↗

read the original abstract

Decentralized federated learning (DFL) has emerged as a promising paradigm that enables multiple clients to collaboratively train machine learning models through iterative rounds of local training, communication, and aggregation, without relying on a central server. Nevertheless, DFL systems continue to face a range of challenges, including fairness and Byantine robustness. To address these challenges, we propose \textbf{DFedReweighting}, a unified aggregation framework that achieves diverse learning objectives in DFL via objective-oriented reweighting at the final step of each learning round. Specifically, for each client, the framework first evaluates a target performance metric (TPM) on a compact auxiliary dataset constructed from local data, yielding preliminary aggregation weights, which are subsequently refined by a customized reweighting strategy (CRS) to produce the final aggregation weights. Theoretically, we prove that an appropriate TPM-CRS combination guarantees linear convergence for general $L$-smoothand strongly convex functions. Empirical results consistently demonstrate that \textbf{DFedReweighting} significantly improves fairness and robustness against Byzantine attacks across diverse settings. Two multi-objective examples, spanning tasks across and within clients, further establish that a broad range of desired learning objectives can be accommodated by appropriately designing the TPM and CRS. Our code is available at https://github.com/KaichuangZhang/DFedReweighting.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

DFedReweighting gives a reweighting scheme for decentralized FL to target fairness and Byzantine robustness via auxiliary metrics, with a claimed linear convergence result, but the auxiliary dataset step lacks error bounds.

read the letter

The main thing to know is that this paper offers DFedReweighting, a framework for reweighting client updates in decentralized federated learning to meet goals like fairness across clients and robustness to Byzantine attacks. Each client builds a small auxiliary dataset from its local data, scores a target performance metric on it to set initial weights, then applies a customized reweighting strategy to finalize the aggregation weights for the round. They claim this works for multiple objectives by picking the right metric and strategy pair, and they show empirical gains in varied settings plus a linear convergence proof for L-smooth strongly convex objectives.

Referee Report

1 major / 2 minor

Summary. The paper proposes DFedReweighting, a unified aggregation framework for decentralized federated learning that computes preliminary weights by evaluating a target performance metric (TPM) on a compact auxiliary dataset constructed from each client's local data, then refines them via a customized reweighting strategy (CRS) to achieve diverse objectives such as fairness and Byzantine robustness. It claims a linear convergence guarantee for L-smooth and strongly convex objectives under appropriate TPM-CRS pairs, supported by empirical results across settings and two multi-objective examples; code is released.

Significance. If the convergence result holds with the stated assumptions, the framework provides a flexible, objective-oriented mechanism for handling fairness and security in serverless DFL, extending beyond standard averaging. The explicit code release and multi-objective examples are strengths that support reproducibility and generality.

major comments (1)

[Abstract and theoretical analysis section] Abstract and theoretical analysis section: the linear convergence claim for general L-smooth and strongly convex functions relies on the TPM evaluated on the compact auxiliary dataset producing weights whose effect yields a contraction mapping. No quantitative bound is given on the deviation between these weights and those obtained from the full local data distribution, nor on the auxiliary dataset construction (size, sampling) needed to control the error under heterogeneity; this approximation error is load-bearing for the contraction and is not addressed in the proof sketch.

minor comments (2)

[Abstract] Abstract: 'L-smoothand' is missing a space.
[Method description] The description of how the compact auxiliary dataset is constructed from local data (e.g., sampling method, size relative to local dataset) is insufficiently detailed for readers to assess representativeness.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the constructive feedback on the theoretical analysis. We agree that the approximation error from the auxiliary dataset must be rigorously bounded to support the linear convergence claim and will revise the proof accordingly.

read point-by-point responses

Referee: [Abstract and theoretical analysis section] Abstract and theoretical analysis section: the linear convergence claim for general L-smooth and strongly convex functions relies on the TPM evaluated on the compact auxiliary dataset producing weights whose effect yields a contraction mapping. No quantitative bound is given on the deviation between these weights and those obtained from the full local data distribution, nor on the auxiliary dataset construction (size, sampling) needed to control the error under heterogeneity; this approximation error is load-bearing for the contraction and is not addressed in the proof sketch.

Authors: We agree that a quantitative bound on the approximation error is necessary for a complete proof. In the revised version, we will augment the theoretical analysis section with an explicit error bound between the TPM weights computed on the compact auxiliary dataset and those on the full local data distribution. Under the L-smooth and strongly convex assumptions, we will apply standard concentration inequalities (e.g., Hoeffding or Bernstein) to control the deviation as a function of auxiliary dataset size and heterogeneity level. The auxiliary dataset construction will be specified as uniform random sampling without replacement, with a minimum size requirement (scaling logarithmically with the number of clients and inversely with the desired error tolerance) to ensure the total error term remains smaller than the contraction factor, thereby preserving the linear convergence rate up to a controllable additive constant. This analysis will be added to the proof sketch and supported by a new lemma. revision: yes

Circularity Check

0 steps flagged

No circularity: theoretical convergence proof is independent of fitted inputs or self-referential definitions

full rationale

The paper's central claim is a theoretical proof that an appropriate TPM-CRS pair guarantees linear convergence for L-smooth strongly convex objectives. The abstract and description present this as a separate analysis with no equations shown that reduce the contraction mapping to a fitted parameter, self-citation chain, or definition by construction. The auxiliary dataset is stated as an assumption for constructing preliminary weights, but the convergence result is not shown to be forced by that construction or by renaming prior results. No load-bearing self-citation or ansatz smuggling is evident in the provided text.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on standard optimization assumptions for convergence and the practical construction of representative auxiliary datasets from local client data.

axioms (1)

domain assumption Objective functions are L-smooth and strongly convex
Invoked to guarantee linear convergence of the reweighted aggregation.

pith-pipeline@v0.9.0 · 5548 in / 1231 out tokens · 37112 ms · 2026-05-16T22:30:50.323401+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Theoretically, we prove that an appropriate TPM-CRS combination guarantees linear convergence for general L-smooth and strongly convex functions.
IndisputableMonolith/Foundation/RealityFromDistinction.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

DFedReweighting employs the objective-oriented reweighting aggregation defined in Equation (5)

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

59 extracted references · 59 canonical work pages · 5 internal anchors

[1]

Mda: Availability-aware federated learning client selection.arXiv preprint arXiv:2211.14391, 2022

Amin Eslami Abyane, Steve Drew, and Hadi Hemmati. Mda: Availability-aware federated learning client selection.arXiv preprint arXiv:2211.14391, 2022. 3

work page arXiv 2022
[2]

Federated learning under arbitrary communication patterns

Dmitrii Avdiukhin and Shiva Kasiviswanathan. Federated learning under arbitrary communication patterns. InInter- national Conference on Machine Learning, pages 425–435. PMLR, 2021. 5, 1

work page 2021
[3]

A lit- tle is enough: Circumventing defenses for distributed learn- ing.Advances in Neural Information Processing Systems, 32, 2019

Gilad Baruch, Moran Baruch, and Yoav Goldberg. A lit- tle is enough: Circumventing defenses for distributed learn- ing.Advances in Neural Information Processing Systems, 32, 2019. 8 Figure 3.Different TPM-CRS combinations lead to different final performance on Byzantine attack robustness (same problem).The first combination (TPM: Loss, CRS: Equation (17))...

work page 2019
[4]

Machine learning with adversaries: Byzantine tolerant gradient descent.Advances in Neural In- formation Processing Systems, 30, 2017

Peva Blanchard, El Mahdi El Mhamdi, Rachid Guerraoui, and Julien Stainer. Machine learning with adversaries: Byzantine tolerant gradient descent.Advances in Neural In- formation Processing Systems, 30, 2017. 8

work page 2017
[5]

Expanding the Reach of Federated Learning by Reducing Client Resource Requirements

Sebastian Caldas, Jakub Kone ˇcny, H Brendan McMahan, and Ameet Talwalkar. Expanding the reach of federated learning by reducing client resource requirements.arXiv preprint arXiv:1812.07210, 2018. 2

work page internal anchor Pith review Pith/arXiv arXiv 2018
[6]

Robust blockchained federated learning with model validation and proof-of-stake inspired consensus

Hang Chen, Syed Ali Asif, Jihong Park, Chien-Chung Shen, and Mehdi Bennis. Robust blockchained federated learning with model validation and proof-of-stake inspired consensus. arXiv preprint arXiv:2101.03300, 2021. 1

work page arXiv 2021
[7]

A fairness-aware peer-to-peer decentralized learning framework with heterogeneous devices.Future Internet, 14 (5):138, 2022

Zheyi Chen, Weixian Liao, Pu Tian, Qianlong Wang, and Wei Yu. A fairness-aware peer-to-peer decentralized learning framework with heterogeneous devices.Future Internet, 14 (5):138, 2022. 2

work page 2022
[8]

The mnist database of handwritten digit images for machine learning research.IEEE Signal Processing Maga- zine, 29(6):141–142, 2012

Li Deng. The mnist database of handwritten digit images for machine learning research.IEEE Signal Processing Maga- zine, 29(6):141–142, 2012. 6

work page 2012
[9]

Fairness-aware agnostic federated learning

Wei Du, Depeng Xu, Xintao Wu, and Hanghang Tong. Fairness-aware agnostic federated learning. InProceedings of the 2021 SIAM International Conference on Data Mining (SDM), pages 181–189. SIAM, 2021. 2

work page 2021
[10]

Ezzeldin, Shen Yan, Chaoyang He, Emilio Ferrara, and Salman Avestimehr

Yahya H. Ezzeldin, Shen Yan, Chaoyang He, Emilio Ferrara, and Salman Avestimehr. Fairfed: Enabling group fairness in federated learning.arXiv preprint arXiv:2110.00857, 2021. 2

work page arXiv 2021
[11]

Local model poisoning attacks to byzantine-robust federated learning

Minghong Fang, Xiaoyu Cao, Jinyuan Jia, and Neil Gong. Local model poisoning attacks to byzantine-robust federated learning. In29th USENIX Security Symposium (USENIX Se- curity 20), pages 1605–1622, 2020. 6

work page 2020
[12]

Byzantine- robust decentralized federated learning.arXiv preprint arXiv:2406.10416, 2024

Minghong Fang, Zifan Zhang, Prashant Khanduri, Song- tao Lu, Yuchen Liu, Neil Gong, et al. Byzantine- robust decentralized federated learning.arXiv preprint arXiv:2406.10416, 2024. 3, 8, 1

work page arXiv 2024
[13]

Fedgr: Federated learning with gravitation regulation for double imbalance distribu- tion

Songyue Guo, Xu Yang, Jiyuan Feng, Ye Ding, Wei Wang, Yunqing Feng, and Qing Liao. Fedgr: Federated learning with gravitation regulation for double imbalance distribu- tion. InInternational Conference on Database Systems for Advanced Applications, pages 703–718. Springer, 2023. 6

work page 2023
[14]

Central server free federated learning over single-sided trust social networks.arXiv preprint arXiv:1910.04956, 2019

Chaoyang He, Conghui Tan, Hanlin Tang, Shuang Qiu, and Ji Liu. Central server free federated learning over single-sided trust social networks.arXiv preprint arXiv:1910.04956, 2019. 1

work page arXiv 1910
[15]

Spreadgnn: Serverless multi-task federated learning for graph neural net- works.arXiv preprint arXiv:2106.02743, 2021

Chaoyang He, Emir Ceyani, Keshav Balasubramanian, Mu- rali Annavaram, and Salman Avestimehr. Spreadgnn: Serverless multi-task federated learning for graph neural net- works.arXiv preprint arXiv:2106.02743, 2021. 1

work page arXiv 2021
[16]

Measuring the Effects of Non-Identical Data Distribution for Federated Visual Classification

Tzu-Ming Harry Hsu, Hang Qi, and Matthew Brown. Mea- suring the effects of non-identical data distribution for feder- ated visual classification.arXiv preprint arXiv:1909.06335,

work page internal anchor Pith review arXiv 1909
[17]

Fedmgda+: Federated learning meets multi-objective optimization.arXiv preprint arXiv:2006.11489, 2020

Zeou Hu, Kiarash Shaloudegi, Guojun Zhang, and Yaoliang Yu. Fedmgda+: Federated learning meets multi-objective optimization.arXiv preprint arXiv:2006.11489, 2020. 2

work page arXiv 2006
[18]

Performance analysis of decentralized federated learning de- ployments, 2025

Chengyan Jiang, Jiamin Fan, Talal Halabi, and Israat Haque. Performance analysis of decentralized federated learning de- ployments, 2025. 5, 1

work page 2025
[19]

Association Press, 1975

Malcolm Shepherd Knowles.Self-directed learning: A guide for learners and teachers. Association Press, 1975. 2

work page 1975
[20]

Federated Learning: Strategies for Improving Communication Efficiency

Jakub Kone ˇcn`y, H Brendan McMahan, Felix X Yu, Peter Richt´arik, Ananda Theertha Suresh, and Dave Bacon. Fed- erated learning: Strategies for improving communication ef- ficiency.arXiv preprint arXiv:1610.05492, 2016. 2

work page internal anchor Pith review Pith/arXiv arXiv 2016
[21]

Benign overfitting in two-layer relu convolutional neural networks

Yiwen Kou, Zixiang Chen, Yuanzhou Chen, and Quanquan Gu. Benign overfitting in two-layer relu convolutional neural networks. InInternational conference on machine learning, pages 17615–17659. PMLR, 2023. 8

work page 2023
[22]

Fully decentralized federated learning

Anusha Lalitha, Shubhanshu Shekhar, Tara Javidi, and Fari- naz Koushanfar. Fully decentralized federated learning. InThird Workshop on Bayesian Deep Learning (NeurIPS),

work page
[23]

Peer-to-peer Federated Learning on Graphs

Anusha Lalitha, Osman Cihan Kilinc, Tara Javidi, and Fari- naz Koushanfar. Peer-to-peer federated learning on graphs. arXiv preprint arXiv:1901.11173, 2019. 1

work page internal anchor Pith review Pith/arXiv arXiv 1901
[24]

The byzantine generals problem

Leslie Lamport, Robert Shostak, and Marshall Pease. The byzantine generals problem. InConcurrency: the works of leslie lamport, pages 203–226. 2019. 8

work page 2019
[25]

Exploring the prac- ticality of federated learning: A survey towards the com- munication perspective.arXiv preprint arXiv:2405.20431,

Khiem Le, Nhan Luong-Ha, Manh Nguyen-Duc, Danh Le- Phuoc, Cuong Do, and Kok-Seng Wong. Exploring the prac- ticality of federated learning: A survey towards the com- munication perspective.arXiv preprint arXiv:2405.20431,

work page arXiv
[26]

pfedfair: Towards optimal group fairness-accuracy trade- off in heterogeneous federated learning.arXiv preprint arXiv:2503.14925, 2025

Haoyu Lei, Shizhan Gong, Qi Dou, and Farzan Farnia. pfedfair: Towards optimal group fairness-accuracy trade- off in heterogeneous federated learning.arXiv preprint arXiv:2503.14925, 2025. 2

work page arXiv 2025
[27]

Learning to detect malicious clients for robust feder- ated learning.arXiv preprint arXiv:2002.00211, 2020

Suyi Li, Yong Cheng, Wei Wang, Yang Liu, and Tianjian Chen. Learning to detect malicious clients for robust feder- ated learning.arXiv preprint arXiv:2002.00211, 2020. 2

work page arXiv 2002
[28]

Fair re- source allocation in federated learning,

Tian Li, Maziar Sanjabi, Ahmad Beirami, and Virginia Smith. Fair resource allocation in federated learning.arXiv preprint arXiv:1905.10497, 2019. 2, 6

work page arXiv 1905
[29]

Ditto: Fair and robust federated learning through per- sonalization

Tian Li, Shengyuan Hu, Ahmad Beirami, and Virginia Smith. Ditto: Fair and robust federated learning through per- sonalization. InInternational conference on machine learn- ing, pages 6357–6368. PMLR, 2021. 7

work page 2021
[30]

Convergence analysis of se- quential federated learning on heterogeneous data.Advances in Neural Information Processing Systems, 36:56700–56755,

Yipeng Li and Xinchen Lyu. Convergence analysis of se- quential federated learning on heterogeneous data.Advances in Neural Information Processing Systems, 36:56700–56755,

work page
[31]

Building a practically useful theory of goal setting and task motivation: A 35-year odyssey.American psychologist, 57(9):705, 2002

Edwin A Locke and Gary P Latham. Building a practically useful theory of goal setting and task motivation: A 35-year odyssey.American psychologist, 57(9):705, 2002. 2

work page 2002
[32]

Shieldfl: Mitigating model poisoning at- tacks in privacy-preserving federated learning.IEEE Trans- actions on Information Forensics and Security, 17:1639– 1654, 2022

Zhuoran Ma, Jianfeng Ma, Yinbin Miao, Yingjiu Li, and Robert H Deng. Shieldfl: Mitigating model poisoning at- tacks in privacy-preserving federated learning.IEEE Trans- actions on Information Forensics and Security, 17:1639– 1654, 2022. 5

work page 2022
[33]

Communication- efficient learning of deep networks from decentralized data

Brendan McMahan, Eider Moore, Daniel Ramage, Seth Hampson, and Blaise Aguera y Arcas. Communication- efficient learning of deep networks from decentralized data. InArtificial Intelligence and Statistics, pages 1273–1282. PMLR, 2017. 1, 2, 6

work page 2017
[34]

Agnostic federated learning

Mehryar Mohri, Gary Sivek, and Ananda Theertha Suresh. Agnostic federated learning. InProceedings of the 36th In- ternational Conference on Machine Learning, pages 4615– 4625, 2019. 2

work page 2019
[35]

A survey on security and privacy of federated learning.Future Gener- ation Computer Systems, 115:619–640, 2021

Viraaji Mothukuri, Reza M Parizi, Seyedamin Pouriyeh, Yan Huang, Ali Dehghantanha, and Gautam Srivastava. A survey on security and privacy of federated learning.Future Gener- ation Computer Systems, 115:619–640, 2021. 3

work page 2021
[36]

In31st USENIX Security Symposium (USENIX Se- curity 22), pages 1415–1432, 2022

Thien Duc Nguyen, Phillip Rieger, Roberta De Viti, Huili Chen, Bj ¨orn B Brandenburg, Hossein Yalame, Helen M¨ollering, Hossein Fereidooni, Samuel Marchal, Markus Miettinen, et al.{FLAME}: Taming backdoors in federated learning. In31st USENIX Security Symposium (USENIX Se- curity 22), pages 1415–1432, 2022. 8

work page 2022
[37]

Byzantine-robust decen- tralized stochastic optimization over static and time-varying networks.Signal Processing, 183:108020, 2021

Jie Peng, Weiyu Li, and Qing Ling. Byzantine-robust decen- tralized stochastic optimization over static and time-varying networks.Signal Processing, 183:108020, 2021. 5

work page 2021
[38]

Reducing communication in federated learning via efficient client sampling.Pattern Recognition, 148:110122, 2024

M ´onica Ribero and Haris Vikalo. Reducing communication in federated learning via efficient client sampling.Pattern Recognition, 148:110122, 2024. 2

work page 2024
[39]

Model compres- sion for communication efficient federated learning.IEEE Transactions on Neural Networks and Learning Systems, 34 (9):5937–5951, 2021

Suhail Mohmad Shah and Vincent KN Lau. Model compres- sion for communication efficient federated learning.IEEE Transactions on Neural Networks and Learning Systems, 34 (9):5937–5951, 2021. 2

work page 2021
[40]

Privacy-preserving deep learning

Reza Shokri and Vitaly Shmatikov. Privacy-preserving deep learning. InProceedings of the 22nd ACM SIGSAC Con- ference on Computer and Communications Security, pages 1310–1321, 2015. 1

work page 2015
[41]

Decentralized feder- ated averaging.IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(4):4289–4301, 2022

Tao Sun, Dongsheng Li, and Bao Wang. Decentralized feder- ated averaging.IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(4):4289–4301, 2022. 3

work page 2022
[42]

Cognitive load during problem solving: Effects on learning.Cognitive science, 12(2):257–285, 1998

John Sweller, Jeroen JG van Merri ¨enboer, and Fred GWC Paas. Cognitive load during problem solving: Effects on learning.Cognitive science, 12(2):257–285, 1998. 2

work page 1998
[43]

Federated learning with fair averaging.arXiv preprint arXiv:2104.14937, 2021

Zheng Wang, Xiaoliang Fan, Jianzhong Qi, Chenglu Wen, Cheng Wang, and Rongshan Yu. Federated learning with fair averaging.arXiv preprint arXiv:2104.14937, 2021. 2

work page arXiv 2021
[44]

Communication-efficient federated learning via knowledge distillation.Nature communications, 13(1): 2032, 2022

Chuhan Wu, Fangzhao Wu, Lingjuan Lyu, Yongfeng Huang, and Xing Xie. Communication-efficient federated learning via knowledge distillation.Nature communications, 13(1): 2032, 2022. 2

work page 2032
[45]

Byzantine- resilient decentralized stochastic optimization with robust aggregation rules.IEEE transactions on signal processing, 71:3179–3195, 2023

Zhaoxian Wu, Tianyi Chen, and Qing Ling. Byzantine- resilient decentralized stochastic optimization with robust aggregation rules.IEEE transactions on signal processing, 71:3179–3195, 2023. 3, 5

work page 2023
[46]

Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms

Han Xiao, Kashif Rasul, and Roland V ollgraf. Fashion- mnist: a novel image dataset for benchmarking machine learning algorithms.arXiv preprint arXiv:1708.07747, 2017. 6

work page internal anchor Pith review Pith/arXiv arXiv 2017
[47]

Federated learning over wireless device-to-device networks: Algo- rithms and convergence analysis.IEEE Journal on Selected Areas in Communications, 39(12):3723–3741, 2021

Hong Xing, Osvaldo Simeone, and Suzhi Bi. Federated learning over wireless device-to-device networks: Algo- rithms and convergence analysis.IEEE Journal on Selected Areas in Communications, 39(12):3723–3741, 2021. 1

work page 2021
[48]

Byzantine-robust decentral- ized learning via remove-then-clip aggregation

Caiyi Yang and Javad Ghaderi. Byzantine-robust decentral- ized learning via remove-then-clip aggregation. InProceed- ings of the AAAI Conference on Artificial Intelligence, pages 21735–21743, 2024. 1

work page 2024
[49]

Decentralized fed- erated learning with unreliable communications.IEEE Jour- nal of Selected Topics in Signal Processing, 16(3):487–500,

Hao Ye, Le Liang, and Geoffrey Ye Li. Decentralized fed- erated learning with unreliable communications.IEEE Jour- nal of Selected Topics in Signal Processing, 16(3):487–500,

work page
[50]

On the tradeoff be- tween privacy preservation and byzantine-robustness in de- centralized learning

Haoxiang Ye, Heng Zhu, and Qing Ling. On the tradeoff be- tween privacy preservation and byzantine-robustness in de- centralized learning. InICASSP 2024-2024 IEEE Interna- tional Conference on Acoustics, Speech and Signal Process- ing (ICASSP), pages 9336–9340. IEEE, 2024. 6, 8, 1

work page 2024
[51]

Byzantine-robust distributed learning: Towards op- timal statistical rates

Dong Yin, Yudong Chen, Ramchandran Kannan, and Peter Bartlett. Byzantine-robust distributed learning: Towards op- timal statistical rates. InInternational Conference on Ma- chine Learning, pages 5650–5659. Pmlr, 2018. 8

work page 2018
[52]

Decentralized federated learning: A survey and perspective.IEEE Internet of Things Journal,

Liangqi Yuan, Ziran Wang, Lichao Sun, Philip S Yu, and Christopher G Brinton. Decentralized federated learning: A survey and perspective.IEEE Internet of Things Journal,

work page
[53]

Gifair-fl: A framework for group and individual fairness in federated learning.INFORMS Journal on Data Science, 2(1):10–23,

Xubo Yue, Maher Nouiehed, and Raed Al Kontar. Gifair-fl: A framework for group and individual fairness in federated learning.INFORMS Journal on Data Science, 2(1):10–23,

work page
[54]

Byzantine- robust decentralized federated learning via local perfor- mance checking

Kaichuang Zhang, Alina Basharat, and Ping Xu. Byzantine- robust decentralized federated learning via local perfor- mance checking. In31st International Conference on Neural Information Processing, 2024. 1, 3, 6

work page 2024
[55]

Distributional and byzantine robust decentralized federated learning

Kaichuang Zhang, Ping Xu, and Zhi Tian. Distributional and byzantine robust decentralized federated learning. In2025 59th Annual Conference on Information Sciences and Sys- tems (CISS), pages 1–6. IEEE, 2025. 1

work page 2025
[56]

A dynamic reweighting strat- egy for fair federated learning

Zhiyuan Zhao and Gauri Joshi. A dynamic reweighting strat- egy for fair federated learning. InICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 8772–8776. IEEE, 2022. 2 DFedReweighting: A Unified Framework for Objective-Oriented Reweighting in Decentralized Federated Learning Supplementary Material

work page 2022
[57]

Assumptions and Remarks Beyond Assumptions 3 and 4 in the main text, we introduce additional assumptions for our convergence analysis

Convergence Analysis 8.1. Assumptions and Remarks Beyond Assumptions 3 and 4 in the main text, we introduce additional assumptions for our convergence analysis. Assumption 3G B is connected.[12, 48, 50] The subgraph of benign nodes should be connected, meaning that there should exist a path between any two benign nodes allowing them to reach each other. A...

work page
[58]

For each candidatew a, com- pute its distances to all other candidates and letS a be the multiset of the smallestn−f−2distances, wheren=|N i| andfis the Byzantine tolerance parameter. The Krum score ofw a is score(w t+ 1 2 a ) = X d∈Sa d.(41) Krum selects the candidate with the minimal score and uses it as the aggregated model: wt+1 i = arg min w t+ 1 2a ...

work page
[59]

Given a smoothing parameterβ >0, it then defines the unnormalized weight uj = 1 dj +β , the normalized weightp j = ujP k uk , and finally updates its model by wt+1 i = X j∈Ni pjw t+ 1 2 j .(45) In our experiments, the default value ofβis1.0

work page

[1] [1]

Mda: Availability-aware federated learning client selection.arXiv preprint arXiv:2211.14391, 2022

Amin Eslami Abyane, Steve Drew, and Hadi Hemmati. Mda: Availability-aware federated learning client selection.arXiv preprint arXiv:2211.14391, 2022. 3

work page arXiv 2022

[2] [2]

Federated learning under arbitrary communication patterns

Dmitrii Avdiukhin and Shiva Kasiviswanathan. Federated learning under arbitrary communication patterns. InInter- national Conference on Machine Learning, pages 425–435. PMLR, 2021. 5, 1

work page 2021

[3] [3]

A lit- tle is enough: Circumventing defenses for distributed learn- ing.Advances in Neural Information Processing Systems, 32, 2019

Gilad Baruch, Moran Baruch, and Yoav Goldberg. A lit- tle is enough: Circumventing defenses for distributed learn- ing.Advances in Neural Information Processing Systems, 32, 2019. 8 Figure 3.Different TPM-CRS combinations lead to different final performance on Byzantine attack robustness (same problem).The first combination (TPM: Loss, CRS: Equation (17))...

work page 2019

[4] [4]

Machine learning with adversaries: Byzantine tolerant gradient descent.Advances in Neural In- formation Processing Systems, 30, 2017

Peva Blanchard, El Mahdi El Mhamdi, Rachid Guerraoui, and Julien Stainer. Machine learning with adversaries: Byzantine tolerant gradient descent.Advances in Neural In- formation Processing Systems, 30, 2017. 8

work page 2017

[5] [5]

Expanding the Reach of Federated Learning by Reducing Client Resource Requirements

Sebastian Caldas, Jakub Kone ˇcny, H Brendan McMahan, and Ameet Talwalkar. Expanding the reach of federated learning by reducing client resource requirements.arXiv preprint arXiv:1812.07210, 2018. 2

work page internal anchor Pith review Pith/arXiv arXiv 2018

[6] [6]

Robust blockchained federated learning with model validation and proof-of-stake inspired consensus

Hang Chen, Syed Ali Asif, Jihong Park, Chien-Chung Shen, and Mehdi Bennis. Robust blockchained federated learning with model validation and proof-of-stake inspired consensus. arXiv preprint arXiv:2101.03300, 2021. 1

work page arXiv 2021

[7] [7]

A fairness-aware peer-to-peer decentralized learning framework with heterogeneous devices.Future Internet, 14 (5):138, 2022

Zheyi Chen, Weixian Liao, Pu Tian, Qianlong Wang, and Wei Yu. A fairness-aware peer-to-peer decentralized learning framework with heterogeneous devices.Future Internet, 14 (5):138, 2022. 2

work page 2022

[8] [8]

The mnist database of handwritten digit images for machine learning research.IEEE Signal Processing Maga- zine, 29(6):141–142, 2012

Li Deng. The mnist database of handwritten digit images for machine learning research.IEEE Signal Processing Maga- zine, 29(6):141–142, 2012. 6

work page 2012

[9] [9]

Fairness-aware agnostic federated learning

Wei Du, Depeng Xu, Xintao Wu, and Hanghang Tong. Fairness-aware agnostic federated learning. InProceedings of the 2021 SIAM International Conference on Data Mining (SDM), pages 181–189. SIAM, 2021. 2

work page 2021

[10] [10]

Ezzeldin, Shen Yan, Chaoyang He, Emilio Ferrara, and Salman Avestimehr

Yahya H. Ezzeldin, Shen Yan, Chaoyang He, Emilio Ferrara, and Salman Avestimehr. Fairfed: Enabling group fairness in federated learning.arXiv preprint arXiv:2110.00857, 2021. 2

work page arXiv 2021

[11] [11]

Local model poisoning attacks to byzantine-robust federated learning

Minghong Fang, Xiaoyu Cao, Jinyuan Jia, and Neil Gong. Local model poisoning attacks to byzantine-robust federated learning. In29th USENIX Security Symposium (USENIX Se- curity 20), pages 1605–1622, 2020. 6

work page 2020

[12] [12]

Byzantine- robust decentralized federated learning.arXiv preprint arXiv:2406.10416, 2024

Minghong Fang, Zifan Zhang, Prashant Khanduri, Song- tao Lu, Yuchen Liu, Neil Gong, et al. Byzantine- robust decentralized federated learning.arXiv preprint arXiv:2406.10416, 2024. 3, 8, 1

work page arXiv 2024

[13] [13]

Fedgr: Federated learning with gravitation regulation for double imbalance distribu- tion

Songyue Guo, Xu Yang, Jiyuan Feng, Ye Ding, Wei Wang, Yunqing Feng, and Qing Liao. Fedgr: Federated learning with gravitation regulation for double imbalance distribu- tion. InInternational Conference on Database Systems for Advanced Applications, pages 703–718. Springer, 2023. 6

work page 2023

[14] [14]

Central server free federated learning over single-sided trust social networks.arXiv preprint arXiv:1910.04956, 2019

Chaoyang He, Conghui Tan, Hanlin Tang, Shuang Qiu, and Ji Liu. Central server free federated learning over single-sided trust social networks.arXiv preprint arXiv:1910.04956, 2019. 1

work page arXiv 1910

[15] [15]

Spreadgnn: Serverless multi-task federated learning for graph neural net- works.arXiv preprint arXiv:2106.02743, 2021

Chaoyang He, Emir Ceyani, Keshav Balasubramanian, Mu- rali Annavaram, and Salman Avestimehr. Spreadgnn: Serverless multi-task federated learning for graph neural net- works.arXiv preprint arXiv:2106.02743, 2021. 1

work page arXiv 2021

[16] [16]

Measuring the Effects of Non-Identical Data Distribution for Federated Visual Classification

Tzu-Ming Harry Hsu, Hang Qi, and Matthew Brown. Mea- suring the effects of non-identical data distribution for feder- ated visual classification.arXiv preprint arXiv:1909.06335,

work page internal anchor Pith review arXiv 1909

[17] [17]

Fedmgda+: Federated learning meets multi-objective optimization.arXiv preprint arXiv:2006.11489, 2020

Zeou Hu, Kiarash Shaloudegi, Guojun Zhang, and Yaoliang Yu. Fedmgda+: Federated learning meets multi-objective optimization.arXiv preprint arXiv:2006.11489, 2020. 2

work page arXiv 2006

[18] [18]

Performance analysis of decentralized federated learning de- ployments, 2025

Chengyan Jiang, Jiamin Fan, Talal Halabi, and Israat Haque. Performance analysis of decentralized federated learning de- ployments, 2025. 5, 1

work page 2025

[19] [19]

Association Press, 1975

Malcolm Shepherd Knowles.Self-directed learning: A guide for learners and teachers. Association Press, 1975. 2

work page 1975

[20] [20]

Federated Learning: Strategies for Improving Communication Efficiency

Jakub Kone ˇcn`y, H Brendan McMahan, Felix X Yu, Peter Richt´arik, Ananda Theertha Suresh, and Dave Bacon. Fed- erated learning: Strategies for improving communication ef- ficiency.arXiv preprint arXiv:1610.05492, 2016. 2

work page internal anchor Pith review Pith/arXiv arXiv 2016

[21] [21]

Benign overfitting in two-layer relu convolutional neural networks

Yiwen Kou, Zixiang Chen, Yuanzhou Chen, and Quanquan Gu. Benign overfitting in two-layer relu convolutional neural networks. InInternational conference on machine learning, pages 17615–17659. PMLR, 2023. 8

work page 2023

[22] [22]

Fully decentralized federated learning

Anusha Lalitha, Shubhanshu Shekhar, Tara Javidi, and Fari- naz Koushanfar. Fully decentralized federated learning. InThird Workshop on Bayesian Deep Learning (NeurIPS),

work page

[23] [23]

Peer-to-peer Federated Learning on Graphs

Anusha Lalitha, Osman Cihan Kilinc, Tara Javidi, and Fari- naz Koushanfar. Peer-to-peer federated learning on graphs. arXiv preprint arXiv:1901.11173, 2019. 1

work page internal anchor Pith review Pith/arXiv arXiv 1901

[24] [24]

The byzantine generals problem

Leslie Lamport, Robert Shostak, and Marshall Pease. The byzantine generals problem. InConcurrency: the works of leslie lamport, pages 203–226. 2019. 8

work page 2019

[25] [25]

Exploring the prac- ticality of federated learning: A survey towards the com- munication perspective.arXiv preprint arXiv:2405.20431,

Khiem Le, Nhan Luong-Ha, Manh Nguyen-Duc, Danh Le- Phuoc, Cuong Do, and Kok-Seng Wong. Exploring the prac- ticality of federated learning: A survey towards the com- munication perspective.arXiv preprint arXiv:2405.20431,

work page arXiv

[26] [26]

pfedfair: Towards optimal group fairness-accuracy trade- off in heterogeneous federated learning.arXiv preprint arXiv:2503.14925, 2025

Haoyu Lei, Shizhan Gong, Qi Dou, and Farzan Farnia. pfedfair: Towards optimal group fairness-accuracy trade- off in heterogeneous federated learning.arXiv preprint arXiv:2503.14925, 2025. 2

work page arXiv 2025

[27] [27]

Learning to detect malicious clients for robust feder- ated learning.arXiv preprint arXiv:2002.00211, 2020

Suyi Li, Yong Cheng, Wei Wang, Yang Liu, and Tianjian Chen. Learning to detect malicious clients for robust feder- ated learning.arXiv preprint arXiv:2002.00211, 2020. 2

work page arXiv 2002

[28] [28]

Fair re- source allocation in federated learning,

Tian Li, Maziar Sanjabi, Ahmad Beirami, and Virginia Smith. Fair resource allocation in federated learning.arXiv preprint arXiv:1905.10497, 2019. 2, 6

work page arXiv 1905

[29] [29]

Ditto: Fair and robust federated learning through per- sonalization

Tian Li, Shengyuan Hu, Ahmad Beirami, and Virginia Smith. Ditto: Fair and robust federated learning through per- sonalization. InInternational conference on machine learn- ing, pages 6357–6368. PMLR, 2021. 7

work page 2021

[30] [30]

Convergence analysis of se- quential federated learning on heterogeneous data.Advances in Neural Information Processing Systems, 36:56700–56755,

Yipeng Li and Xinchen Lyu. Convergence analysis of se- quential federated learning on heterogeneous data.Advances in Neural Information Processing Systems, 36:56700–56755,

work page

[31] [31]

Building a practically useful theory of goal setting and task motivation: A 35-year odyssey.American psychologist, 57(9):705, 2002

Edwin A Locke and Gary P Latham. Building a practically useful theory of goal setting and task motivation: A 35-year odyssey.American psychologist, 57(9):705, 2002. 2

work page 2002

[32] [32]

Shieldfl: Mitigating model poisoning at- tacks in privacy-preserving federated learning.IEEE Trans- actions on Information Forensics and Security, 17:1639– 1654, 2022

Zhuoran Ma, Jianfeng Ma, Yinbin Miao, Yingjiu Li, and Robert H Deng. Shieldfl: Mitigating model poisoning at- tacks in privacy-preserving federated learning.IEEE Trans- actions on Information Forensics and Security, 17:1639– 1654, 2022. 5

work page 2022

[33] [33]

Communication- efficient learning of deep networks from decentralized data

Brendan McMahan, Eider Moore, Daniel Ramage, Seth Hampson, and Blaise Aguera y Arcas. Communication- efficient learning of deep networks from decentralized data. InArtificial Intelligence and Statistics, pages 1273–1282. PMLR, 2017. 1, 2, 6

work page 2017

[34] [34]

Agnostic federated learning

Mehryar Mohri, Gary Sivek, and Ananda Theertha Suresh. Agnostic federated learning. InProceedings of the 36th In- ternational Conference on Machine Learning, pages 4615– 4625, 2019. 2

work page 2019

[35] [35]

A survey on security and privacy of federated learning.Future Gener- ation Computer Systems, 115:619–640, 2021

Viraaji Mothukuri, Reza M Parizi, Seyedamin Pouriyeh, Yan Huang, Ali Dehghantanha, and Gautam Srivastava. A survey on security and privacy of federated learning.Future Gener- ation Computer Systems, 115:619–640, 2021. 3

work page 2021

[36] [36]

In31st USENIX Security Symposium (USENIX Se- curity 22), pages 1415–1432, 2022

Thien Duc Nguyen, Phillip Rieger, Roberta De Viti, Huili Chen, Bj ¨orn B Brandenburg, Hossein Yalame, Helen M¨ollering, Hossein Fereidooni, Samuel Marchal, Markus Miettinen, et al.{FLAME}: Taming backdoors in federated learning. In31st USENIX Security Symposium (USENIX Se- curity 22), pages 1415–1432, 2022. 8

work page 2022

[37] [37]

Byzantine-robust decen- tralized stochastic optimization over static and time-varying networks.Signal Processing, 183:108020, 2021

Jie Peng, Weiyu Li, and Qing Ling. Byzantine-robust decen- tralized stochastic optimization over static and time-varying networks.Signal Processing, 183:108020, 2021. 5

work page 2021

[38] [38]

Reducing communication in federated learning via efficient client sampling.Pattern Recognition, 148:110122, 2024

M ´onica Ribero and Haris Vikalo. Reducing communication in federated learning via efficient client sampling.Pattern Recognition, 148:110122, 2024. 2

work page 2024

[39] [39]

Model compres- sion for communication efficient federated learning.IEEE Transactions on Neural Networks and Learning Systems, 34 (9):5937–5951, 2021

Suhail Mohmad Shah and Vincent KN Lau. Model compres- sion for communication efficient federated learning.IEEE Transactions on Neural Networks and Learning Systems, 34 (9):5937–5951, 2021. 2

work page 2021

[40] [40]

Privacy-preserving deep learning

Reza Shokri and Vitaly Shmatikov. Privacy-preserving deep learning. InProceedings of the 22nd ACM SIGSAC Con- ference on Computer and Communications Security, pages 1310–1321, 2015. 1

work page 2015

[41] [41]

Decentralized feder- ated averaging.IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(4):4289–4301, 2022

Tao Sun, Dongsheng Li, and Bao Wang. Decentralized feder- ated averaging.IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(4):4289–4301, 2022. 3

work page 2022

[42] [42]

Cognitive load during problem solving: Effects on learning.Cognitive science, 12(2):257–285, 1998

John Sweller, Jeroen JG van Merri ¨enboer, and Fred GWC Paas. Cognitive load during problem solving: Effects on learning.Cognitive science, 12(2):257–285, 1998. 2

work page 1998

[43] [43]

Federated learning with fair averaging.arXiv preprint arXiv:2104.14937, 2021

Zheng Wang, Xiaoliang Fan, Jianzhong Qi, Chenglu Wen, Cheng Wang, and Rongshan Yu. Federated learning with fair averaging.arXiv preprint arXiv:2104.14937, 2021. 2

work page arXiv 2021

[44] [44]

Communication-efficient federated learning via knowledge distillation.Nature communications, 13(1): 2032, 2022

Chuhan Wu, Fangzhao Wu, Lingjuan Lyu, Yongfeng Huang, and Xing Xie. Communication-efficient federated learning via knowledge distillation.Nature communications, 13(1): 2032, 2022. 2

work page 2032

[45] [45]

Byzantine- resilient decentralized stochastic optimization with robust aggregation rules.IEEE transactions on signal processing, 71:3179–3195, 2023

Zhaoxian Wu, Tianyi Chen, and Qing Ling. Byzantine- resilient decentralized stochastic optimization with robust aggregation rules.IEEE transactions on signal processing, 71:3179–3195, 2023. 3, 5

work page 2023

[46] [46]

Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms

Han Xiao, Kashif Rasul, and Roland V ollgraf. Fashion- mnist: a novel image dataset for benchmarking machine learning algorithms.arXiv preprint arXiv:1708.07747, 2017. 6

work page internal anchor Pith review Pith/arXiv arXiv 2017

[47] [47]

Federated learning over wireless device-to-device networks: Algo- rithms and convergence analysis.IEEE Journal on Selected Areas in Communications, 39(12):3723–3741, 2021

Hong Xing, Osvaldo Simeone, and Suzhi Bi. Federated learning over wireless device-to-device networks: Algo- rithms and convergence analysis.IEEE Journal on Selected Areas in Communications, 39(12):3723–3741, 2021. 1

work page 2021

[48] [48]

Byzantine-robust decentral- ized learning via remove-then-clip aggregation

Caiyi Yang and Javad Ghaderi. Byzantine-robust decentral- ized learning via remove-then-clip aggregation. InProceed- ings of the AAAI Conference on Artificial Intelligence, pages 21735–21743, 2024. 1

work page 2024

[49] [49]

Decentralized fed- erated learning with unreliable communications.IEEE Jour- nal of Selected Topics in Signal Processing, 16(3):487–500,

Hao Ye, Le Liang, and Geoffrey Ye Li. Decentralized fed- erated learning with unreliable communications.IEEE Jour- nal of Selected Topics in Signal Processing, 16(3):487–500,

work page

[50] [50]

On the tradeoff be- tween privacy preservation and byzantine-robustness in de- centralized learning

Haoxiang Ye, Heng Zhu, and Qing Ling. On the tradeoff be- tween privacy preservation and byzantine-robustness in de- centralized learning. InICASSP 2024-2024 IEEE Interna- tional Conference on Acoustics, Speech and Signal Process- ing (ICASSP), pages 9336–9340. IEEE, 2024. 6, 8, 1

work page 2024

[51] [51]

Byzantine-robust distributed learning: Towards op- timal statistical rates

Dong Yin, Yudong Chen, Ramchandran Kannan, and Peter Bartlett. Byzantine-robust distributed learning: Towards op- timal statistical rates. InInternational Conference on Ma- chine Learning, pages 5650–5659. Pmlr, 2018. 8

work page 2018

[52] [52]

Decentralized federated learning: A survey and perspective.IEEE Internet of Things Journal,

Liangqi Yuan, Ziran Wang, Lichao Sun, Philip S Yu, and Christopher G Brinton. Decentralized federated learning: A survey and perspective.IEEE Internet of Things Journal,

work page

[53] [53]

Gifair-fl: A framework for group and individual fairness in federated learning.INFORMS Journal on Data Science, 2(1):10–23,

Xubo Yue, Maher Nouiehed, and Raed Al Kontar. Gifair-fl: A framework for group and individual fairness in federated learning.INFORMS Journal on Data Science, 2(1):10–23,

work page

[54] [54]

Byzantine- robust decentralized federated learning via local perfor- mance checking

Kaichuang Zhang, Alina Basharat, and Ping Xu. Byzantine- robust decentralized federated learning via local perfor- mance checking. In31st International Conference on Neural Information Processing, 2024. 1, 3, 6

work page 2024

[55] [55]

Distributional and byzantine robust decentralized federated learning

Kaichuang Zhang, Ping Xu, and Zhi Tian. Distributional and byzantine robust decentralized federated learning. In2025 59th Annual Conference on Information Sciences and Sys- tems (CISS), pages 1–6. IEEE, 2025. 1

work page 2025

[56] [56]

A dynamic reweighting strat- egy for fair federated learning

Zhiyuan Zhao and Gauri Joshi. A dynamic reweighting strat- egy for fair federated learning. InICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 8772–8776. IEEE, 2022. 2 DFedReweighting: A Unified Framework for Objective-Oriented Reweighting in Decentralized Federated Learning Supplementary Material

work page 2022

[57] [57]

Assumptions and Remarks Beyond Assumptions 3 and 4 in the main text, we introduce additional assumptions for our convergence analysis

Convergence Analysis 8.1. Assumptions and Remarks Beyond Assumptions 3 and 4 in the main text, we introduce additional assumptions for our convergence analysis. Assumption 3G B is connected.[12, 48, 50] The subgraph of benign nodes should be connected, meaning that there should exist a path between any two benign nodes allowing them to reach each other. A...

work page

[58] [58]

For each candidatew a, com- pute its distances to all other candidates and letS a be the multiset of the smallestn−f−2distances, wheren=|N i| andfis the Byzantine tolerance parameter. The Krum score ofw a is score(w t+ 1 2 a ) = X d∈Sa d.(41) Krum selects the candidate with the minimal score and uses it as the aggregated model: wt+1 i = arg min w t+ 1 2a ...

work page

[59] [59]

Given a smoothing parameterβ >0, it then defines the unnormalized weight uj = 1 dj +β , the normalized weightp j = ujP k uk , and finally updates its model by wt+1 i = X j∈Ni pjw t+ 1 2 j .(45) In our experiments, the default value ofβis1.0

work page