hub Canonical reference

Federated Learning with Non-IID Data

Yue Zhao, Meng Li, Liangzhen Lai, Naveen Suda, Damon Civin, Vikas Chandra · 2018 · cs.LG · arXiv 1806.00582

Canonical reference. 75% of citing Pith papers cite this work as background.

33 Pith papers citing it

Background 75% of classified citations

open full Pith review browse 33 citing papers arXiv PDF

abstract

Federated learning enables resource-constrained edge compute devices, such as mobile phones and IoT devices, to learn a shared model for prediction, while keeping the training data local. This decentralized approach to train models provides privacy, security, regulatory and economic benefits. In this work, we focus on the statistical challenge of federated learning when local data is non-IID. We first show that the accuracy of federated learning reduces significantly, by up to 55% for neural networks trained for highly skewed non-IID data, where each client device trains only on a single class of data. We further show that this accuracy reduction can be explained by the weight divergence, which can be quantified by the earth mover's distance (EMD) between the distribution over classes on each device and the population distribution. As a solution, we propose a strategy to improve training on non-IID data by creating a small subset of data which is globally shared between all the edge devices. Experiments show that accuracy can be increased by 30% for the CIFAR-10 dataset with only 5% globally shared data.

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 7 other 1

citation-polarity summary

background 6 support 1 unclear 1

representative citing papers

LOSCAR-SGD: Local SGD with Communication-Computation Overlap and Delay-Corrected Sparse Model Averaging

cs.LG · 2026-05-20 · unverdicted · novelty 7.0

LOSCAR-SGD combines local updates, sparse model averaging, and communication-computation overlap with a delay-corrected merge rule, providing convergence rates for smooth non-convex objectives under worker heterogeneity.

Ringmaster LMO: Asynchronous Linear Minimization Oracle Momentum Method

cs.LG · 2026-05-18 · unverdicted · novelty 7.0

Ringmaster LMO extends delay-thresholding from ASGD to LMO-based momentum updates, providing convergence guarantees under (L0, L1)-smoothness and time-complexity bounds that recover optimal rates in the Euclidean case.

Byzantine-Robust Distributed SGD: A Unified Analysis and Tight Error Bounds

math.OC · 2026-04-11 · unverdicted · novelty 7.0

Unified convergence rates and tight lower bounds for Byzantine-robust distributed SGD under stochasticity and general data heterogeneity, showing local momentum reduces stochastic error floors.

Bandwidth Allocation with Device Partitioning for Federated Learning over Industrial IoT networks

cs.LG · 2026-05-29 · unverdicted · novelty 6.0

A device-partitioning bandwidth allocation policy for federated learning over IIoT networks that provably reduces total training time compared to any non-partitioning scheme.

On the Fragility of Data Attribution When Learning Is Distributed

cs.LG · 2026-05-15 · unverdicted · novelty 6.0

A single adversary in distributed training inflates its attribution value via latent optimization on synthetic batches without degrading accuracy or triggering basic defenses.

FedVSSAM: Mitigating Flatness Incompatibility in Sharpness-Aware Federated Learning

cs.LG · 2026-05-09 · unverdicted · novelty 6.0

FedVSSAM mitigates flatness incompatibility in SAM-based federated learning by consistently using a variance-suppressed adjusted direction for local perturbation, descent, and global updates, with non-convex convergence guarantees.

ForgeVLA: Federated Vision-Language-Action Learning without Language Annotations

cs.CV · 2026-05-08 · unverdicted · novelty 6.0

ForgeVLA enables federated VLA model training from unlabeled vision-action pairs by recovering language via embodied classifiers and using contrastive planning plus adaptive aggregation to avoid feature collapse.

Robust Synchronisation for Federated Learning in The Face of Correlated Device Failure

cs.DC · 2026-04-17 · unverdicted · novelty 6.0

AW-PSP dynamically weights node sampling by real-time availability predictions and failure correlations to improve robustness, label coverage, and fairness in federated learning under correlated device failures.

HierFedCEA: Hierarchical Federated Edge Learning for Privacy-Preserving Climate Control Optimization Across Heterogeneous Controlled Environment Agriculture Facilities

eess.SY · 2026-04-15 · unverdicted · novelty 6.0

HierFedCEA delivers a hierarchical federated learning framework for privacy-preserving climate control optimization across heterogeneous CEA facilities, reaching 94% of centralized performance with under 1 MB communication.

Client-Conditional Federated Learning via Local Training Data Statistics

cs.LG · 2026-03-11 · unverdicted · novelty 6.0

Conditioning a global FL model on local PCA statistics of client data matches oracle cluster performance across heterogeneous settings and is robust to sparse data with zero added communication.

Practical Quantum Federated Learning for Privacy-Sensitive Healthcare: Communication Efficiency and Noise Resilience

quant-ph · 2026-03-04 · unverdicted · novelty 6.0

Hybrid QFL cuts quantum transmissions from 3TNMP to {3t + 2(T-t)}NMP over T rounds while preserving near-centralized convergence and improving depolarizing-noise resilience via decentralized aggregation and Steane-code QEC.

Fed-Listing: Federated Label Distribution Inference in Graph Neural Networks

cs.LG · 2026-01-30 · unverdicted · novelty 6.0

Fed-Listing infers client label proportions in FedGNNs from final-layer gradients, outperforming baselines on four datasets and three architectures even in non-i.i.d. settings.

Task-agnostic Low-rank Residual Adaptation for Efficient Federated Continual Fine-Tuning

cs.LG · 2025-05-18 · unverdicted · novelty 6.0

Fed-TaLoRA uses task-agnostic low-rank residual adaptation with post-aggregation calibration to enable efficient federated continual fine-tuning across sequential tasks under non-IID conditions.

SP-CACW: Convergence-Aware Client Weighting for Selfish Personalized Learning

cs.LG · 2026-06-28 · unverdicted · novelty 5.0

SP-CACW is a convergence-aware client weighting scheme for selfish personalized federated learning that minimizes an upper bound on the target client's convergence error and can zero out harmful peers.

FedMPT: Federated Multi-label Prompt Tuning of Vision-Language Models

cs.AI · 2026-05-27 · unverdicted · novelty 5.0

FedMPT applies causal modeling and LLM-driven condition prompts with optimal transport and gating to perform federated multi-label prompt tuning of VLMs, claiming competitive results on benchmarks.

FIRMA: FIbonacci Ring Model Aggregation for Privacy-preserving Federated Learning

cs.LG · 2026-05-21 · unverdicted · novelty 5.0

FIRMA introduces Fibonacci ring aggregation protocols for server-free federated learning that maintain private heads and achieve higher accuracy than FedAvg under label skew across multiple benchmarks and heterogeneity regimes.

Choose Wisely and Privately: Proactive Client Selection for Fair and Efficient Federated Learning

cs.LG · 2026-05-20 · unverdicted · novelty 5.0 · 2 refs

Proactive client selection in federated learning via differentially private mutual information and simulated annealing to optimize Potential Federation Loss for utility and fairness.

FedSDR: Federated Self-Distillation with Rectification

cs.LG · 2026-05-18 · unverdicted · novelty 5.0

FedSDR augments federated self-distillation with dual LoRA streams (local smoothing and global rectification) to produce globally aligned, factually faithful models under statistical heterogeneity.

FedSurrogate: Backdoor Defense in Federated Learning via Layer Criticality and Surrogate Replacement

cs.CR · 2026-05-11 · unverdicted · novelty 5.0

FedSurrogate defends federated learning against backdoors by clustering on security-critical layers and substituting malicious updates with benign surrogates, reporting false-positive rates below 10% and attack success below 2.1% under non-IID conditions.

Rennala MVR: Improved Time Complexity for Parallel Stochastic Optimization via Momentum-Based Variance Reduction

math.OC · 2026-05-09 · unverdicted · novelty 5.0

Rennala MVR improves time complexity over Rennala SGD for smooth nonconvex stochastic optimization in heterogeneous parallel systems under a mean-squared smoothness assumption.

CLAD: A Clustered Label-Agnostic Federated Learning Framework for Joint Anomaly Detection and Attack Classification

cs.LG · 2026-05-07 · unverdicted · novelty 5.0

CLAD is a clustered federated learning framework with a dual-mode architecture for joint anomaly detection and attack classification in IoT using labeled and unlabeled data.

Heterogeneous Model Fusion for Privacy-Aware Multi-Camera Surveillance via Synthetic Domain Adaptation

cs.CV · 2026-05-04 · unverdicted · novelty 5.0 · 2 refs

HeroCrystal achieves 33.4% mAP on cross-domain multi-camera object detection by combining one-shot diffusion-based synthetic data generation, probabilistic federated Faster R-CNN, and inconsistent-category distillation, outperforming prior privacy-preserving baselines by 2.1%.

FMCL: Class-Aware Client Clustering with Foundation Model Representations for Heterogeneous Federated Learning

cs.LG · 2026-04-30 · unverdicted · novelty 5.0

FMCL performs one-shot class-aware client clustering in heterogeneous federated learning by deriving semantic signatures from foundation model embeddings and using cosine distance, yielding improved performance and stable clusters compared to prior methods.

REVERB-FL: Server-Side Adversarial and Reserve-Enhanced Federated Learning for Robust Audio Classification

eess.AS · 2025-12-15 · unverdicted · novelty 5.0

REVERB-FL uses a server-side reserve set with retraining and adversarial training to reduce poisoning effects and speed convergence in federated audio classification under non-IID data.

citing papers explorer

Showing 1 of 1 citing paper after filters.

REVERB-FL: Server-Side Adversarial and Reserve-Enhanced Federated Learning for Robust Audio Classification eess.AS · 2025-12-15 · unverdicted · none · ref 38 · internal anchor
REVERB-FL uses a server-side reserve set with retraining and adversarial training to reduce poisoning effects and speed convergence in federated audio classification under non-IID data.

Federated Learning with Non-IID Data

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer