FLEX-MoE: Federated Mixture-of-Experts with Load-balanced Expert Assignment for Edge Computing

Boyang Zhang; Jian Zhang; Mingxuan Sun; Shuai Zhang; Songyang Zhang; Xiangwei Zhou; Xiaobing Chen

arxiv: 2512.23070 · v2 · pith:YMXBHK37new · submitted 2025-12-28 · 💻 cs.LG

Boyang Zhang, Xiaobing Chen, Songyang Zhang, Shuai Zhang, Xiangwei Zhou, Jian Zhang, Mingxuan Sun

classification 💻 cs.LG

keywords expert, edge, federated, flex-moe, load, utilization, assignment, balanced

Mixture-of-Experts (MoE) models enable scalable neural networks through conditional computation, offering enhanced effectiveness and efficiency for next-generation wireless communications. However, deploying MoE with federated learning (FL) over wireless and IoT edge networks faces two critical challenges: 1) resource-constrained clients cannot store large AI models with full expert sets, and 2) non-IID data distributions cause severe expert load imbalance that degrades model performance. To this end, we propose FLEX-MoE, a federated MoE framework that jointly optimizes expert assignment and load balancing under limited client capacity. Specifically, our approach introduces client-expert fitness scores that quantify expert suitability for local datasets through training feedback, and employs an optimization-based algorithm to maximize client-expert specialization while enforcing balanced expert utilization system-wide. Unlike greedy methods that focus solely on personalization while ignoring load imbalance, FLEX-MoE addresses expert utilization skew, which is particularly severe in heterogeneous edge FL. Our experimental results demonstrate superior accuracy and consistently balanced expert utilization across diverse resource-constrained scenarios for edge computing.
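
The trade-off described in the abstract, specialization versus balanced expert utilization under limited client capacity, can be illustrated with a small sketch. This is not the paper's optimization algorithm: it swaps in a simple load-capped greedy heuristic and compares it with per-client top-k selection. The fitness matrix, the function names, the per-client capacity `k`, and the per-expert cap are all hypothetical values chosen for illustration.

```python
from math import ceil

def topk_greedy(fitness, k):
    """Each client independently keeps its k highest-fitness experts (no load control)."""
    return [sorted(sorted(range(len(row)), key=lambda e: -row[e])[:k])
            for row in fitness]

def load_capped_assign(fitness, k, cap=None):
    """Load-capped greedy heuristic (illustrative stand-in, not the paper's method).

    Visits (client, expert) pairs in descending fitness order and accepts a pair
    only if the client still has a free slot and the expert is below its cap.
    Being a heuristic, it is not guaranteed to be optimal or to fill every slot.
    """
    n_clients, n_experts = len(fitness), len(fitness[0])
    if cap is None:
        # Assumed cap: spread the n_clients * k slots evenly over experts.
        cap = ceil(n_clients * k / n_experts)
    pairs = sorted(
        ((fitness[c][e], c, e) for c in range(n_clients) for e in range(n_experts)),
        key=lambda t: -t[0],
    )
    assigned = [[] for _ in range(n_clients)]
    load = [0] * n_experts
    for _, c, e in pairs:
        if len(assigned[c]) < k and load[e] < cap:
            assigned[c].append(e)
            load[e] += 1
    return [sorted(a) for a in assigned]

def expert_load(assignment, n_experts):
    """Count how many clients hold each expert."""
    load = [0] * n_experts
    for experts in assignment:
        for e in experts:
            load[e] += 1
    return load

# Hypothetical fitness scores: rows are clients, columns are experts.
# Every client favors experts 0 and 1, mimicking skew from non-IID data.
fitness = [
    [0.9, 0.8, 0.2, 0.1],
    [0.8, 0.9, 0.3, 0.1],
    [0.9, 0.7, 0.1, 0.4],
    [0.6, 0.8, 0.5, 0.3],
]
k = 2  # assumed number of experts each client can store

greedy = topk_greedy(fitness, k)
capped = load_capped_assign(fitness, k)
greedy_load = expert_load(greedy, 4)
capped_load = expert_load(capped, 4)

print("top-k greedy load:", greedy_load)   # [4, 4, 0, 0]
print("load-capped load: ", capped_load)   # [2, 2, 2, 2]
print("load-capped assignment:", capped)
```

In this toy instance, per-client top-k leaves experts 2 and 3 unused, while the capped variant gives every expert two clients at the cost of some clients receiving lower-fitness experts. A formulation that optimizes total fitness subject to the same constraints, as the abstract describes, would replace the greedy pass.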

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

$\phi$-Balancing for Mixture-of-Experts Training
cs.LG 2026-05 unverdicted novelty 7.0

φ-balancing is a convex optimization method for population-level expert balance in MoE training that derives an online EMA adjustment and outperforms heuristic baselines.