Beyond Assumptions: Measuring Federated Learning over Real 5G Networks
Pith reviewed 2026-05-22 21:18 UTC · model grok-4.3
The pith
A 5G testbed experiment finds that federated learning usually has one consistent straggler device rather than varying ones.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Deploying FL using a 5G-NR SA testbed with Raspberry Pis and COTS components reveals that there is a consistent straggler in about 70% of trials, while in the other 30% high communication time causes competing stragglers. These results challenge common assumptions about communication time in FL over wireless networks.
What carries the argument
The 5G testbed consisting of resource-constrained Raspberry Pi edge devices communicating with a central server via SDR and O-RAN software, instrumented with the Flower FL framework to track communication and ML metrics across different network interfaces.
If this is right
- FL performance over 5G is affected by external congestion in measurable ways.
- Varying 5G bandwidths and uplink-downlink scheduling ratios change the observed communication times.
- The testbed results can be checked against commercial 5G deployments for broader validity.
- Straggler mitigation in wireless FL must account for cases where one device consistently lags.
- Open-sourced instrumentation enables direct replication of the measurements on other hardware.
Where Pith is reading between the lines
- FL systems for IoT devices should include per-device profiling to detect persistent stragglers before training starts.
- The same measurement approach could be applied to other wireless links such as WiFi 6 or future 6G to compare straggler patterns.
- If consistent stragglers prove common, asynchronous FL updates or client selection methods may become more effective than synchronous rounds.
- The open-sourced tools lower the barrier for other researchers to test FL under controlled wireless conditions.
Load-bearing premise
The specific hardware and software setup in the custom 5G testbed produces communication and straggler patterns that would appear in commercial 5G networks and standard federated learning tasks.
What would settle it
Observing no consistent straggler across the majority of trials when repeating the FL experiments on a commercial 5G network would disprove the main finding.
Figures
read the original abstract
Deploying FL using IoT devices is an area poised to significantly benefit from advances in NextG wireless. In this paper, we deploy a FL application using a 5G-NR Standalone (SA) testbed with open-source and Commercial Off-the-Shelf (COTS) components. The 5G testbed architecture consists of a network of resource-constrained edge devices, namely Raspberry Pis, and a central server equipped with a Software Defined Radio (SDR) and running O-RAN software. Our testbed allows edge devices to communicate with the server using WiFi and Ethernet in addition to 5G. FL is deployed using the Flower FL framework, extended with custom instrumentation for communication and ML metrics. We analyze the FL application across three network interfaces--5G, WiFi, and Ethernet--as well as across 5G bandwidths and uplink-downlink scheduling ratios. Our experimental results challenge some common assumptions about communication time in FL over wireless and discuss the potential pitfalls of these assumptions. We find that there is a consistent straggler in about 70% of trials, while in the other 30%, high communication time causes competing stragglers. We also compare FL performance over 5G with and without external congestion and compare our testbed to commercial 5G to validate our findings in a broader context. For reproducibility, we have open-sourced our FL application, instrumentation tools, and testbed configuration.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript presents an experimental evaluation of Federated Learning (FL) over a custom 5G-NR Standalone testbed built with Raspberry Pi edge devices, an SDR-equipped server running O-RAN software, and the Flower framework with custom instrumentation. It measures FL performance across 5G, WiFi, and Ethernet interfaces, varying 5G bandwidths and uplink-downlink scheduling ratios, reports a consistent straggler in approximately 70% of trials (with competing stragglers in the remaining 30%), examines effects of external congestion, compares the testbed to commercial 5G, and open-sources the application, tools, and configurations.
Significance. If the testbed behaviors generalize, the direct measurements provide useful empirical evidence that can challenge common assumptions about communication times and stragglers in wireless FL deployments. The open-sourcing of the FL application, instrumentation tools, and testbed configuration is a clear strength that supports reproducibility. The work could help inform practical FL system design for IoT over 5G networks.
major comments (2)
- [Results section reporting straggler percentages] The central empirical claim rests on the finding of a consistent straggler in about 70% of trials. However, the manuscript provides no details on the total number of trials, statistical tests used, error bars, variance across runs, or exclusion criteria, which limits the ability to assess the robustness and generalizability of this percentage.
- [Section on testbed validation against commercial 5G] The comparison to commercial 5G is invoked to validate the testbed findings in a broader context, but if this comparison is restricted to basic network metrics rather than FL-specific straggler and communication-time behaviors under matched conditions, it does not fully address the representativeness concern for the straggler claims.
minor comments (1)
- [Abstract and results] The abstract and results would benefit from explicit statements of the number of trials or repetitions performed for each condition to allow readers to contextualize the reported percentages.
Simulated Author's Rebuttal
We thank the referee for the constructive review and the recommendation of minor revision. We address each major comment below and indicate the revisions we will make to strengthen the manuscript.
read point-by-point responses
-
Referee: [Results section reporting straggler percentages] The central empirical claim rests on the finding of a consistent straggler in about 70% of trials. However, the manuscript provides no details on the total number of trials, statistical tests used, error bars, variance across runs, or exclusion criteria, which limits the ability to assess the robustness and generalizability of this percentage.
Authors: We agree that additional methodological details are needed to allow readers to fully assess the robustness of the reported 70% figure. In the revised manuscript we will expand the relevant Results section to state the total number of trials conducted, describe any statistical tests or descriptive statistics applied, include error bars or variance measures across runs, and specify the exclusion criteria used (if any). These additions will be placed directly alongside the straggler-percentage claim. revision: yes
-
Referee: [Section on testbed validation against commercial 5G] The comparison to commercial 5G is invoked to validate the testbed findings in a broader context, but if this comparison is restricted to basic network metrics rather than FL-specific straggler and communication-time behaviors under matched conditions, it does not fully address the representativeness concern for the straggler claims.
Authors: We appreciate the referee's distinction between network-level and FL-specific validation. Our existing comparison reports both basic network metrics and FL round-completion times; however, we acknowledge that the linkage to straggler behavior could be made more explicit. We will revise the section to highlight the FL-specific communication-time and straggler observations obtained under the commercial 5G conditions we were able to measure, while noting any limitations in achieving perfectly matched experimental conditions. revision: partial
Circularity Check
No circularity: purely experimental measurements with no derivations or self-referential reductions
full rationale
The paper consists entirely of direct experimental reporting from a custom 5G-NR SA testbed using Raspberry Pis, SDR, O-RAN software, and the Flower FL framework. It measures communication times, straggler patterns (e.g., consistent straggler in ~70% of trials), and performance across interfaces and bandwidths, then compares results to commercial 5G. No equations, fitted parameters, predictions derived from inputs, or load-bearing self-citations appear in the provided text or abstract. Claims rest on observed data rather than any derivation chain that reduces to prior fitted values or self-citations. The generalization concern raised in the skeptic note is an external validity issue, not a circularity in the paper's internal logic.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption The 5G testbed with Raspberry Pis, SDR, and O-RAN software accurately captures communication behaviors relevant to commercial 5G deployments.
Reference graph
Works this paper leans on
-
[1]
3GPP. 2024. Service requirements for the 5G system . Technical Specification. 3rd Generation Partnership Project, Sophia Antipolis, France. https://www.3gpp. org/ftp/Specs/archive/22_series/22.261/ Release 18
work page 2024
-
[2]
3GPP. 2025. Base Station (BS) radio transmission and reception . Technical Specifi- cation. 3rd Generation Partnership Project. Release 18
work page 2025
-
[3]
Syreen Banabilah, Moayad Aloqaily, Eitaa Alsayed, Nida Malik, and Yaser Jarar- weh. 2022. Federated learning review: Fundamentals, enabling technologies, and future applications. Information Processing & Management 59, 6 (Nov. 2022), 103061. https://doi.org/10.1016/j.ipm.2022.103061
-
[4]
Mahdi Beitollahi and Ning Lu. 2023. Federated Learning Over Wireless Networks: Challenges and Solutions. IEEE Internet of Things Journal 10, 16 (Aug. 2023), 14749–14763. https://doi.org/10.1109/JIOT.2023.3285868 Conference Name: IEEE Internet of Things Journal
-
[5]
Daniel J Beutel, Taner Topal, Akhil Mathur, Xinchi Qiu, Javier Fernandez- Marques, Yan Gao, Lorenzo Sani, Hei Li Kwing, Titouan Parcollet, Pedro PB de Gusmão, and Nicholas D Lane. 2020. Flower: A Friendly Federated Learning Research Framework. arXiv preprint arXiv:2007.14390 (2020)
work page internal anchor Pith review Pith/arXiv arXiv 2020
-
[6]
Chowdhury, Ste- fano Basagni, and Tommaso Melodia
Leonardo Bonati, Pedram Johari, Michele Polese, Salvatore D’Oro, Subhramoy Mohanti, Miead Tehrani-Moayyed, Davide Villa, Shweta Shrivastava, Chi- nenye Tassie, Kurt Yoder, Ajeet Bagga, Paresh Patel, Ventz Petkov, Michael Seltser, Francesco Restuccia, Abhimanyu Gosain, Kaushik R. Chowdhury, Ste- fano Basagni, and Tommaso Melodia. 2021. Colosseum: Large-Sca...
-
[7]
Stoller, Jacobus Van der Merwe, Kirk Webb, and Gary Wong
Joe Breen, Andrew Buffmire, Jonathon Duerig, Kevin Dutt, Eric Eide, Mike Hibler, David Johnson, Sneha Kumar Kasera, Earl Lewis, Dustin Maas, Alex Orange, Neal Patwari, Daniel Reading, Robert Ricci, David Schurig, Leigh B. Stoller, Jacobus Van der Merwe, Kirk Webb, and Gary Wong. 2020. POWDER: Platform for Open Wireless Data-driven Experimental Research. I...
-
[8]
Vincent Poor, Walid Saad, and Shuguang Cui
Mingzhe Chen, H. Vincent Poor, Walid Saad, and Shuguang Cui. 2020. Wireless Communications for Collaborative Federated Learning. IEEE Communications Magazine 58, 12 (Dec. 2020), 48–54. https://doi.org/10.1109/MCOM.001.2000397 Conference Name: IEEE Communications Magazine
-
[9]
Vincent Poor, Walid Saad, and Shuguang Cui
Mingzhe Chen, H. Vincent Poor, Walid Saad, and Shuguang Cui. 2021. Con- vergence Time Optimization for Federated Learning over Wireless Networks. https://doi.org/10.48550/arXiv.2001.07845 arXiv:2001.07845 [cs]
-
[10]
Gerald Combs. 2024. Wireshark: The world’s most popular network protocol analyzer. https://www.wireshark.org/
work page 2024
-
[11]
TorchVision Contributors. 2025. TorchVision Image Classification Reference Training Scripts. https://github.com/pytorch/vision/blob/ c2ab0c59f42babf9ad01aa616cd8a901daac86dd/references/classification/ README.md. Accessed: 2025-02-24
work page 2025
-
[12]
free5GC. 2025. Open Source 5G Core Network Implementation. GitHub. https: //github.com/free5gc/free5gc
work page 2025
-
[13]
Google. 2024. gRPC: A high-performance, open source universal RPC framework . https://github.com/grpc/grpc
work page 2024
-
[14]
Hayek, Joaquin Chung, Kayla Comer, Chandra Murthy, Rajkumar Kettimuthu, and Igor Kadota
Robert J. Hayek, Joaquin Chung, Kayla Comer, Chandra Murthy, Rajkumar Kettimuthu, and Igor Kadota. 2025. Federated Learning for 5G Tesbed for 5G . https://github.com/Net-X-Research-Group/federated_learning_testbed
work page 2025
-
[15]
SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size
Forrest N. Iandola, Song Han, Matthew W. Moskewicz, Khalid Ashraf, William J. Dally, and Kurt Keutzer. 2016. SqueezeNet: AlexNet-level accuracy with 50x R. J. Hayek et al. fewer parameters and <0.5MB model size. arXiv:1602.07360 (2016)
work page internal anchor Pith review Pith/arXiv arXiv 2016
-
[16]
Yuang Jiang, Shiqiang Wang, Víctor Valls, Bong Jun Ko, Wei-Han Lee, Kin K. Leung, and Leandros Tassiulas. 2023. Model Pruning Enables Efficient Federated Learning on Edge Devices. IEEE Transactions on Neural Networks and Learning Systems 34, 12 (Dec. 2023), 10374–10386. https://doi.org/10.1109/TNNLS.2022. 3166101 Conference Name: IEEE Transactions on Neur...
-
[17]
Alex Krizhevsky. 2009. Learning Multiple Layers of Features from Tiny Images . Technical Report. https://www.cs.toronto.edu/~kriz/learning-features-2009- TR.pdf
work page 2009
-
[19]
Seungyeol Lee and Myung-Ki Shin. 2022. Federated learning over private 5G networks: demo. In Proceedings of the Twenty-Third International Symposium on Theory, Algorithmic Foundations, and Protocol Design for Mobile Networks and Mobile Computing (MobiHoc ’22). Association for Computing Machinery, New York, NY, USA, 295–296. https://doi.org/10.1145/3492866.3561259
-
[20]
Wei Yang Bryan Lim, Nguyen Cong Luong, Dinh Thai Hoang, Yutao Jiao, Ying- Chang Liang, Qiang Yang, Dusit Niyato, and Chunyan Miao. 2020. Federated Learning in Mobile Edge Networks: A Comprehensive Survey. IEEE Communica- tions Surveys & Tutorials 22, 3 (2020), 2031–2063. https://doi.org/10.1109/COMST. 2020.2986024 Conference Name: IEEE Communications Surv...
-
[21]
Brendan McMahan, Eider Moore, Daniel Ramage, Seth Hampson, and Blaise Agüera y Arcas
H. Brendan McMahan, Eider Moore, Daniel Ramage, Seth Hampson, and Blaise Agüera y Arcas. 2023. Communication-Efficient Learning of Deep Networks from Decentralized Data. https://doi.org/10.48550/arXiv.1602.05629 arXiv:1602.05629 [cs]
-
[22]
Giovanni Nardini, Giovanni Stea, and Antonio Virdis. 2021. Scalable Real-Time Emulation of 5G Networks With Simu5G. IEEE Access 9 (2021), 148504–148520. https://doi.org/10.1109/ACCESS.2021.3123873
-
[23]
Marina, Saravana Manickam, Alex Dawson, Raymond Knopp, and Christian Bonnet
Navid Nikaein, Mahesh K. Marina, Saravana Manickam, Alex Dawson, Raymond Knopp, and Christian Bonnet. 2014. OpenAirInterface: A Flexible Platform for 5G Research. SIGCOMM Comput. Commun. Rev. 44, 5 (Oct. 2014), 33–38. https://doi.org/10.1145/2677046.2677053
- [24]
-
[25]
Yasintha Rumesh, Dinaj Attanayaka, Pawani Porambage, Jarno Pinola, Joshua Groen, and Kaushik Chowdhury. 2024. Federated Learning for Anomaly Detec- tion in Open RAN: Security Architecture Within a Digital Twin. In2024 Joint Euro- pean Conference on Networks and Communications & 6G Summit (EuCNC/6G Sum- mit). 877–882. https://doi.org/10.1109/EuCNC/6GSummit...
work page doi:10.1109/eucnc/6gsummit60053.2024.10597083 2024
- [26]
-
[27]
Tran, Wei Bao, Albert Zomaya, Minh N
Nguyen H. Tran, Wei Bao, Albert Zomaya, Minh N. H. Nguyen, and Choong Seon Hong. 2019. Federated Learning over Wireless Networks: Optimization Model Design and Analysis. In IEEE INFOCOM 2019 - IEEE Conference on Computer Communications. 1387–1395. https://doi.org/10.1109/INFOCOM.2019.8737464 ISSN: 2641-9874
-
[28]
Shuo Wan, Jiaxun Lu, Pingyi Fan, Yunfeng Shao, Chenghui Peng, and Khaled B. Letaief. 2021. Convergence Analysis and System Design for Federated Learning Over Wireless Networks. IEEE Journal on Selected Areas in Communications 39, 12 (Dec. 2021), 3622–3639. https://doi.org/10.1109/JSAC.2021.3118351 Conference Name: IEEE Journal on Selected Areas in Communications
-
[29]
Kok-Seng Wong, Manh Nguyen-Duc, Khiem Le-Huy, Long Ho-Tuan, Cuong Do-Danh, and Danh Le-Phuoc. 2023. An Empirical Study of Federated Learning on IoT-Edge Devices: Resource Allocation and Heterogeneity. arXiv:2305.19831 (May 2023). http://arxiv.org/abs/2305.19831 arXiv:2305.19831 [cs]
- [30]
-
[31]
Tianming Zang, Ce Zheng, Shiyao Ma, Chen Sun, and Wei Chen. 2023. A General Solution for Straggler Effect and Unreliable Communication in Federated Learning. In ICC 2023 - IEEE International Conference on Communications . IEEE, Rome, Italy, 1194–1199. https://doi.org/10.1109/ICC45041.2023.10279635 ACRONYMS 5G-NR 5G New Radio. 2–4 5GC 5G Core Network. 3, 4...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.