pith. sign in

arxiv: 2508.17431 · v4 · pith:NG3UI3Z2new · submitted 2025-08-24 · 💻 cs.CV · cs.AI· cs.LG

FedKLPR: KL-Guided Pruning-Aware Federated Learning for Person Re-Identification

Pith reviewed 2026-05-21 22:25 UTC · model grok-4.3

classification 💻 cs.CV cs.AIcs.LG
keywords federated learningperson re-identificationmodel pruningKL divergencecommunication efficiencynon-IID dataprivacy-preserving training
0
0 comments X

The pith

FedKLPR reduces communication costs by 40-42 percent in federated person re-identification through KL-guided pruning and weighted aggregation while maintaining competitive accuracy.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces FedKLPR as a framework to make federated learning practical for person re-identification by tackling non-IID data distributions and heavy model transmission costs. It adds a KL-Divergence Regularization Loss and KL-Divergence-aggregation Weight to stabilize training and reduce heterogeneity effects. Unstructured pruning is combined with a Pruning-ratio-aggregation Weight to form KLPWA for effective server-side merging of compressed client models. Cross-Round Recovery then adjusts pruning intensity over rounds to recover any lost accuracy. Tests across eight datasets show the approach delivers better overall results than prior federated re-ID methods at substantially lower communication volume for ResNet-50 backbones.

Core claim

FedKLPR integrates KL-Divergence-Guided training, KL-Divergence-Prune Weighted Aggregation, and Cross-Round Recovery to train re-identification models collaboratively without sharing raw images. The KL components counteract statistical heterogeneity while the pruning weights and recovery mechanism limit the size of transmitted parameters, enabling the federated process to converge to competitive accuracy with far less bandwidth.

What carries the argument

KL-Divergence-Prune Weighted Aggregation (KLPWA) that combines divergence-based and pruning-ratio weights for server aggregation of pruned local models, paired with Cross-Round Recovery to adapt compression strength across rounds.

If this is right

  • Re-ID models become deployable in bandwidth-constrained surveillance networks without centralized data collection.
  • Pruning can be increased in later rounds once models stabilize, further lowering total transmitted volume.
  • Convergence remains stable even when client camera views and identities overlap little.
  • The same weighting logic could reduce overhead in other vision tasks trained federatedly.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The framework might apply directly to federated training of other large vision models such as object detectors on edge cameras.
  • Pairing KLPWA with quantization could yield additional multiplicative savings in total bits moved.
  • Validation on live multi-camera streams would test whether the fixed weight formulas hold outside benchmark splits.
  • Extending the recovery schedule to client-specific pruning ratios could further personalize compression per device.

Load-bearing premise

The new KL regularization loss, aggregation weights, and cross-round recovery will reliably offset accuracy loss from both non-IID data and pruning on any client data split without extra per-deployment tuning.

What would settle it

Running the method on a new re-ID benchmark with extreme label skew where accuracy drops more than 3-5 percent below unpruned federated averaging while communication savings remain below 30 percent would disprove the central performance claim.

Figures

Figures reproduced from arXiv: 2508.17431 by Po-Hsien Yu, Shao-Yi Chien, Yu-Syuan Tseng.

Figure 1
Figure 1. Figure 1: Two major challenges in unsupervised federated learning re-ID [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Overview of the FedKLPR framework, consisting of a cloud server and eight clients; each client performs local training with KLL, applies unstructured [PITH_FULL_IMAGE:figures/full_fig_p005_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: The example of Sparse Activation Skipping (SAS) for federated [PITH_FULL_IMAGE:figures/full_fig_p007_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: The CRR mechanism with two-stage verification: Stage 1 checks [PITH_FULL_IMAGE:figures/full_fig_p007_4.png] view at source ↗
read the original abstract

Person re-identification (re-ID) is a fundamental task in intelligent surveillance and public safety. Federated learning (FL) provides a privacy-preserving paradigm for collaborative model training without centralized data collection. However, deploying FL in real-world re-ID systems remains challenging due to statistical heterogeneity caused by non-IID client data and the substantial communication overhead incurred by frequent transmission of large-scale models. To address these challenges, we propose FedKLPR, a lightweight and communication-efficient federated learning framework for person re-ID. FedKLPR consists of three key components. First, KL-Divergence-Guided training, including the KL-Divergence Regularization Loss (KLL) and KL-Divergence-aggregation Weight (KLAW), is introduced to mitigate statistical heterogeneity and improve convergence stability under non-IID settings. Second, unstructured pruning is incorporated to reduce communication overhead, and the Pruning-ratio-aggregation Weight (PRAW) is proposed to measure the relative importance of client parameters after pruning. Together with KLAW, PRAW forms KL-Divergence-Prune Weighted Aggregation (KLPWA), enabling effective aggregation of pruned local models under heterogeneous data distributions. Third, Cross-Round Recovery (CRR) adaptively controls pruning across communication rounds to prevent excessive compression and preserve model accuracy. Experiments on eight benchmark datasets demonstrate that FedKLPR achieves substantial communication savings while maintaining competitive accuracy. Compared with state-of-the-art methods, FedKLPR reduces communication cost by 40\%--42\% on ResNet-50 while achieving better overall performance.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript introduces FedKLPR, a federated learning framework for person re-identification that integrates KL-divergence regularization loss (KLL) and aggregation weight (KLAW) to mitigate statistical heterogeneity, unstructured pruning with pruning-ratio-aggregation weight (PRAW) and KL-divergence-prune weighted aggregation (KLPWA) to reduce communication overhead, and cross-round recovery (CRR) to adaptively control pruning and preserve accuracy. Experiments on eight benchmark datasets are reported to show that FedKLPR achieves 40-42% communication cost reduction on ResNet-50 while maintaining competitive or superior accuracy relative to state-of-the-art methods.

Significance. If the empirical claims hold after detailed verification, the work would offer a practical heuristic for deploying communication-efficient FL in re-ID applications under non-IID conditions, addressing key barriers to real-world surveillance systems. The explicit combination of KL-guided weighting with pruning and recovery mechanisms provides a concrete, testable approach that could generalize to other vision tasks with high model size and data heterogeneity.

major comments (2)
  1. [Experiments] Experimental section: the headline claim of 40-42% communication reduction with better overall performance rests on the joint action of KLL, KLAW, PRAW, and CRR counteracting non-IID effects and pruning-induced loss, yet no details are provided on client partitioning strategies, whether extreme skew distributions were tested, exact pruning schedules, or statistical significance of the reported gains; this directly affects verifiability of the robustness assumption.
  2. [Method] Method section describing KLPWA: the Pruning-ratio-aggregation Weight (PRAW) is introduced to measure relative importance of pruned client parameters, but without an explicit equation or derivation showing how it combines with KLAW to form the aggregation rule, it is unclear whether the weighting is parameter-free or requires per-deployment tuning, which is load-bearing for the claim of reliable performance across arbitrary non-IID partitions.
minor comments (2)
  1. [Abstract] The abstract refers to 'eight benchmark datasets' without naming them or providing a summary table of results; adding this would improve clarity for readers.
  2. [Method] Notation for the newly introduced components (KLL, KLAW, PRAW, CRR) is introduced in prose but would benefit from a dedicated table or diagram summarizing their roles and interactions.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback and the recommendation for major revision. The comments highlight important areas for improving clarity and reproducibility, which we will address directly in the revised manuscript.

read point-by-point responses
  1. Referee: [Experiments] Experimental section: the headline claim of 40-42% communication reduction with better overall performance rests on the joint action of KLL, KLAW, PRAW, and CRR counteracting non-IID effects and pruning-induced loss, yet no details are provided on client partitioning strategies, whether extreme skew distributions were tested, exact pruning schedules, or statistical significance of the reported gains; this directly affects verifiability of the robustness assumption.

    Authors: We agree that expanded details on the experimental protocol are essential for verifying robustness under non-IID conditions. In the revised manuscript, we will add a dedicated subsection in Experiments describing the client partitioning strategy, including the specific Dirichlet distribution parameters used to generate non-IID partitions (e.g., α values for moderate and high heterogeneity). We will also report additional results on extreme skew distributions. Exact pruning schedules will be specified, including per-round pruning ratios and the adaptive threshold logic within CRR. Finally, we will include mean performance with standard deviations across multiple random seeds and note statistical significance testing for key comparisons. These changes will be incorporated without altering the reported headline results. revision: yes

  2. Referee: [Method] Method section describing KLPWA: the Pruning-ratio-aggregation Weight (PRAW) is introduced to measure relative importance of pruned client parameters, but without an explicit equation or derivation showing how it combines with KLAW to form the aggregation rule, it is unclear whether the weighting is parameter-free or requires per-deployment tuning, which is load-bearing for the claim of reliable performance across arbitrary non-IID partitions.

    Authors: We thank the referee for this observation on methodological clarity. The current manuscript states that KLPWA is formed by the joint use of KLAW and PRAW, but we acknowledge that an explicit equation and derivation would remove ambiguity. In the revision, we will insert the precise aggregation rule as w_i = (KLAW_i * PRAW_i) / sum_j (KLAW_j * PRAW_j), together with a short derivation explaining that both factors are computed directly from the local KL divergence and the observed pruning ratio. This formulation introduces no extra hyperparameters and requires no per-deployment tuning. We will also add a compact algorithm box illustrating the full weighted aggregation step. These additions will be placed in the Method section describing KLPWA. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical framework with independent experimental validation

full rationale

The paper introduces FedKLPR as a combination of KL-divergence regularization loss, aggregation weights, unstructured pruning, and cross-round recovery to address non-IID heterogeneity and communication costs in federated re-ID. No equations, derivations, or self-referential definitions appear that reduce any claimed prediction or result to its own inputs by construction. The components are presented as design choices validated through experiments on eight benchmark datasets rather than through fitted parameters renamed as predictions or load-bearing self-citations. The central performance claims rest on external empirical outcomes, making the derivation self-contained.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

Review performed on abstract only; the ledger therefore records only the high-level assumptions stated in the abstract and cannot enumerate concrete fitted values or unstated modeling choices present in the full manuscript.

free parameters (1)
  • pruning ratios
    Adaptive ratios are controlled by the Cross-Round Recovery mechanism; exact selection or initialization procedure is not specified in the abstract.
axioms (1)
  • domain assumption Client datasets exhibit statistical heterogeneity (non-IID distributions) that degrades standard federated averaging.
    Invoked to motivate the KL-Divergence Regularization Loss and KL-Divergence-aggregation Weight.

pith-pipeline@v0.9.0 · 5828 in / 1293 out tokens · 88297 ms · 2026-05-21T22:25:27.972648+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

51 extracted references · 51 canonical work pages · 1 internal anchor

  1. [1]

    Shenzhi Wang, Chang Liu, Zilong Zheng, Siyuan Qi, Shuo Chen, Qisen Yang, Andrew Zhao, Chaofei Wang, Shiji Song, and Gao Huang

    L. Wei, S. Zhang, W. Gao and Q. Tian, “Person Transfer GAN to Bridge Domain Gap for Person Re-identification,” 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition , Salt Lake City, UT, USA, 2018, pp. 79–88, doi: 10.1109/CVPR.2018.00016. 1, 3

  2. [3]

    Mutual mean-teaching: Pseudo label refinery for unsupervised domain adaptation on person re-identification,

    Y . Ge, D. Chen, and H. Li, “Mutual mean-teaching: Pseudo label refinery for unsupervised domain adaptation on person re-identification,” in Proceedings of International Conference on Learning Representations (ICLR), 2020. 1, 3

  3. [4]

    Unsupervised cross-dataset per- son re-identification by transfer learning of spatial-temporal patterns,

    J. Lv, W. Chen, Q. Li, and C. Yang, “Unsupervised cross-dataset per- son re-identification by transfer learning of spatial-temporal patterns,” in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition(CVPR), Salt Lake City, UT, USA, 2018, pp. 7948–7956, doi: 10.1109/CVPR.2018.00829. 1, 3

  4. [5]

    Walk in the cloud: Learning curves for point clouds shape analysis, pp

    T. Isobe, D. Li, L. Tian, W. Chen, Y . Shan, and S. Wang, “To- wards discriminative representation learning for unsupervised person re-identification,” in Proceedings of IEEE International Conference on Computer Vision (ICCV) , Montreal, QC, Canada, 2021, pp. 8506–8516, doi: 10.1109/ICCV48922.2021.00841. 1, 3

  5. [6]

    Intra-inter camera similarity for unsupervised person re-identification,

    S. Xuan and S. Zhang, “Intra-inter camera similarity for unsupervised person re-identification,” in Proceedings of IEEE Conference on Com- puter Vision and Pattern Recognition (CVPR) , 2021, pp. 11926–11935. 1, 3

  6. [8]

    PoseNet: A convolutional network for real-time 6-dof camera relocalization,

    J. Wu, Y . Yang, H. Liu, S. Liao, Z. Lei, and S. Z. Li, “Unsupervised graph association for person re-identification,” in Proceedings of IEEE International Conference on Computer Vision (ICCV) , Seoul, Korea (South), 2019, pp. 8320-8329, doi: 10.1109/ICCV .2019.00841. 1, 3

  7. [9]

    Unsupervised person re-identification by deep learning tracklet association,

    M. Li, X. Zhu, and S. Gong, “Unsupervised person re-identification by deep learning tracklet association,” in Proceedings of European Conference on Computer Vision (ECCV) , 2018, pp. 737-–753. 1, 3

  8. [10]

    Deep Association Learning for Unsupervised Video Person Re-identification

    Y . Chen, X. Zhu, and S. Gong, “Deep association learning for unsuper- vised video person re-identification,” arXiv preprint arXiv:1808.07301 ,

  9. [11]

    Unsupervised person re-identification by soft multilabel learning,

    H.-X. Yu, W.-S. Zheng, A. Wu, X. Guo, S. Gong, and J.-H. Lai, “Unsupervised person re-identification by soft multilabel learning,” in Proceedings of IEEE Conference on Computer Vision and Pattern Recog- nition (CVPR), 2019, pp. 2148—2157. 1, 3

  10. [12]

    Camera-aware proxies for unsupervised person re-identification,

    M. Wang, B. Lai, J. Huang, X. Gong, and X.-S. Hua, “Camera-aware proxies for unsupervised person re-identification,” in Proceedings of The Association for the Advancement of Artificial Intelligence (AAAI) , 2021, pp. 2764-–2772. 1, 3, 4

  11. [13]

    Communication-efficient learning of deep networks from decentralized data,

    B. McMahan, E. Moore, D. Ramage, S. Hampson, and B. A. y Arcas, “Communication-efficient learning of deep networks from decentralized data,” in Artificial intelligence and statistics. PMLR, 2017, pp. 1273–

  12. [14]

    Performance optimization of federated person re- identification via benchmark analysis

    W. Zhuang, et al., “Performance optimization of federated person re- identification via benchmark analysis.” in Proceedings of the 28th ACM international conference on multimedia , 2020, pp. 955–963. 1

  13. [15]

    FedCAPR: feder- ated camera-aware unsupervised person re-identification with identity- distributed equalization for decentralized data clustering,

    Y .-S. Tseng, T.-C. Hsu, C.-T. Liu, S.-Y . Chien, “FedCAPR: feder- ated camera-aware unsupervised person re-identification with identity- distributed equalization for decentralized data clustering,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2025. 1, 2, 3, 4, 6, 7, 8, 9, 10

  14. [16]

    Joint optimization in edge- cloud continuum for federated unsupervised person re-identification,

    W. Zhuang, Y . Wen, and S. Zhang, “Joint optimization in edge- cloud continuum for federated unsupervised person re-identification,” in Proceedings of the 29th ACM International Conference on Multimedia ,

  15. [17]

    433–441 1, 6, 8, 9

    pp. 433–441 1, 6, 8, 9

  16. [18]

    Robust knowledge adaptation for federated unsupervised person reid,

    J. Weng, K. Hu, T. Yao, J. Wang, and Z. Wang, “Robust knowledge adaptation for federated unsupervised person reid,” in 2022 International Conference on Digital Image Computing: Techniques and Applications (DICTA). IEEE, 2022, pp. 1-–8. 1, 3, 6, 8, 9

  17. [19]

    Federated Unsupervised Cluster-Contrastive learning for person Re-identification: A coarse-to- fine approach,

    J. Weng, K. Hu, T. Yao, J. Wang, and Z. Wang, “Federated Unsupervised Cluster-Contrastive learning for person Re-identification: A coarse-to- fine approach,” Computer Vision and Image Understanding, V olume 237,

  18. [20]

    Optimiz- ing Federated Unsupervised Person Re-identification via Camera-aware Clustering,

    J. Liu, W. Zhuang, Y . Wen, J. Huang, and W. Lin, “Optimiz- ing Federated Unsupervised Person Re-identification via Camera-aware Clustering,” 2022 IEEE 24th International Workshop on Multimedia Signal Processing (MMSP) , Shanghai, China, 2022, pp. 1–6, doi: 10.1109/MMSP55362.2022.9949249. 1, 3, 6, 8, 9

  19. [21]

    Federated submodel optimization for hot and cold data features

    Y . Ding, et al., “Federated submodel optimization for hot and cold data features.” Advances in Neural Information Processing Systems, 2022, vol. 35, pp. 1–13. 1, 4, 8, 9, 10

  20. [23]

    1223–1228

    Q. Long, C. Anagnostopoulos, S. P. Parambath and D. Bi, “Fed- DIP: Federated Learning with Extreme Dynamic Pruning and In- cremental Regularization,” 2023 IEEE International Conference on Data Mining (ICDM) , Shanghai, China, 2023, pp. 1187–1192, doi: 10.1109/ICDM58522.2023.00146. 1, 4, 8, 9, 10

  21. [24]

    On Information and Sufficiency,

    S. Kullback, and R. A. Leibler, “On Information and Sufficiency,” in The Annals of Mathematical Statistics , 1951, vol. 22, no. 1, pp. 79–86. 2

  22. [25]

    Unlabeled samples generated by gan improve the person re-identification baseline in vitro,

    Z. Zheng, L. Zhen, and Y . Yang, “Unlabeled samples generated by gan improve the person re-identification baseline in vitro,” in Proceedings of IEEE International Conference on Computer Vision (ICCV) , 2017, pp. 3754–3762. 2, 8

  23. [26]

    Scalable person re-identification: A benchmark,

    L. Zheng, L. Shen, L. Tian, S. Wang, J. Wang, and Q. Tian, “Scalable person re-identification: A benchmark,” in Proceedings of IEEE Interna- tional Conference on Computer Vision (ICCV) , 2015, pp. 1116—1124. 2, 8

  24. [27]

    Person re-identification by video ranking,

    T. Wang, S. Gong, X. Zhu, and S. Wang, “Person re-identification by video ranking,” in Proceedings of European Conference on Computer Vision (ECCV), 2014, pp. 688-–703. 2, 8

  25. [28]

    Deepreid: Deep filter pairing neural network for person re-identification,

    W. Li, R. Zhao, T. Xiao, and X. Wang, “Deepreid: Deep filter pairing neural network for person re-identification,” in Proceedings of IEEE Con- ference on Computer Vision and Pattern Recognition (CVPR), Columbus, OH, USA, 2014, pp. 152–159, doi: 10.1109/CVPR.2014.27. 2, 8

  26. [29]

    Person re-identification by descriptive and discriminative classification,

    M. Hirzer, P. Roth, C. Beleznai, and H. Bischof, “Person re-identification by descriptive and discriminative classification,” in Proceedings of the Scandinavian Conference on Image Analysis (SCIA) , 2011, pp. 91—102. 2, 8

  27. [30]

    Viewpoint invariant pedestrian recognition with an ensemble of localized features,

    D. Gray and H. Tao, “Viewpoint invariant pedestrian recognition with an ensemble of localized features,” in Proceedings of European Conference on Computer Vision (ECCV) , 2008, pp. 262—275. 2, 8

  28. [31]

    Human Reidentification with Transferred Metric Learning,

    W. Li, R. Zhao, and X. Wang, “Human Reidentification with Transferred Metric Learning,” in Computer Vision – ACCV 2012 , 2012, pp. 31–44, doi: 10.1007/978-3-642-37331-2 3. 2, 8

  29. [32]

    3dpes: 3d people dataset for surveillance and forensics,

    D. Baltieri, R. Vezzani, and R. Cucchiara, “3dpes: 3d people dataset for surveillance and forensics,” in Joint ACM Workshop on Human Gesture and Behavior Understanding , 2011, pp. 59—64. 2, 8

  30. [33]

    A density-based algorithm for discovering clusters in large spatial databases with noise,

    M. Ester, H.-P. Kriegal, J. Sander, and X. Xu, “A density-based algorithm for discovering clusters in large spatial databases with noise,” in Proceed- ings of The Association for the Advancement of Artificial Intelligence (AAAI), 1996, pp. 226—231. 3

  31. [34]

    A bottom-up clustering approach to unsupervised person re-identification,

    Y . Lin, X. Dong, L. Zheng, Y . Yan, and Y . Yang, “A bottom-up clustering approach to unsupervised person re-identification,” in Proceedings of The Association for the Advancement of Artificial Intelligence (AAAI) , 2019, pp. 8738-–8745. 3

  32. [35]

    Federated optimization in heterogeneous networks,

    T. Li, A. K. Sahu, M. Zaheer, M. Sanjabi, A. Talwalkar, and V . Smith, “Federated optimization in heterogeneous networks,” Proceedings of Machine learning and systems , vol. 2, pp. 429-–450, 2020. 3

  33. [36]

    Personalized federated learning with theoretical guarantees: A model-agnostic meta-learning approach,

    A. Fallah, A. Mokhtari, and A. Ozdaglar, “Personalized federated learning with theoretical guarantees: A model-agnostic meta-learning approach,” Advances in neural information processing systems , vol. 33, pp. 3557—3568, 2020. 3

  34. [37]

    Personalized federated learning with moreau envelopes,

    C. T Dinh, N. Tran, and J. Nguyen, “Personalized federated learning with moreau envelopes,” Advances in neural information processing systems, vol. 33, pp. 21394—21405, 2020. 3

  35. [38]

    Ditto: Fair and robust federated learning through personalization,

    T. Li, S. Hu, A. Beirami, and V . Smith, “Ditto: Fair and robust federated learning through personalization,” in International conference on machine learning, PMLR, 2021, pp. 6357—6368. 3

  36. [39]

    Per- sonalized cross-silo federated learning on non-iid data,

    Y . Huang, L. Chu, Z. Zhou, L. Wang, J. Liu, J. Pei, and Y . Zhang, “Per- sonalized cross-silo federated learning on non-iid data,” in Proceedings of the AAAI conference on artificial intelligence , vol. 35, no. 9, 2021, pp. 7865-–7873. 3 IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 13

  37. [40]

    Fedphp: Federated personalization with inherited private models,

    X.-C. Li, D.-C. Zhan, Y . Shao, B. Li, and S. Song, “Fedphp: Federated personalization with inherited private models,” in Joint European Con- ference on Machine Learning and Knowledge Discovery in Databases . Springer, 2021, pp. 587-–602. 3

  38. [41]

    Per- sonalized federated learning with first order model optimization,

    M. Zhang, K. Sapra, S. Fidler, S. Yeung, and J. M. Alvarez, “Per- sonalized federated learning with first order model optimization,” arXiv preprint arXiv:2012.08565, 2020. 3

  39. [42]

    Adapt to adaptation: Learning personalization for crosssilo federated learning,

    J. Luo and S. Wu, “Adapt to adaptation: Learning personalization for crosssilo federated learning,” in Proceedings of International Joint Conference on Artificial Intelligence (IJCAI) , vol. 2022. NIH Public Access, 2021, pp. 2166–2173. 3

  40. [43]

    Batch normalization: Accelerating deep network training by reducing internal covariate shift,

    S. Ioffe and C. Szegedy, “Batch normalization: Accelerating deep network training by reducing internal covariate shift,” in Proc. Int. Conf. Mach. Learn., 2015, pp. 448—456. 3

  41. [44]

    Easyfl: A low- code federated learning platform for dummies,

    W. Zhuang, X. Gan, Y . Wen, and S. Zhang, “Easyfl: A low- code federated learning platform for dummies,” in IEEE Internet of Things Journal , vol. 9, no. 15, pp. 13740–13754, 1 Aug.1, 2022, doi: 10.1109/JIOT.2022.3143842. 3

  42. [45]

    LotteryFL: Empower Edge Intelligence with Personal- ized and Communication-Efficient Federated Learning,

    A. Li et al., “LotteryFL: Empower Edge Intelligence with Personal- ized and Communication-Efficient Federated Learning,” 2021 IEEE/ACM Symposium on Edge Computing (SEC) , San Jose, CA, USA, 2021, pp. 68–79, doi: 10.1145/3453142.3492909. 3

  43. [46]

    Model Pruning Enables Efficient Federated Learning on Edge Devices,

    Y . Jiang et al., “Model Pruning Enables Efficient Federated Learning on Edge Devices,” in IEEE Transactions on Neural Networks and Learning Systems , vol. 34, no. 12, pp. 10374–10386, Dec. 2023, doi: 10.1109/TNNLS.2022.3166101. 3

  44. [47]

    Federated Learn- ing: Challenges, Methods, and Future Directions,

    T. Li, A. K. Sahu, A. Talwalkar and V . Smith, “Federated Learn- ing: Challenges, Methods, and Future Directions,” in IEEE Signal Processing Magazine , vol. 37, no. 3, pp. 50–60, May 2020, doi: 10.1109/MSP.2020.2975749. 3

  45. [48]

    The convergence of sparsified gradient methods

    D. Alistarh, et al. “The convergence of sparsified gradient methods.” Advances in Neural Information Processing Systems (NIPS) , pp. 5977– 5987, 2018. 3

  46. [49]

    A survey of model compression and acceleration for deep neural networks.ArXiv, abs/1710.09282, 2017

    Y . Cheng, et al. “A Survey of Model Compression and Acceleration for Deep Neural Networks.” arXiv preprint arXiv:1710.09282 , 2017. 3

  47. [50]

    A density-based algorithm for discovering clusters in large spatial databases with noise,

    M. Ester, H.-P. Kriegal, J. Sander, and X. Xu, “A density-based algorithm for discovering clusters in large spatial databases with noise,” in Proceed- ings of The Association for the Advancement of Artificial Intelligence (AAAI), 1996, pp. 226—231 4

  48. [51]

    K. He, X. Zhang, S. Ren and J. Sun, ”Deep Residual Learning for Image Recognition,” 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , Las Vegas, NV , USA, 2016, pp. 770–778, doi: 10.1109/CVPR.2016.90. 8

  49. [52]

    J. Deng, W. Dong, R. Socher, L. -J. Li, Kai Li and Li Fei-Fei, ”ImageNet: A large-scale hierarchical image database,” 2009 IEEE Conference on Computer Vision and Pattern Recognition , Miami, FL, USA, 2009, pp. 248–255, doi: 10.1109/CVPR.2009.5206848. 8

  50. [53]

    FedAffect: Few-shot federated learning for facial expression recognition,

    D. Shome and T. Kar, “FedAffect: Few-shot federated learning for facial expression recognition,” 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), Montreal, BC, Canada, 2021, pp. 4151–4158, doi: 10.1109/ICCVW54120.2021.00463

  51. [54]

    Derf: Decomposed radiance fields,

    Q. Liu, C. Chen, J. Qin, Q. Dou and P. -A. Heng, “FedDG: Federated Domain Generalization on Medical Image Segmentation via Episodic Learning in Continuous Frequency Space,” 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) , Nashville, TN, USA, 2021, pp. 1013-1023, doi: 10.1109/CVPR46437.2021.00107. Po-Hsien Yu received the B.S. d...