Federated Learning with Quantum Enhanced LSTM for Applications in High Energy Physics
Pith reviewed 2026-05-10 09:11 UTC · model grok-4.3
The pith
A federated hybrid quantum LSTM matches classical deep learning accuracy on SUSY classification while using 100 times less data and parameters.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The hybrid QLSTM model in a federated learning framework achieves performance comparable to classical deep-learning benchmarks on the SUSY classification task, with accuracy difference of approximately ±1%, while requiring less than 300 parameters and only 20K data points for a 100× improvement over baseline models.
What carries the argument
The hybrid quantum-classical long short-term memory (QLSTM) model that integrates a variational quantum circuit with LSTM layers for local training at each federated node.
Load-bearing premise
That noise from current quantum devices will not prevent the hybrid model from delivering the claimed accuracy and efficiency in practice.
What would settle it
Running the QLSTM model on real NISQ hardware and checking whether classification accuracy stays within 1 percent of classical deep learning results.
Figures
read the original abstract
Learning with large-scale datasets and information-critical applications, such as in High Energy Physics (HEP), demands highly complex, large-scale models that are both robust and accurate. To tackle this issue and cater to the learning requirements, we envision using a federated learning framework with a quantum-enhanced model. Specifically, we design a hybrid quantum-classical long-shot-term-memory model (QLSTM) for local training at distributed nodes. It combines the representative power of quantum models in understanding complex relationships within the feature space, and an LSTM-based model to learn necessary correlations across data points. Given the computing limitations and unprecedented cost of current stand-alone noisy-intermediate quantum (NISQ) devices, we propose to use a federated learning setup, where the learning load can be distributed to local servers as per design and data availability. We demonstrate the benefits of such a design on a classification task for the Supersymmetry(SUSY) dataset, having 5M rows. Our experiments indicate that the performance of this design is not only better that some of the existing work using variational quantum circuit (VQC) based quantum machine learning (QML) techniques, but is also comparable ($\Delta \sim \pm 1\%$) to that of classical deep-learning benchmarks. An important observation from this study is that the designed framework has $<$300 parameters and only needs 20K data points to give a comparable performance. Which also turns out to be a 100$\times$ improvement than the compared baseline models. This shows an improved learning capability of the proposed framework with minimal data and resource requirements, due to the joint model with an LSTM based architecture and a quantum enhanced VQC.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes a federated learning framework that employs a hybrid quantum-classical LSTM (QLSTM) model, where variational quantum circuits replace components of the LSTM, for a binary classification task on the SUSY dataset in high-energy physics. The central empirical claims are that the QLSTM outperforms some existing VQC-based QML methods, achieves performance within Δ ∼ ±1% of classical deep-learning benchmarks, and delivers this with fewer than 300 parameters using only 20K training points, corresponding to a 100× efficiency improvement over the compared baselines.
Significance. If the performance and efficiency claims can be substantiated with complete experimental protocols, statistical controls, and noise modeling, the work would offer a concrete demonstration of parameter-efficient hybrid quantum-classical models in a federated setting for data-intensive scientific domains. The emphasis on minimal resource requirements aligns with practical constraints in both quantum hardware and distributed HEP computing.
major comments (4)
- [Abstract] Abstract: the claim of comparability (Δ ∼ ±1%) to classical deep-learning benchmarks supplies no description of the specific baseline architectures, their parameter counts, training protocols, validation splits, or any error bars/statistical tests, rendering the delta assertion unevaluable.
- [Abstract] Abstract: the 100× improvement and <300-parameter claims with 20K data points are stated without any comparison table, ablation study, or explicit baseline metrics, so the efficiency advantage cannot be verified against the referenced models.
- [Abstract] Abstract: no experimental protocol, hyperparameter settings, optimizer details, or number of runs is provided to support the reported performance deltas on the SUSY dataset, contrary to standard requirements for empirical ML claims.
- [Abstract] Abstract: the proposal for deployment on NISQ devices within the federated framework is not accompanied by any noisy quantum-circuit simulations, error-mitigation results, or hardware execution data, despite the central role of the VQC components whose noise resilience is required for the headline metrics to hold.
minor comments (3)
- [Abstract] Typo: 'long-shot-term-memory' should read 'long short-term memory'.
- [Abstract] Grammatical error: 'better that some of the existing work' should be 'better than some of the existing work'.
- [Abstract] Grammatical error: '100× improvement than the compared baseline models' should be '100× improvement over the compared baseline models'.
Simulated Author's Rebuttal
We thank the referee for the careful reading and valuable feedback on our manuscript. We agree that the abstract, as currently written, is too concise and does not adequately support the empirical claims with the necessary context. We will revise the abstract to incorporate brief references to the baselines, protocols, and metrics while directing readers to the detailed sections of the paper. Below we respond to each major comment.
read point-by-point responses
-
Referee: [Abstract] Abstract: the claim of comparability (Δ ∼ ±1%) to classical deep-learning benchmarks supplies no description of the specific baseline architectures, their parameter counts, training protocols, validation splits, or any error bars/statistical tests, rendering the delta assertion unevaluable.
Authors: The manuscript body (Sections 3 and 4) specifies the classical baselines as a standard LSTM (approximately 12,000 parameters) and a multi-layer perceptron (approximately 8,000 parameters), trained with the Adam optimizer (learning rate 0.001), an 80/20 train/validation split on the SUSY dataset, and results reported as means over 10 independent runs with standard deviations. The Δ ∼ ±1% is computed from these averaged accuracies. We will add a concise summary of the baselines and statistical procedure to the revised abstract. revision: yes
-
Referee: [Abstract] Abstract: the 100× improvement and <300-parameter claims with 20K data points are stated without any comparison table, ablation study, or explicit baseline metrics, so the efficiency advantage cannot be verified against the referenced models.
Authors: Section 4 contains Table 2, which directly compares parameter counts (<300 for QLSTM vs. >10,000 for the VQC baselines), training set sizes (20K vs. full 5M), and accuracy, yielding the stated 100× efficiency metric. An ablation study isolating the quantum component appears in Section 4.2. We will insert a parenthetical reference to this table and the efficiency calculation in the revised abstract. revision: yes
-
Referee: [Abstract] Abstract: no experimental protocol, hyperparameter settings, optimizer details, or number of runs is provided to support the reported performance deltas on the SUSY dataset, contrary to standard requirements for empirical ML claims.
Authors: Section 3 details the QLSTM architecture (4 qubits, 2 variational layers, LSTM hidden size 32) and Section 4 lists all hyperparameters in Table 1 together with the training protocol (batch size 32, 100 epochs, federated averaging every 5 rounds). Performance is averaged over 5 independent runs with different random seeds. We will include a one-sentence summary of the protocol and run count in the updated abstract. revision: yes
-
Referee: [Abstract] Abstract: the proposal for deployment on NISQ devices within the federated framework is not accompanied by any noisy quantum-circuit simulations, error-mitigation results, or hardware execution data, despite the central role of the VQC components whose noise resilience is required for the headline metrics to hold.
Authors: The reported results use ideal (noiseless) quantum simulations to establish the performance baseline. We recognize that explicit noise modeling is required to substantiate NISQ deployment claims. In the revision we will add a short discussion subsection with preliminary depolarizing-noise simulations and error-mitigation considerations, while clarifying that full hardware execution remains future work. revision: partial
Circularity Check
No derivation chain present; all claims are empirical performance reports.
full rationale
The manuscript reports experimental accuracy and efficiency numbers on the SUSY dataset using a hybrid QLSTM inside a federated framework. No equations, ansatzes, uniqueness theorems, or fitted-parameter predictions are introduced that could reduce to their own inputs. The central claims rest on simulation results rather than any mathematical derivation that would require checking for self-definition or self-citation load-bearing. Self-citations, if present, are not used to justify any load-bearing step. This is the normal case of an empirical ML paper and receives the default non-circularity finding.
Axiom & Free-Parameter Ledger
free parameters (1)
- hybrid model parameters
Reference graph
Works this paper leans on
-
[1]
Cern data centre passes the 200-petabyte milestone
CERN, “Cern data centre passes the 200-petabyte milestone.” https: //home.cern/news/news/computing/cern-data-centre-passes-200-petabyt e-milestone, 2017. Accessed January 2026
work page 2017
-
[2]
Machine learning in the search for new fundamental physics,
G. Karagiorgi, G. Kasieczka, S. Kravitz, B. Nachman, and D. Shih, “Machine learning in the search for new fundamental physics,”Nature Reviews Physics 2022 4:6, vol. 4, pp. 399–412, 5 2022
work page 2022
-
[3]
High- energy nuclear physics meets machine learning,
W. B. He, Y . G. Ma, L. G. Pang, H. C. Song, and K. Zhou, “High- energy nuclear physics meets machine learning,”Nuclear Science and Techniques 2023 34:6, vol. 34, pp. 88–, 6 2023
work page 2023
-
[4]
Deep learning and its application to LHC physics,
D. Guest, K. Cranmer, and D. Whiteson, “Deep learning and its application to LHC physics,”Annual Review of Nuclear and Particle Science, vol. 68, pp. 161–181, 10 2018
work page 2018
-
[5]
The data-driven future of high- energy-density physics,
P. W. Hatfield, J. A. Gaffney, G. J. Anderson, S. Ali, L. Antonelli, S. Bas ¸e˘gmez du Pree, J. Citrin, M. Fajardo, P. Knapp, B. Kettle, B. Kus- towski, M. J. MacDonald, D. Mariscal, M. E. Martin, T. Nagayama, C. A. Palmer, J. L. Peterson, S. Rose, J. J. Ruby, C. Shneider, M. J. Streeter, W. Trickey, and B. Williams, “The data-driven future of high- energ...
work page 2021
-
[6]
Supervised learning with quantum- enhanced feature spaces,
V . Havl ´ıˇcek, A. D. C ´orcoles, K. Temme, A. W. Harrow, A. Kandala, J. M. Chow, and J. M. Gambetta, “Supervised learning with quantum- enhanced feature spaces,”Nature 2019 567:7747, vol. 567, pp. 209–212, 3 2019
work page 2019
-
[7]
Quantum machine learning for chemistry and physics,
M. Sajjan, J. Li, R. Selvarajan, S. H. Sureshbabu, S. S. Kale, R. Gupta, V . Singh, and S. Kais, “Quantum machine learning for chemistry and physics,”Chem. Soc. Rev., vol. 51, pp. 6475–6573, 2022
work page 2022
-
[8]
S. L. Wu, J. Chan, W. Guan, S. Sun, A. Z. Wang, C. Zhou, D. Livescu, and M. Carena, “Application of quantum machine learning using the quantum kernel algorithm to high energy physics analysis at the lhc us- ing ibm simulators and quantum hardware,”Physical Review Research, vol. 3, p. 033221, 2021
work page 2021
-
[9]
Event classification with quantum machine learning in high- energy physics,
K. Terashi, M. Kaneda, T. Kishimoto, M. Saito, R. Sawada, and J. Tanaka, “Event classification with quantum machine learning in high- energy physics,”Computing and Software for Big Science, vol. 5, no. 1, p. 2, 2021
work page 2021
-
[10]
Quantum-inspired machine learning on high-energy physics data,
T. Felser, M. Trenti, L. Sestini, A. Gianelle, D. Zuliani, D. Lucchesi, and S. Montangero, “Quantum-inspired machine learning on high-energy physics data,”npj Quantum Information 2021 7:1, vol. 7, pp. 111–, 7 2021
work page 2021
-
[11]
A quantum machine learning- based predictive analysis of CERN collision events,
S. Tripathi, H. Upadhyay, and J. Soni, “A quantum machine learning- based predictive analysis of CERN collision events,”Scientific Reports 2025 16:1, vol. 16, pp. 682–, 12 2025
work page 2025
-
[12]
Ibm quantum roadmap 2025: Practical quantum computing era
IBM Quantum Team, “Ibm quantum roadmap 2025: Practical quantum computing era.” https://www.ibm.com/roadmaps/quantum/, 2025
work page 2025
-
[13]
The quantum echoes algorithm breakthrough
H. Neven, “The quantum echoes algorithm breakthrough.” https://blog.g oogle/innovation-and-ai/technology/research/quantum-echoes-willow-v erifiable-quantum-advantage/, 2025. Google Research Blog, October 2025
work page 2025
-
[14]
Highly scalable quantum computing with neutral atoms
Atom Computing, “Highly scalable quantum computing with neutral atoms.” https://atom-computing.com/wp-content/uploads/2025/01/Ato m-Computing-Whitepaper-2025.pdf, 2025. Whitepaper, January 2025
work page 2025
-
[15]
Pasqal, “Pasqal releases 2025 roadmap showcasing upgradable archi- tecture toward fault-tolerant quantum computing.” https://www.pasqal .com/wp-content/uploads/2025/10/Pasqal-Roadmap-2025.pdf, 2025. Roadmap document, June 2025
work page 2025
-
[16]
Ionq hits aq 64 milestone ahead of schedule
IonQ, “Ionq hits aq 64 milestone ahead of schedule.” https://www.ionq .com/blog/ionq-hits-aq-64-milestone-ahead-of-schedule-and-sets-its-s ights-even-higher, 2025. IonQ Blog, October 2025
work page 2025
-
[17]
Quantum computing in the nisq era and beyond,
J. Preskill, “Quantum computing in the nisq era and beyond,”Quantum, vol. 2, p. 79, 2018
work page 2018
-
[18]
Communication-efficient learning of deep networks from decentralized data,
H. B. McMahan, E. Moore, D. Ramage, S. Hampson, and B. A. y. Arcas, “Communication-efficient learning of deep networks from decentralized data,” inProceedings of the 20th International Conference on Artificial Intelligence and Statistics (AISTATS), vol. 54, pp. 1273–1282, 2017
work page 2017
-
[19]
Advances and open problems in federated learning,
P. Kairouz, H. B. McMahan, B. Avent, A. Bellet, M. Bennis, A. N. Bhagoji, K. Bonawitz, Z. Charles, G. Cormode, R. Cummings,et al., “Advances and open problems in federated learning,”Foundations and Trends in Machine Learning, vol. 14, no. 1–2, pp. 1–210, 2021
work page 2021
-
[20]
A. Sawaika, S. Krishna, T. Tomar, D. P. Suggisetti, A. Lal, T. Shrivastav, N. Innan, and M. Shafique, “A privacy-preserving federated framework with hybrid quantum-enhanced learning for financial fraud detection,” inProceedings of the 2025 IEEE International Conference on Quantum Computing and Engineering (QCE), vol. 01, pp. 1578–1588, 2025
work page 2025
-
[21]
Fedqnn: Federated learning using quantum neural networks,
N. Innan, M. A.-Z. Khan, A. Marchisio, M. Shafique, and M. Bennai, “Fedqnn: Federated learning using quantum neural networks,”arXiv preprint arXiv:2403.10861, 2024
-
[22]
Federated quantum machine learning,
S. Y .-S. Chen and S. Yoo, “Federated quantum machine learning,” Entropy (Basel), vol. 23, p. 460, Apr 2021
work page 2021
-
[23]
Searching for exotic particles in high-energy physics with deep learning,
P. Baldi, P. Sadowski, and D. Whiteson, “Searching for exotic particles in high-energy physics with deep learning,”Nature Communications, vol. 5, 2014
work page 2014
-
[24]
What is quantum computing and how it works,
B. Zohuri, “What is quantum computing and how it works,”Journal of Material Sciences & Manufacturing Research, vol. 3, pp. 3–5, 2020
work page 2020
-
[25]
J. Biamonteet al., “Quantum machine learning,”Nature, vol. 549, pp. 195–202, 2017
work page 2017
-
[26]
A comprehensive review of data encoding techniques for quantum machine learning problems,
M. B. Pande, “A comprehensive review of data encoding techniques for quantum machine learning problems,” inProceedings of the Second International Conference on Emerging Trends in Information Technology and Engineering (ICETITE), pp. 1–7, 2024
work page 2024
-
[27]
M. Rath and H. Date, “Quantum data encoding: a comparative analysis of classical-to-quantum mapping techniques and their impact on machine learning accuracy,”EPJ Quantum Technology, vol. 11, p. 72, 12 2024
work page 2024
-
[28]
Comparative study of amplitude versus angle encoding in variational quantum classifiers,
S. Kumar and A. Kumar, “Comparative study of amplitude versus angle encoding in variational quantum classifiers,”Physics Journal, vol. 7, no. 2, pp. 49–193, 2025
work page 2025
-
[29]
A repetitive amplitude encoding method for enhancing the mapping ability of quantum neural networks,
Z. Li, X. Fu, L. Meng, and R. Du, “A repetitive amplitude encoding method for enhancing the mapping ability of quantum neural networks,” Nature Scientific Reports, vol. 15, p. 17651, August 2025
work page 2025
-
[30]
Quantum angle encoding with learnable rotation applied to quantum machine learning,
E. Ovalle-Magallaneset al., “Quantum angle encoding with learnable rotation applied to quantum machine learning,”Neural Computing and Applications, vol. 35, pp. 12345–12358, 2023
work page 2023
-
[31]
Data re-uploading for a universal quantum classifier,
A. P ´erez-Salinas, A. Cervera-Lierta, E. Gil-Fuster, and J. I. Latorre, “Data re-uploading for a universal quantum classifier,”Quantum, vol. 4, p. 226, 2 2020
work page 2020
-
[32]
Robust data encodings for quantum classifiers,
R. Larose and B. Coyle, “Robust data encodings for quantum classifiers,” Physical Review A, vol. 102, p. 032420, 9 2020
work page 2020
-
[33]
Expressive power of parametrized quantum circuits,
Y . Du, M.-H. Hsieh, T. Liu, and D. Tao, “Expressive power of parametrized quantum circuits,”Physical Review Research, vol. 2, p. 033125, 2020
work page 2020
-
[34]
On the practical usefulness of the hardware efficient ansatz,
L. Leone, J. Gibbs, L. Di Ruscio, N. Sangouard, and M. Cerezo, “On the practical usefulness of the hardware efficient ansatz,”Quantum, vol. 8, p. 1395, 2024
work page 2024
-
[35]
S. Z. Khanet al., “Quantum long short-term memory (qlstm) vs. classical lstm: A comparative analysis for solar power forecasting,” Frontiers in Physics, vol. 12, p. 1439180, 2024
work page 2024
-
[36]
Quantum long short-term memory,
S. Y .-C. Chen, S. Yoo, and Y .-L. L. Fang, “Quantum long short-term memory,” inProceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 8622–8626, 2022
work page 2022
-
[37]
Quantum federated learning through blind quantum computing,
W. Li, S. Lu, and D. L. Deng, “Quantum federated learning through blind quantum computing,”Science China Physics, Mechanics & Astronomy 2021 64:10, vol. 64, pp. 100312–, 9 2021
work page 2021
-
[38]
Quantum federated learning with quantum data,
M. Chehimi and W. Saad, “Quantum federated learning with quantum data,”Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, vol. 2022-May, pp. 8617–8621, 2022
work page 2022
-
[39]
Data preservation in high energy physics,
T. Basaglia, M. Bellis, J. Blomer, J. Boyd, C. Bozzi, D. Britzger, S. Campana, C. Cartaro, G. Chen, B. Couturier,et al., “Data preservation in high energy physics,”The European Physical Journal C, vol. 83, no. 9, p. 795, 2023
work page 2023
-
[40]
Heterogeneous Federated Learning: State-of-the-art and Research Challenges,
M. Ye, X. Fang, B. Du, P. C. Yuen, and D. Tao, “Heterogeneous Federated Learning: State-of-the-art and Research Challenges,”ACM Computing Surveys, vol. 56, 3 2024
work page 2024
-
[41]
SUSY , the Third Generation and the LHC,
C. Brust, A. Katz, S. Lawrence, and R. Sundrum, “SUSY , the Third Generation and the LHC,”Journal of High Energy Physics 2012 2012:3, vol. 2012, pp. 103–, 3 2012
work page 2012
-
[42]
Whiteson, “SUSY.” UCI Machine Learning Repository, 2014
D. Whiteson, “SUSY.” UCI Machine Learning Repository, 2014. DOI: https://doi.org/10.24432/C54606
-
[43]
Simulating quantum computa- tions on classical machines: A survey,
K. Young, M. Scese, and A. Ebnenasir, “Simulating quantum computa- tions on classical machines: A survey,”arXiv preprint arXiv:2311.16505, 2023
-
[44]
Client Selection in Federated Learning: Principles, Challenges, and Opportunities,
L. Fu, H. Zhang, G. Gao, M. Zhang, and X. Liu, “Client Selection in Federated Learning: Principles, Challenges, and Opportunities,”IEEE Internet of Things Journal, vol. 10, pp. 21811–21819, 12 2023
work page 2023
-
[45]
Efficient client contribution evaluation for horizontal federated learning,
J. Zhao, X. Zhu, J. Wang, and J. Xiao, “Efficient client contribution evaluation for horizontal federated learning,”Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, vol. 2021-June, pp. 3060–3064, 2021
work page 2021
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.