Error-mitigated quantum state tomography using neural networks
Pith reviewed 2026-05-16 05:18 UTC · model grok-4.3
The pith
Neural network tomography mitigates unknown noise in quantum state reconstruction without explicit models.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
A scalable tomography method based on multilayer perceptron networks mitigates unknown noise through supervised learning. This approach is data-driven and thus does not rely on explicit assumptions about the noise model or measurement, making it readily extendable to general quantum systems. Numerical simulations, ranging from special pure states to random mixed states, demonstrate that the proposed method effectively mitigates noise across a broad range of scenarios, compared with the case without mitigation.
What carries the argument
Multilayer perceptron networks trained by supervised learning on simulated noisy measurement data to output denoised quantum state estimates.
If this is right
- Accurate state estimation becomes feasible on noisy quantum hardware without needing to characterize the noise source first.
- The method extends to larger systems because it avoids explicit noise modeling and scales with available simulation data.
- Reconstructed states achieve higher fidelity in scenarios ranging from pure to mixed, improving downstream tasks that depend on state knowledge.
- The approach works across diverse noise types since it learns from data rather than assuming a fixed model.
- It provides a practical route to tomography on near-term devices where noise is dominant but unknown in detail.
Where Pith is reading between the lines
- The same supervised-learning structure could be adapted to quantum process tomography or other characterization tasks by changing the input-output pairs.
- Hardware-specific training data generated from detailed device simulations might further boost performance on particular platforms.
- Combining the network output with existing error-correction protocols could reduce the overhead needed for reliable quantum computation.
- The method might serve as a post-processing layer for data collected from other tomography protocols, such as compressed sensing approaches.
Load-bearing premise
Training the network on simulated noisy data will allow it to generalize and mitigate noise in actual experimental measurements without any explicit noise model or measurement assumptions.
What would settle it
Apply the trained network to real experimental tomography data from a quantum device, then compare the fidelity of the reconstructed state against a high-precision reference measurement or known ground truth to check whether accuracy improves over standard tomography.
read the original abstract
The reliable characterization of quantum states is a fundamental task in quantum information science. For this purpose, quantum state tomography provides a standard framework for reconstructing quantum states from measurement data, yet it is often degraded by experimental noise. Mitigating such noise is therefore essential for the accurate estimation of the states in realistic settings. In this work, we propose a scalable tomography method based on multilayer perceptron networks that mitigate unknown noise through supervised learning. This approach is data-driven and thus does not rely on explicit assumptions about the noise model or measurement, making it readily extendable to general quantum systems. Numerical simulations, ranging from special pure states to random mixed states, demonstrate that the proposed method effectively mitigates noise across a broad range of scenarios, compared with the case without mitigation.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes a data-driven quantum state tomography method that uses a multilayer perceptron trained via supervised learning on simulated noisy measurement data to mitigate unknown noise without requiring an explicit noise model or measurement assumptions. Numerical simulations spanning special pure states to random mixed states are presented to show improved state reconstruction fidelity relative to standard tomography without mitigation.
Significance. If the generalization from simulation to experiment holds, the approach could offer a scalable, model-free tool for accurate state estimation on noisy quantum hardware, reducing reliance on detailed noise characterization and extending readily to larger systems. The explicit use of supervised learning on simulated data is a clear methodological strength, though its practical impact hinges on transferability.
major comments (2)
- [Numerical simulations] Numerical simulations section: the central claim of effective noise mitigation across pure and mixed states rests on these runs, yet the manuscript provides insufficient detail on network architecture (layer count, neuron sizes), training procedure (loss function, optimizer, dataset generation), and quantitative metrics (fidelity or trace-distance improvements with error bars), preventing assessment of whether results are robust or tied to specific simulation conditions.
- [Method and results] Method and results sections: the load-bearing assumption that a network trained solely on simulated noisy tomography data will generalize to real experimental measurements (without explicit noise models) is unvalidated; no tests with mismatched noise (e.g., unmodeled crosstalk, calibration drift, or non-Markovian effects) or actual hardware data are reported, which directly limits the model-free applicability claim.
minor comments (2)
- The description of input/output dimensions for the perceptron relative to the number of measurement settings could be clarified with an explicit equation or diagram.
- [Introduction] A few references to prior neural-network tomography works appear missing or under-cited in the introduction.
Simulated Author's Rebuttal
We thank the referee for the detailed and constructive report. We have revised the manuscript to address the major comments on numerical details and have added explicit discussion of the simulation-to-experiment gap. Below we respond point by point.
read point-by-point responses
-
Referee: [Numerical simulations] Numerical simulations section: the central claim of effective noise mitigation across pure and mixed states rests on these runs, yet the manuscript provides insufficient detail on network architecture (layer count, neuron sizes), training procedure (loss function, optimizer, dataset generation), and quantitative metrics (fidelity or trace-distance improvements with error bars), preventing assessment of whether results are robust or tied to specific simulation conditions.
Authors: We agree that the original manuscript lacked sufficient implementation details for reproducibility. In the revised version we have expanded the Numerical simulations section with: (i) the exact multilayer perceptron architecture (3 hidden layers with 256, 128 and 64 neurons, ReLU activations, linear output); (ii) training details (Adam optimizer with learning rate 10^{-3}, mean-squared-error loss on the vectorized density-matrix elements, 10^5 training samples generated from random pure and mixed states with simulated depolarizing and amplitude-damping noise); (iii) quantitative results reported as mean fidelity and trace distance over 100 independent runs, each with standard-error bars. These additions allow direct assessment of robustness. revision: yes
-
Referee: [Method and results] Method and results sections: the load-bearing assumption that a network trained solely on simulated noisy tomography data will generalize to real experimental measurements (without explicit noise models) is unvalidated; no tests with mismatched noise (e.g., unmodeled crosstalk, calibration drift, or non-Markovian effects) or actual hardware data are reported, which directly limits the model-free applicability claim.
Authors: We acknowledge that the present work contains only numerical simulations and does not demonstrate transfer to real hardware or to deliberately mismatched noise models. This is a genuine limitation of the current study, which is intended as a controlled proof-of-concept for the data-driven approach. In the revised manuscript we have added a dedicated paragraph in the Discussion section that explicitly states this limitation, quantifies the simulation noise models used, and outlines the experimental validation steps required for future work. The core methodological claim—that the network learns a mapping without an explicit noise model—remains supported by the simulation results. revision: partial
Circularity Check
No circularity: method is a standard supervised NN trained and evaluated on separate simulated datasets
full rationale
The paper describes a multilayer perceptron trained via supervised learning on simulated noisy tomography data to map to quantum states, with performance shown on separate numerical simulations from pure states to random mixed states. No derivation step reduces by construction to its inputs: the network parameters are fitted on training simulations and then applied to distinct test simulations, which is standard non-circular ML evaluation. No self-citations, uniqueness theorems, or ansatzes from prior author work are invoked as load-bearing; the approach is explicitly data-driven without explicit noise models. The central claim therefore rests on independent simulation benchmarks rather than any self-referential fit or renaming.
Axiom & Free-Parameter Ledger
free parameters (1)
- Neural network weights and biases
axioms (1)
- domain assumption Supervised learning on simulated data suffices to mitigate unknown experimental noise without any noise model assumptions
Reference graph
Works this paper leans on
-
[1]
H. M. Wiseman and G. J. Milburn,Quantum Measurement and Control(Cambridge University Press, Cambridge, 2009). 7
work page 2009
- [2]
-
[3]
T. Morimae, Y. Takeuchi, and M. Hayashi, Verification of hypergraph states, Phys. Rev. A96,062321(2017)
work page 2017
-
[4]
S. Pallister, N. Linden, and A. Montanaro, Optimal ver- ification of entangled states with local measurements, Phys. Rev. Lett.120,170502(2018)
work page 2018
-
[5]
Y. Takeuchi and T. Morimae, Verification of many-qubit states, Phys. Rev. X8,021060(2018)
work page 2018
-
[6]
X.-D. Yu, J. Shang, and O. Gühne, Optimal verification of general bipartite pure states, npj Quantum Inf.5,112 (2019)
work page 2019
-
[7]
S. T. Flammia and Y.-K. Liu, Direct fidelity estimation from few Pauli measurements, Phys. Rev. Lett.106, 230501(2011)
work page 2011
-
[8]
M. A. Nielsen and I. L. Chuang,Quantum Computation and Quantum Information,2nd ed. (Cambridge University Press, Cambridge, UK,2010)
work page 2010
- [9]
-
[10]
M. Paris and J. ˇRehᡠcek, eds.,Quantum State Estimation, Lecture Notes in Physics, Vol.649(Springer-Verlag Berlin Heidelberg,2004)
work page 2004
-
[11]
T. Opatrný, D.-G. Welsch, and W. Vogel, Least-squares inversion for density-matrix reconstruction, Phys. Rev. A 56,1788(1997)
work page 1997
-
[12]
F. Huszár and N. M. T. Houlsby, Adaptive Bayesian quan- tum tomography, Phys. Rev. A85,052120(2012)
work page 2012
-
[13]
Blume-Kohout, Optimal, reliable estimation of quan- tum states, New J
R. Blume-Kohout, Optimal, reliable estimation of quan- tum states, New J. Phys.12,043034(2010)
work page 2010
-
[14]
J. ˇRehᡠcek, Z. Hradil, and M. Ježek, Iterative algorithm for reconstruction of entangled states, Phys. Rev. A63, 040303(2001)
work page 2001
-
[15]
B. Qi, Z. Hou, L. Li, D. Dong, G.-Y. Xiang, and G.-C. Guo, Quantum state tomography via linear regression estimation, Sci. Rep.3,3496(2013)
work page 2013
-
[16]
B. Qi, Z. Hou, Y. Wang, D. Dong, H.-S. Zhong, L. Li, G.-Y. Xiang, H. M. Wiseman, C.-F. Li, and G.-C. Guo, Adaptive quantum state tomography via linear regression estima- tion: Theory and two-qubit experiment, npj Quantum Inf.3,19(2017)
work page 2017
-
[17]
I. Strandberg, Simple, reliable, and noise-resilient continuous-variable quantum state tomography with convex optimization, Phys. Rev. Appl.18,044041(2022)
work page 2022
-
[18]
D. Gross, Y.-K. Liu, S. T. Flammia, S. Becker, and J. Eis- ert, Quantum state tomography via compressed sensing, Phys. Rev. Lett.105,150401(2010)
work page 2010
-
[19]
A. Shabani, R. L. Kosut, M. Mohseni, H. Rabitz, M. A. Broome, M. P . Almeida, A. Fedrizzi, and A. G. White, Ef- ficient measurement of quantum dynamics via compres- sive sensing, Phys. Rev. Lett.106,100401(2011)
work page 2011
-
[20]
K. M. Sherbert, N. Naimipour, H. Safavi, H. C. Shaw, and M. Soltanalian, Quantum compressive sensing: Mathe- matical machinery, quantum algorithms, and quantum circuitry, Appl. Sci.12,7525(2022)
work page 2022
-
[21]
D. T. Lennon, H. Moon, L. C. Camenzind, L. Yu, D. M. Zumbühl, G. A. D. Briggs, M. A. Osborne, E. A. Laird, and N. Ares, Efficiently measuring a quantum device us- ing machine learning, npj Quantum Inf.5,79(2019)
work page 2019
- [22]
- [23]
- [24]
-
[25]
F. Rosenblatt,Principles of Neurodynamics: Perceptrons and the Theory of Brain Mechanisms(Spartan Books, Washing- ton, DC,1961)
work page 1961
-
[26]
P . J. Werbos, Applications of advances in nonlinear sen- sitivity analysis, inSystem Modeling and Optimization (Springer Berlin Heidelberg, Berlin, Heidelberg,1982) p. 762
work page 1982
-
[27]
Cybenko, Approximation by superpositions of a sig- moidal function, Math
G. Cybenko, Approximation by superpositions of a sig- moidal function, Math. Control Signals Syst.2,303 (1989)
work page 1989
-
[28]
G. E. Hinton, S. Osindero, and Y.-W. Teh, A fast learning algorithm for deep belief nets, Neural Comput.18,1527 (2006)
work page 2006
- [29]
-
[30]
E. Shelhamer, J. Long, and T. Darrell, Fully convolutional networks for semantic segmentation, IEEE TPAMI39,640 (2017)
work page 2017
-
[31]
J. Ho, A. Jain, and P . Abbeel, Denoising diffusion proba- bilistic models, inProceedings of the34th International Con- ference on Neural Information Processing Systems, NIPS ’20 (Curran Associates Inc., Red Hook, NY, USA,2020)
work page 2020
-
[32]
A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, L. Kaiser, and I. Polosukhin, Attention is all you need, arXiv:1706.03762
work page internal anchor Pith review Pith/arXiv arXiv
- [33]
- [34]
-
[35]
T. Xin, S. Lu, N. Cao, G. Anikeeva, D. Lu, J. Li, G. Long, and B. Zeng, Local-measurement-based quantum state tomography via neural networks, npj Quantum Inf.5, 109(2019)
work page 2019
-
[36]
Neural network state estimation for full quantum state tomography
Q. Xu and S. Xu, Neural network state estimation for full quantum state tomography, arXiv:1811.06654
work page internal anchor Pith review Pith/arXiv arXiv
-
[37]
A. M. Palmieri, G. Müller-Rigat, A. K. Srivastava, M. Lewenstein, G. Rajchel-Mieldzio´ c, and M. Płodzie ´ n, Enhancing quantum state tomography via resource- efficient attention-based neural networks, Phys. Rev. Res. 6,033248(2024)
work page 2024
-
[38]
A. Gaikwad, O. Bihani, Arvind, and K. Dorai, Neural- network-assisted quantum state and process tomography using limited data sets, Phys. Rev. A109,012402(2024)
work page 2024
-
[39]
Y. Quek, S. Fort, and H. K. Ng, Adaptive quantum state tomography with neural networks, npj Quantum Inf.7, 105(2021)
work page 2021
-
[40]
G. Torlai and R. G. Melko, Machine-learning quantum states in the NISQ era, Annu. Rev. Condens. Matter Phys. 11,325(2020)
work page 2020
-
[41]
H. Ma, D. Dong, I. R. Petersen, C.-J. Huang, and G.-Y. Xiang, Neural networks for quantum state tomography with constrained measurements, Quantum Inf. Process. 23,317(2024). 8
work page 2024
-
[42]
N. J. Higham, inReliable Numerical Computation(Oxford University Press, Oxford,1990) p.161
work page 1990
-
[43]
S. L. Harris and D. M. Harris, inDigital Design and Com- puter Architecture(Morgan Kaufmann, Boston,2016) p. 108
work page 2016
-
[44]
Normalization techniques in training dnns: Methodology, analysis and application,
L. Huang, J. Qin, Y. Zhou, F. Zhu, L. Liu, and L. Shao, Normalization techniques in training DNNs: Methodol- ogy, analysis and application, arXiv:2009.12836
- [45]
-
[46]
https://github.com/CyberPardofelis/Error- mitigated_QST_NN
-
[47]
M. D. de Burgh, N. K. Langford, A. C. Doherty, and A. Gilchrist, Choice of measurement sets in qubit tomog- raphy, Phys. Rev. A78,052122(2008)
work page 2008
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.