pith. sign in

arxiv: 2602.09733 · v2 · submitted 2026-02-10 · 🪐 quant-ph

Error-mitigated quantum state tomography using neural networks

Pith reviewed 2026-05-16 05:18 UTC · model grok-4.3

classification 🪐 quant-ph
keywords quantum state tomographyneural networkserror mitigationsupervised learningquantum informationnoise reductionmultilayer perceptronstate reconstruction
0
0 comments X

The pith

Neural network tomography mitigates unknown noise in quantum state reconstruction without explicit models.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes a quantum state tomography method that uses multilayer perceptron networks to reduce the impact of experimental noise on reconstructed states. Training occurs via supervised learning on simulated noisy data, so the approach requires no specific noise model or measurement assumptions. Simulations test the method on pure states through random mixed states and show higher accuracy than unmitigated tomography. The data-driven design makes it scalable and applicable to general quantum systems. This addresses a core barrier to reliable characterization in real devices where noise is pervasive but hard to model exactly.

Core claim

A scalable tomography method based on multilayer perceptron networks mitigates unknown noise through supervised learning. This approach is data-driven and thus does not rely on explicit assumptions about the noise model or measurement, making it readily extendable to general quantum systems. Numerical simulations, ranging from special pure states to random mixed states, demonstrate that the proposed method effectively mitigates noise across a broad range of scenarios, compared with the case without mitigation.

What carries the argument

Multilayer perceptron networks trained by supervised learning on simulated noisy measurement data to output denoised quantum state estimates.

If this is right

  • Accurate state estimation becomes feasible on noisy quantum hardware without needing to characterize the noise source first.
  • The method extends to larger systems because it avoids explicit noise modeling and scales with available simulation data.
  • Reconstructed states achieve higher fidelity in scenarios ranging from pure to mixed, improving downstream tasks that depend on state knowledge.
  • The approach works across diverse noise types since it learns from data rather than assuming a fixed model.
  • It provides a practical route to tomography on near-term devices where noise is dominant but unknown in detail.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same supervised-learning structure could be adapted to quantum process tomography or other characterization tasks by changing the input-output pairs.
  • Hardware-specific training data generated from detailed device simulations might further boost performance on particular platforms.
  • Combining the network output with existing error-correction protocols could reduce the overhead needed for reliable quantum computation.
  • The method might serve as a post-processing layer for data collected from other tomography protocols, such as compressed sensing approaches.

Load-bearing premise

Training the network on simulated noisy data will allow it to generalize and mitigate noise in actual experimental measurements without any explicit noise model or measurement assumptions.

What would settle it

Apply the trained network to real experimental tomography data from a quantum device, then compare the fidelity of the reconstructed state against a high-precision reference measurement or known ground truth to check whether accuracy improves over standard tomography.

read the original abstract

The reliable characterization of quantum states is a fundamental task in quantum information science. For this purpose, quantum state tomography provides a standard framework for reconstructing quantum states from measurement data, yet it is often degraded by experimental noise. Mitigating such noise is therefore essential for the accurate estimation of the states in realistic settings. In this work, we propose a scalable tomography method based on multilayer perceptron networks that mitigate unknown noise through supervised learning. This approach is data-driven and thus does not rely on explicit assumptions about the noise model or measurement, making it readily extendable to general quantum systems. Numerical simulations, ranging from special pure states to random mixed states, demonstrate that the proposed method effectively mitigates noise across a broad range of scenarios, compared with the case without mitigation.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper proposes a data-driven quantum state tomography method that uses a multilayer perceptron trained via supervised learning on simulated noisy measurement data to mitigate unknown noise without requiring an explicit noise model or measurement assumptions. Numerical simulations spanning special pure states to random mixed states are presented to show improved state reconstruction fidelity relative to standard tomography without mitigation.

Significance. If the generalization from simulation to experiment holds, the approach could offer a scalable, model-free tool for accurate state estimation on noisy quantum hardware, reducing reliance on detailed noise characterization and extending readily to larger systems. The explicit use of supervised learning on simulated data is a clear methodological strength, though its practical impact hinges on transferability.

major comments (2)
  1. [Numerical simulations] Numerical simulations section: the central claim of effective noise mitigation across pure and mixed states rests on these runs, yet the manuscript provides insufficient detail on network architecture (layer count, neuron sizes), training procedure (loss function, optimizer, dataset generation), and quantitative metrics (fidelity or trace-distance improvements with error bars), preventing assessment of whether results are robust or tied to specific simulation conditions.
  2. [Method and results] Method and results sections: the load-bearing assumption that a network trained solely on simulated noisy tomography data will generalize to real experimental measurements (without explicit noise models) is unvalidated; no tests with mismatched noise (e.g., unmodeled crosstalk, calibration drift, or non-Markovian effects) or actual hardware data are reported, which directly limits the model-free applicability claim.
minor comments (2)
  1. The description of input/output dimensions for the perceptron relative to the number of measurement settings could be clarified with an explicit equation or diagram.
  2. [Introduction] A few references to prior neural-network tomography works appear missing or under-cited in the introduction.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed and constructive report. We have revised the manuscript to address the major comments on numerical details and have added explicit discussion of the simulation-to-experiment gap. Below we respond point by point.

read point-by-point responses
  1. Referee: [Numerical simulations] Numerical simulations section: the central claim of effective noise mitigation across pure and mixed states rests on these runs, yet the manuscript provides insufficient detail on network architecture (layer count, neuron sizes), training procedure (loss function, optimizer, dataset generation), and quantitative metrics (fidelity or trace-distance improvements with error bars), preventing assessment of whether results are robust or tied to specific simulation conditions.

    Authors: We agree that the original manuscript lacked sufficient implementation details for reproducibility. In the revised version we have expanded the Numerical simulations section with: (i) the exact multilayer perceptron architecture (3 hidden layers with 256, 128 and 64 neurons, ReLU activations, linear output); (ii) training details (Adam optimizer with learning rate 10^{-3}, mean-squared-error loss on the vectorized density-matrix elements, 10^5 training samples generated from random pure and mixed states with simulated depolarizing and amplitude-damping noise); (iii) quantitative results reported as mean fidelity and trace distance over 100 independent runs, each with standard-error bars. These additions allow direct assessment of robustness. revision: yes

  2. Referee: [Method and results] Method and results sections: the load-bearing assumption that a network trained solely on simulated noisy tomography data will generalize to real experimental measurements (without explicit noise models) is unvalidated; no tests with mismatched noise (e.g., unmodeled crosstalk, calibration drift, or non-Markovian effects) or actual hardware data are reported, which directly limits the model-free applicability claim.

    Authors: We acknowledge that the present work contains only numerical simulations and does not demonstrate transfer to real hardware or to deliberately mismatched noise models. This is a genuine limitation of the current study, which is intended as a controlled proof-of-concept for the data-driven approach. In the revised manuscript we have added a dedicated paragraph in the Discussion section that explicitly states this limitation, quantifies the simulation noise models used, and outlines the experimental validation steps required for future work. The core methodological claim—that the network learns a mapping without an explicit noise model—remains supported by the simulation results. revision: partial

Circularity Check

0 steps flagged

No circularity: method is a standard supervised NN trained and evaluated on separate simulated datasets

full rationale

The paper describes a multilayer perceptron trained via supervised learning on simulated noisy tomography data to map to quantum states, with performance shown on separate numerical simulations from pure states to random mixed states. No derivation step reduces by construction to its inputs: the network parameters are fitted on training simulations and then applied to distinct test simulations, which is standard non-circular ML evaluation. No self-citations, uniqueness theorems, or ansatzes from prior author work are invoked as load-bearing; the approach is explicitly data-driven without explicit noise models. The central claim therefore rests on independent simulation benchmarks rather than any self-referential fit or renaming.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

The central claim rests on the domain assumption that supervised learning on simulated data can learn a general noise-mitigation mapping without explicit noise models. No free parameters are named because architecture and training details are absent from the abstract. No new physical entities are introduced.

free parameters (1)
  • Neural network weights and biases
    Learned during supervised training on simulated noisy versus ideal data; exact count and initialization not specified.
axioms (1)
  • domain assumption Supervised learning on simulated data suffices to mitigate unknown experimental noise without any noise model assumptions
    Invoked in the description of the data-driven method that 'does not rely on explicit assumptions about the noise model or measurement'.

pith-pipeline@v0.9.0 · 5421 in / 1288 out tokens · 40673 ms · 2026-05-16T05:18:52.028087+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

47 extracted references · 47 canonical work pages · 2 internal anchors

  1. [1]

    H. M. Wiseman and G. J. Milburn,Quantum Measurement and Control(Cambridge University Press, Cambridge, 2009). 7

  2. [2]

    Ježek, J

    M. Ježek, J. Fiurášek, and Z. Hradil, Quantum inference of states and processes, Phys. Rev. A68,012305(2003)

  3. [3]

    Morimae, Y

    T. Morimae, Y. Takeuchi, and M. Hayashi, Verification of hypergraph states, Phys. Rev. A96,062321(2017)

  4. [4]

    Pallister, N

    S. Pallister, N. Linden, and A. Montanaro, Optimal ver- ification of entangled states with local measurements, Phys. Rev. Lett.120,170502(2018)

  5. [5]

    Takeuchi and T

    Y. Takeuchi and T. Morimae, Verification of many-qubit states, Phys. Rev. X8,021060(2018)

  6. [6]

    X.-D. Yu, J. Shang, and O. Gühne, Optimal verification of general bipartite pure states, npj Quantum Inf.5,112 (2019)

  7. [7]

    S. T. Flammia and Y.-K. Liu, Direct fidelity estimation from few Pauli measurements, Phys. Rev. Lett.106, 230501(2011)

  8. [8]

    M. A. Nielsen and I. L. Chuang,Quantum Computation and Quantum Information,2nd ed. (Cambridge University Press, Cambridge, UK,2010)

  9. [9]

    Gisin and R

    N. Gisin and R. Thew, Quantum communication, Nat. Photon.1,165(2007)

  10. [10]

    Paris and J

    M. Paris and J. ˇRehᡠcek, eds.,Quantum State Estimation, Lecture Notes in Physics, Vol.649(Springer-Verlag Berlin Heidelberg,2004)

  11. [11]

    Opatrný, D.-G

    T. Opatrný, D.-G. Welsch, and W. Vogel, Least-squares inversion for density-matrix reconstruction, Phys. Rev. A 56,1788(1997)

  12. [12]

    Huszár and N

    F. Huszár and N. M. T. Houlsby, Adaptive Bayesian quan- tum tomography, Phys. Rev. A85,052120(2012)

  13. [13]

    Blume-Kohout, Optimal, reliable estimation of quan- tum states, New J

    R. Blume-Kohout, Optimal, reliable estimation of quan- tum states, New J. Phys.12,043034(2010)

  14. [14]

    ˇRehᡠcek, Z

    J. ˇRehᡠcek, Z. Hradil, and M. Ježek, Iterative algorithm for reconstruction of entangled states, Phys. Rev. A63, 040303(2001)

  15. [15]

    B. Qi, Z. Hou, L. Li, D. Dong, G.-Y. Xiang, and G.-C. Guo, Quantum state tomography via linear regression estimation, Sci. Rep.3,3496(2013)

  16. [16]

    B. Qi, Z. Hou, Y. Wang, D. Dong, H.-S. Zhong, L. Li, G.-Y. Xiang, H. M. Wiseman, C.-F. Li, and G.-C. Guo, Adaptive quantum state tomography via linear regression estima- tion: Theory and two-qubit experiment, npj Quantum Inf.3,19(2017)

  17. [17]

    Strandberg, Simple, reliable, and noise-resilient continuous-variable quantum state tomography with convex optimization, Phys

    I. Strandberg, Simple, reliable, and noise-resilient continuous-variable quantum state tomography with convex optimization, Phys. Rev. Appl.18,044041(2022)

  18. [18]

    Gross, Y.-K

    D. Gross, Y.-K. Liu, S. T. Flammia, S. Becker, and J. Eis- ert, Quantum state tomography via compressed sensing, Phys. Rev. Lett.105,150401(2010)

  19. [19]

    Shabani, R

    A. Shabani, R. L. Kosut, M. Mohseni, H. Rabitz, M. A. Broome, M. P . Almeida, A. Fedrizzi, and A. G. White, Ef- ficient measurement of quantum dynamics via compres- sive sensing, Phys. Rev. Lett.106,100401(2011)

  20. [20]

    K. M. Sherbert, N. Naimipour, H. Safavi, H. C. Shaw, and M. Soltanalian, Quantum compressive sensing: Mathe- matical machinery, quantum algorithms, and quantum circuitry, Appl. Sci.12,7525(2022)

  21. [21]

    D. T. Lennon, H. Moon, L. C. Camenzind, L. Yu, D. M. Zumbühl, G. A. D. Briggs, M. A. Osborne, E. A. Laird, and N. Ares, Efficiently measuring a quantum device us- ing machine learning, npj Quantum Inf.5,79(2019)

  22. [22]

    Fösel, P

    T. Fösel, P . Tighineanu, T. Weiss, and F. Marquardt, Re- inforcement learning with neural networks for quantum feedback, Phys. Rev. X8,031084(2018)

  23. [23]

    Huang, H

    C.-J. Huang, H. Ma, Q. Yin, J.-F. Tang, D. Dong, C. Chen, G.-Y. Xiang, C.-F. Li, and G.-C. Guo, Realization of a quantum autoencoder for lossless compression of quan- tum data, Phys. Rev. A102,032412(2020)

  24. [24]

    Ma, C.-J

    H. Ma, C.-J. Huang, C. Chen, D. Dong, Y. Wang, R.-B. Wu, and G.-Y. Xiang, On compression rate of quantum autoencoders: Control design, numerical and experimen- tal realization, Automatica147,110659(2023)

  25. [25]

    Rosenblatt,Principles of Neurodynamics: Perceptrons and the Theory of Brain Mechanisms(Spartan Books, Washing- ton, DC,1961)

    F. Rosenblatt,Principles of Neurodynamics: Perceptrons and the Theory of Brain Mechanisms(Spartan Books, Washing- ton, DC,1961)

  26. [26]

    P . J. Werbos, Applications of advances in nonlinear sen- sitivity analysis, inSystem Modeling and Optimization (Springer Berlin Heidelberg, Berlin, Heidelberg,1982) p. 762

  27. [27]

    Cybenko, Approximation by superpositions of a sig- moidal function, Math

    G. Cybenko, Approximation by superpositions of a sig- moidal function, Math. Control Signals Syst.2,303 (1989)

  28. [28]

    G. E. Hinton, S. Osindero, and Y.-W. Teh, A fast learning algorithm for deep belief nets, Neural Comput.18,1527 (2006)

  29. [29]

    LeCun, Y

    Y. LeCun, Y. Bengio, and G. Hinton, Deep learning, Na- ture521,436(2015)

  30. [30]

    Shelhamer, J

    E. Shelhamer, J. Long, and T. Darrell, Fully convolutional networks for semantic segmentation, IEEE TPAMI39,640 (2017)

  31. [31]

    J. Ho, A. Jain, and P . Abbeel, Denoising diffusion proba- bilistic models, inProceedings of the34th International Con- ference on Neural Information Processing Systems, NIPS ’20 (Curran Associates Inc., Red Hook, NY, USA,2020)

  32. [32]

    Attention Is All You Need

    A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, L. Kaiser, and I. Polosukhin, Attention is all you need, arXiv:1706.03762

  33. [33]

    Torlai, G

    G. Torlai, G. Mazzola, J. Carrasquilla, M. Troyer, R. Melko, and G. Carleo, Neural-network quantum state tomography, Nat. Phys.14,447(2018)

  34. [34]

    Ahmed, C

    S. Ahmed, C. S. Muñoz, F. Nori, and A. F. Kockum, Quantum state tomography with conditional generative adversarial networks, Phys. Rev. Lett.127,140502(2021)

  35. [35]

    T. Xin, S. Lu, N. Cao, G. Anikeeva, D. Lu, J. Li, G. Long, and B. Zeng, Local-measurement-based quantum state tomography via neural networks, npj Quantum Inf.5, 109(2019)

  36. [36]

    Neural network state estimation for full quantum state tomography

    Q. Xu and S. Xu, Neural network state estimation for full quantum state tomography, arXiv:1811.06654

  37. [37]

    A. M. Palmieri, G. Müller-Rigat, A. K. Srivastava, M. Lewenstein, G. Rajchel-Mieldzio´ c, and M. Płodzie ´ n, Enhancing quantum state tomography via resource- efficient attention-based neural networks, Phys. Rev. Res. 6,033248(2024)

  38. [38]

    Gaikwad, O

    A. Gaikwad, O. Bihani, Arvind, and K. Dorai, Neural- network-assisted quantum state and process tomography using limited data sets, Phys. Rev. A109,012402(2024)

  39. [39]

    Y. Quek, S. Fort, and H. K. Ng, Adaptive quantum state tomography with neural networks, npj Quantum Inf.7, 105(2021)

  40. [40]

    Torlai and R

    G. Torlai and R. G. Melko, Machine-learning quantum states in the NISQ era, Annu. Rev. Condens. Matter Phys. 11,325(2020)

  41. [41]

    H. Ma, D. Dong, I. R. Petersen, C.-J. Huang, and G.-Y. Xiang, Neural networks for quantum state tomography with constrained measurements, Quantum Inf. Process. 23,317(2024). 8

  42. [42]

    N. J. Higham, inReliable Numerical Computation(Oxford University Press, Oxford,1990) p.161

  43. [43]

    S. L. Harris and D. M. Harris, inDigital Design and Com- puter Architecture(Morgan Kaufmann, Boston,2016) p. 108

  44. [44]

    Normalization techniques in training dnns: Methodology, analysis and application,

    L. Huang, J. Qin, Y. Zhou, F. Zhu, L. Liu, and L. Shao, Normalization techniques in training DNNs: Methodol- ogy, analysis and application, arXiv:2009.12836

  45. [45]

    Minsky and S

    M. Minsky and S. A. Papert,Perceptrons(The MIT Press, Cambridge,1969)

  46. [46]

    https://github.com/CyberPardofelis/Error- mitigated_QST_NN

  47. [47]

    M. D. de Burgh, N. K. Langford, A. C. Doherty, and A. Gilchrist, Choice of measurement sets in qubit tomog- raphy, Phys. Rev. A78,052122(2008)