Error-mitigated quantum state tomography using neural networks

Jiangwei Shang; Mengru Ma; Yixuan Hu

arxiv: 2602.09733 · v2 · submitted 2026-02-10 · 🪐 quant-ph

Error-mitigated quantum state tomography using neural networks

Yixuan Hu , Mengru Ma , Jiangwei Shang This is my paper

Pith reviewed 2026-05-16 05:18 UTC · model grok-4.3

classification 🪐 quant-ph

keywords quantum state tomographyneural networkserror mitigationsupervised learningquantum informationnoise reductionmultilayer perceptronstate reconstruction

0 comments

The pith

Neural network tomography mitigates unknown noise in quantum state reconstruction without explicit models.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes a quantum state tomography method that uses multilayer perceptron networks to reduce the impact of experimental noise on reconstructed states. Training occurs via supervised learning on simulated noisy data, so the approach requires no specific noise model or measurement assumptions. Simulations test the method on pure states through random mixed states and show higher accuracy than unmitigated tomography. The data-driven design makes it scalable and applicable to general quantum systems. This addresses a core barrier to reliable characterization in real devices where noise is pervasive but hard to model exactly.

Core claim

A scalable tomography method based on multilayer perceptron networks mitigates unknown noise through supervised learning. This approach is data-driven and thus does not rely on explicit assumptions about the noise model or measurement, making it readily extendable to general quantum systems. Numerical simulations, ranging from special pure states to random mixed states, demonstrate that the proposed method effectively mitigates noise across a broad range of scenarios, compared with the case without mitigation.

What carries the argument

Multilayer perceptron networks trained by supervised learning on simulated noisy measurement data to output denoised quantum state estimates.

If this is right

Accurate state estimation becomes feasible on noisy quantum hardware without needing to characterize the noise source first.
The method extends to larger systems because it avoids explicit noise modeling and scales with available simulation data.
Reconstructed states achieve higher fidelity in scenarios ranging from pure to mixed, improving downstream tasks that depend on state knowledge.
The approach works across diverse noise types since it learns from data rather than assuming a fixed model.
It provides a practical route to tomography on near-term devices where noise is dominant but unknown in detail.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same supervised-learning structure could be adapted to quantum process tomography or other characterization tasks by changing the input-output pairs.
Hardware-specific training data generated from detailed device simulations might further boost performance on particular platforms.
Combining the network output with existing error-correction protocols could reduce the overhead needed for reliable quantum computation.
The method might serve as a post-processing layer for data collected from other tomography protocols, such as compressed sensing approaches.

Load-bearing premise

Training the network on simulated noisy data will allow it to generalize and mitigate noise in actual experimental measurements without any explicit noise model or measurement assumptions.

What would settle it

Apply the trained network to real experimental tomography data from a quantum device, then compare the fidelity of the reconstructed state against a high-precision reference measurement or known ground truth to check whether accuracy improves over standard tomography.

read the original abstract

The reliable characterization of quantum states is a fundamental task in quantum information science. For this purpose, quantum state tomography provides a standard framework for reconstructing quantum states from measurement data, yet it is often degraded by experimental noise. Mitigating such noise is therefore essential for the accurate estimation of the states in realistic settings. In this work, we propose a scalable tomography method based on multilayer perceptron networks that mitigate unknown noise through supervised learning. This approach is data-driven and thus does not rely on explicit assumptions about the noise model or measurement, making it readily extendable to general quantum systems. Numerical simulations, ranging from special pure states to random mixed states, demonstrate that the proposed method effectively mitigates noise across a broad range of scenarios, compared with the case without mitigation.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The neural net improves simulated tomography results but offers no evidence it transfers to real experiments.

read the letter

The paper trains a multilayer perceptron on simulated noisy tomography data to output cleaner state estimates for general quantum states. This is a direct supervised-learning extension of earlier neural-net tomography work, applied here without an explicit noise model. The simulations cover pure states through random mixed states and show lower reconstruction error than the unmitigated baseline, which is a clean within-distribution result. The data-driven framing is practical because it avoids committing to a particular noise channel. That is the main positive. The central weakness is that every reported number comes from the same simulation distribution used for training. No hardware data, no out-of-distribution noise tests, and no checks against effects like crosstalk or drift appear in the work. Because the method is explicitly model-free, performance on real devices hinges on whether the training distribution matches experiment; the paper supplies no evidence on that point. Network size, training schedule, and exact error metrics are also not detailed enough to judge robustness. The citation list is standard and appropriate. This is useful reading for groups already running tomography on noisy hardware who want to try a learned correction, but it functions as a simulation study rather than a validated tool. A serious editor should send it to review so referees can ask for the missing architecture details and at least one real-device test case.

Referee Report

2 major / 2 minor

Summary. The paper proposes a data-driven quantum state tomography method that uses a multilayer perceptron trained via supervised learning on simulated noisy measurement data to mitigate unknown noise without requiring an explicit noise model or measurement assumptions. Numerical simulations spanning special pure states to random mixed states are presented to show improved state reconstruction fidelity relative to standard tomography without mitigation.

Significance. If the generalization from simulation to experiment holds, the approach could offer a scalable, model-free tool for accurate state estimation on noisy quantum hardware, reducing reliance on detailed noise characterization and extending readily to larger systems. The explicit use of supervised learning on simulated data is a clear methodological strength, though its practical impact hinges on transferability.

major comments (2)

[Numerical simulations] Numerical simulations section: the central claim of effective noise mitigation across pure and mixed states rests on these runs, yet the manuscript provides insufficient detail on network architecture (layer count, neuron sizes), training procedure (loss function, optimizer, dataset generation), and quantitative metrics (fidelity or trace-distance improvements with error bars), preventing assessment of whether results are robust or tied to specific simulation conditions.
[Method and results] Method and results sections: the load-bearing assumption that a network trained solely on simulated noisy tomography data will generalize to real experimental measurements (without explicit noise models) is unvalidated; no tests with mismatched noise (e.g., unmodeled crosstalk, calibration drift, or non-Markovian effects) or actual hardware data are reported, which directly limits the model-free applicability claim.

minor comments (2)

The description of input/output dimensions for the perceptron relative to the number of measurement settings could be clarified with an explicit equation or diagram.
[Introduction] A few references to prior neural-network tomography works appear missing or under-cited in the introduction.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed and constructive report. We have revised the manuscript to address the major comments on numerical details and have added explicit discussion of the simulation-to-experiment gap. Below we respond point by point.

read point-by-point responses

Referee: [Numerical simulations] Numerical simulations section: the central claim of effective noise mitigation across pure and mixed states rests on these runs, yet the manuscript provides insufficient detail on network architecture (layer count, neuron sizes), training procedure (loss function, optimizer, dataset generation), and quantitative metrics (fidelity or trace-distance improvements with error bars), preventing assessment of whether results are robust or tied to specific simulation conditions.

Authors: We agree that the original manuscript lacked sufficient implementation details for reproducibility. In the revised version we have expanded the Numerical simulations section with: (i) the exact multilayer perceptron architecture (3 hidden layers with 256, 128 and 64 neurons, ReLU activations, linear output); (ii) training details (Adam optimizer with learning rate 10^{-3}, mean-squared-error loss on the vectorized density-matrix elements, 10^5 training samples generated from random pure and mixed states with simulated depolarizing and amplitude-damping noise); (iii) quantitative results reported as mean fidelity and trace distance over 100 independent runs, each with standard-error bars. These additions allow direct assessment of robustness. revision: yes
Referee: [Method and results] Method and results sections: the load-bearing assumption that a network trained solely on simulated noisy tomography data will generalize to real experimental measurements (without explicit noise models) is unvalidated; no tests with mismatched noise (e.g., unmodeled crosstalk, calibration drift, or non-Markovian effects) or actual hardware data are reported, which directly limits the model-free applicability claim.

Authors: We acknowledge that the present work contains only numerical simulations and does not demonstrate transfer to real hardware or to deliberately mismatched noise models. This is a genuine limitation of the current study, which is intended as a controlled proof-of-concept for the data-driven approach. In the revised manuscript we have added a dedicated paragraph in the Discussion section that explicitly states this limitation, quantifies the simulation noise models used, and outlines the experimental validation steps required for future work. The core methodological claim—that the network learns a mapping without an explicit noise model—remains supported by the simulation results. revision: partial

Circularity Check

0 steps flagged

No circularity: method is a standard supervised NN trained and evaluated on separate simulated datasets

full rationale

The paper describes a multilayer perceptron trained via supervised learning on simulated noisy tomography data to map to quantum states, with performance shown on separate numerical simulations from pure states to random mixed states. No derivation step reduces by construction to its inputs: the network parameters are fitted on training simulations and then applied to distinct test simulations, which is standard non-circular ML evaluation. No self-citations, uniqueness theorems, or ansatzes from prior author work are invoked as load-bearing; the approach is explicitly data-driven without explicit noise models. The central claim therefore rests on independent simulation benchmarks rather than any self-referential fit or renaming.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

The central claim rests on the domain assumption that supervised learning on simulated data can learn a general noise-mitigation mapping without explicit noise models. No free parameters are named because architecture and training details are absent from the abstract. No new physical entities are introduced.

free parameters (1)

Neural network weights and biases
Learned during supervised training on simulated noisy versus ideal data; exact count and initialization not specified.

axioms (1)

domain assumption Supervised learning on simulated data suffices to mitigate unknown experimental noise without any noise model assumptions
Invoked in the description of the data-driven method that 'does not rely on explicit assumptions about the noise model or measurement'.

pith-pipeline@v0.9.0 · 5421 in / 1288 out tokens · 40673 ms · 2026-05-16T05:18:52.028087+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

47 extracted references · 47 canonical work pages · 2 internal anchors

[1]

H. M. Wiseman and G. J. Milburn,Quantum Measurement and Control(Cambridge University Press, Cambridge, 2009). 7

work page 2009
[2]

Ježek, J

M. Ježek, J. Fiurášek, and Z. Hradil, Quantum inference of states and processes, Phys. Rev. A68,012305(2003)

work page 2003
[3]

Morimae, Y

T. Morimae, Y. Takeuchi, and M. Hayashi, Verification of hypergraph states, Phys. Rev. A96,062321(2017)

work page 2017
[4]

Pallister, N

S. Pallister, N. Linden, and A. Montanaro, Optimal ver- ification of entangled states with local measurements, Phys. Rev. Lett.120,170502(2018)

work page 2018
[5]

Takeuchi and T

Y. Takeuchi and T. Morimae, Verification of many-qubit states, Phys. Rev. X8,021060(2018)

work page 2018
[6]

X.-D. Yu, J. Shang, and O. Gühne, Optimal verification of general bipartite pure states, npj Quantum Inf.5,112 (2019)

work page 2019
[7]

S. T. Flammia and Y.-K. Liu, Direct fidelity estimation from few Pauli measurements, Phys. Rev. Lett.106, 230501(2011)

work page 2011
[8]

M. A. Nielsen and I. L. Chuang,Quantum Computation and Quantum Information,2nd ed. (Cambridge University Press, Cambridge, UK,2010)

work page 2010
[9]

Gisin and R

N. Gisin and R. Thew, Quantum communication, Nat. Photon.1,165(2007)

work page 2007
[10]

Paris and J

M. Paris and J. ˇReháˇ cek, eds.,Quantum State Estimation, Lecture Notes in Physics, Vol.649(Springer-Verlag Berlin Heidelberg,2004)

work page 2004
[11]

Opatrný, D.-G

T. Opatrný, D.-G. Welsch, and W. Vogel, Least-squares inversion for density-matrix reconstruction, Phys. Rev. A 56,1788(1997)

work page 1997
[12]

Huszár and N

F. Huszár and N. M. T. Houlsby, Adaptive Bayesian quan- tum tomography, Phys. Rev. A85,052120(2012)

work page 2012
[13]

Blume-Kohout, Optimal, reliable estimation of quan- tum states, New J

R. Blume-Kohout, Optimal, reliable estimation of quan- tum states, New J. Phys.12,043034(2010)

work page 2010
[14]

ˇReháˇ cek, Z

J. ˇReháˇ cek, Z. Hradil, and M. Ježek, Iterative algorithm for reconstruction of entangled states, Phys. Rev. A63, 040303(2001)

work page 2001
[15]

B. Qi, Z. Hou, L. Li, D. Dong, G.-Y. Xiang, and G.-C. Guo, Quantum state tomography via linear regression estimation, Sci. Rep.3,3496(2013)

work page 2013
[16]

B. Qi, Z. Hou, Y. Wang, D. Dong, H.-S. Zhong, L. Li, G.-Y. Xiang, H. M. Wiseman, C.-F. Li, and G.-C. Guo, Adaptive quantum state tomography via linear regression estima- tion: Theory and two-qubit experiment, npj Quantum Inf.3,19(2017)

work page 2017
[17]

Strandberg, Simple, reliable, and noise-resilient continuous-variable quantum state tomography with convex optimization, Phys

I. Strandberg, Simple, reliable, and noise-resilient continuous-variable quantum state tomography with convex optimization, Phys. Rev. Appl.18,044041(2022)

work page 2022
[18]

Gross, Y.-K

D. Gross, Y.-K. Liu, S. T. Flammia, S. Becker, and J. Eis- ert, Quantum state tomography via compressed sensing, Phys. Rev. Lett.105,150401(2010)

work page 2010
[19]

Shabani, R

A. Shabani, R. L. Kosut, M. Mohseni, H. Rabitz, M. A. Broome, M. P . Almeida, A. Fedrizzi, and A. G. White, Ef- ficient measurement of quantum dynamics via compres- sive sensing, Phys. Rev. Lett.106,100401(2011)

work page 2011
[20]

K. M. Sherbert, N. Naimipour, H. Safavi, H. C. Shaw, and M. Soltanalian, Quantum compressive sensing: Mathe- matical machinery, quantum algorithms, and quantum circuitry, Appl. Sci.12,7525(2022)

work page 2022
[21]

D. T. Lennon, H. Moon, L. C. Camenzind, L. Yu, D. M. Zumbühl, G. A. D. Briggs, M. A. Osborne, E. A. Laird, and N. Ares, Efficiently measuring a quantum device us- ing machine learning, npj Quantum Inf.5,79(2019)

work page 2019
[22]

Fösel, P

T. Fösel, P . Tighineanu, T. Weiss, and F. Marquardt, Re- inforcement learning with neural networks for quantum feedback, Phys. Rev. X8,031084(2018)

work page 2018
[23]

Huang, H

C.-J. Huang, H. Ma, Q. Yin, J.-F. Tang, D. Dong, C. Chen, G.-Y. Xiang, C.-F. Li, and G.-C. Guo, Realization of a quantum autoencoder for lossless compression of quan- tum data, Phys. Rev. A102,032412(2020)

work page 2020
[24]

Ma, C.-J

H. Ma, C.-J. Huang, C. Chen, D. Dong, Y. Wang, R.-B. Wu, and G.-Y. Xiang, On compression rate of quantum autoencoders: Control design, numerical and experimen- tal realization, Automatica147,110659(2023)

work page 2023
[25]

Rosenblatt,Principles of Neurodynamics: Perceptrons and the Theory of Brain Mechanisms(Spartan Books, Washing- ton, DC,1961)

F. Rosenblatt,Principles of Neurodynamics: Perceptrons and the Theory of Brain Mechanisms(Spartan Books, Washing- ton, DC,1961)

work page 1961
[26]

P . J. Werbos, Applications of advances in nonlinear sen- sitivity analysis, inSystem Modeling and Optimization (Springer Berlin Heidelberg, Berlin, Heidelberg,1982) p. 762

work page 1982
[27]

Cybenko, Approximation by superpositions of a sig- moidal function, Math

G. Cybenko, Approximation by superpositions of a sig- moidal function, Math. Control Signals Syst.2,303 (1989)

work page 1989
[28]

G. E. Hinton, S. Osindero, and Y.-W. Teh, A fast learning algorithm for deep belief nets, Neural Comput.18,1527 (2006)

work page 2006
[29]

LeCun, Y

Y. LeCun, Y. Bengio, and G. Hinton, Deep learning, Na- ture521,436(2015)

work page 2015
[30]

Shelhamer, J

E. Shelhamer, J. Long, and T. Darrell, Fully convolutional networks for semantic segmentation, IEEE TPAMI39,640 (2017)

work page 2017
[31]

J. Ho, A. Jain, and P . Abbeel, Denoising diffusion proba- bilistic models, inProceedings of the34th International Con- ference on Neural Information Processing Systems, NIPS ’20 (Curran Associates Inc., Red Hook, NY, USA,2020)

work page 2020
[32]

Attention Is All You Need

A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, L. Kaiser, and I. Polosukhin, Attention is all you need, arXiv:1706.03762

work page internal anchor Pith review Pith/arXiv arXiv
[33]

Torlai, G

G. Torlai, G. Mazzola, J. Carrasquilla, M. Troyer, R. Melko, and G. Carleo, Neural-network quantum state tomography, Nat. Phys.14,447(2018)

work page 2018
[34]

Ahmed, C

S. Ahmed, C. S. Muñoz, F. Nori, and A. F. Kockum, Quantum state tomography with conditional generative adversarial networks, Phys. Rev. Lett.127,140502(2021)

work page 2021
[35]

T. Xin, S. Lu, N. Cao, G. Anikeeva, D. Lu, J. Li, G. Long, and B. Zeng, Local-measurement-based quantum state tomography via neural networks, npj Quantum Inf.5, 109(2019)

work page 2019
[36]

Neural network state estimation for full quantum state tomography

Q. Xu and S. Xu, Neural network state estimation for full quantum state tomography, arXiv:1811.06654

work page internal anchor Pith review Pith/arXiv arXiv
[37]

A. M. Palmieri, G. Müller-Rigat, A. K. Srivastava, M. Lewenstein, G. Rajchel-Mieldzio´ c, and M. Płodzie ´ n, Enhancing quantum state tomography via resource- efficient attention-based neural networks, Phys. Rev. Res. 6,033248(2024)

work page 2024
[38]

Gaikwad, O

A. Gaikwad, O. Bihani, Arvind, and K. Dorai, Neural- network-assisted quantum state and process tomography using limited data sets, Phys. Rev. A109,012402(2024)

work page 2024
[39]

Y. Quek, S. Fort, and H. K. Ng, Adaptive quantum state tomography with neural networks, npj Quantum Inf.7, 105(2021)

work page 2021
[40]

Torlai and R

G. Torlai and R. G. Melko, Machine-learning quantum states in the NISQ era, Annu. Rev. Condens. Matter Phys. 11,325(2020)

work page 2020
[41]

H. Ma, D. Dong, I. R. Petersen, C.-J. Huang, and G.-Y. Xiang, Neural networks for quantum state tomography with constrained measurements, Quantum Inf. Process. 23,317(2024). 8

work page 2024
[42]

N. J. Higham, inReliable Numerical Computation(Oxford University Press, Oxford,1990) p.161

work page 1990
[43]

S. L. Harris and D. M. Harris, inDigital Design and Com- puter Architecture(Morgan Kaufmann, Boston,2016) p. 108

work page 2016
[44]

Normalization techniques in training dnns: Methodology, analysis and application,

L. Huang, J. Qin, Y. Zhou, F. Zhu, L. Liu, and L. Shao, Normalization techniques in training DNNs: Methodol- ogy, analysis and application, arXiv:2009.12836

work page arXiv 2009
[45]

Minsky and S

M. Minsky and S. A. Papert,Perceptrons(The MIT Press, Cambridge,1969)

work page 1969
[46]

https://github.com/CyberPardofelis/Error- mitigated_QST_NN

work page
[47]

M. D. de Burgh, N. K. Langford, A. C. Doherty, and A. Gilchrist, Choice of measurement sets in qubit tomog- raphy, Phys. Rev. A78,052122(2008)

work page 2008

[1] [1]

H. M. Wiseman and G. J. Milburn,Quantum Measurement and Control(Cambridge University Press, Cambridge, 2009). 7

work page 2009

[2] [2]

Ježek, J

M. Ježek, J. Fiurášek, and Z. Hradil, Quantum inference of states and processes, Phys. Rev. A68,012305(2003)

work page 2003

[3] [3]

Morimae, Y

T. Morimae, Y. Takeuchi, and M. Hayashi, Verification of hypergraph states, Phys. Rev. A96,062321(2017)

work page 2017

[4] [4]

Pallister, N

S. Pallister, N. Linden, and A. Montanaro, Optimal ver- ification of entangled states with local measurements, Phys. Rev. Lett.120,170502(2018)

work page 2018

[5] [5]

Takeuchi and T

Y. Takeuchi and T. Morimae, Verification of many-qubit states, Phys. Rev. X8,021060(2018)

work page 2018

[6] [6]

X.-D. Yu, J. Shang, and O. Gühne, Optimal verification of general bipartite pure states, npj Quantum Inf.5,112 (2019)

work page 2019

[7] [7]

S. T. Flammia and Y.-K. Liu, Direct fidelity estimation from few Pauli measurements, Phys. Rev. Lett.106, 230501(2011)

work page 2011

[8] [8]

M. A. Nielsen and I. L. Chuang,Quantum Computation and Quantum Information,2nd ed. (Cambridge University Press, Cambridge, UK,2010)

work page 2010

[9] [9]

Gisin and R

N. Gisin and R. Thew, Quantum communication, Nat. Photon.1,165(2007)

work page 2007

[10] [10]

Paris and J

M. Paris and J. ˇReháˇ cek, eds.,Quantum State Estimation, Lecture Notes in Physics, Vol.649(Springer-Verlag Berlin Heidelberg,2004)

work page 2004

[11] [11]

Opatrný, D.-G

T. Opatrný, D.-G. Welsch, and W. Vogel, Least-squares inversion for density-matrix reconstruction, Phys. Rev. A 56,1788(1997)

work page 1997

[12] [12]

Huszár and N

F. Huszár and N. M. T. Houlsby, Adaptive Bayesian quan- tum tomography, Phys. Rev. A85,052120(2012)

work page 2012

[13] [13]

Blume-Kohout, Optimal, reliable estimation of quan- tum states, New J

R. Blume-Kohout, Optimal, reliable estimation of quan- tum states, New J. Phys.12,043034(2010)

work page 2010

[14] [14]

ˇReháˇ cek, Z

J. ˇReháˇ cek, Z. Hradil, and M. Ježek, Iterative algorithm for reconstruction of entangled states, Phys. Rev. A63, 040303(2001)

work page 2001

[15] [15]

B. Qi, Z. Hou, L. Li, D. Dong, G.-Y. Xiang, and G.-C. Guo, Quantum state tomography via linear regression estimation, Sci. Rep.3,3496(2013)

work page 2013

[16] [16]

B. Qi, Z. Hou, Y. Wang, D. Dong, H.-S. Zhong, L. Li, G.-Y. Xiang, H. M. Wiseman, C.-F. Li, and G.-C. Guo, Adaptive quantum state tomography via linear regression estima- tion: Theory and two-qubit experiment, npj Quantum Inf.3,19(2017)

work page 2017

[17] [17]

Strandberg, Simple, reliable, and noise-resilient continuous-variable quantum state tomography with convex optimization, Phys

I. Strandberg, Simple, reliable, and noise-resilient continuous-variable quantum state tomography with convex optimization, Phys. Rev. Appl.18,044041(2022)

work page 2022

[18] [18]

Gross, Y.-K

D. Gross, Y.-K. Liu, S. T. Flammia, S. Becker, and J. Eis- ert, Quantum state tomography via compressed sensing, Phys. Rev. Lett.105,150401(2010)

work page 2010

[19] [19]

Shabani, R

A. Shabani, R. L. Kosut, M. Mohseni, H. Rabitz, M. A. Broome, M. P . Almeida, A. Fedrizzi, and A. G. White, Ef- ficient measurement of quantum dynamics via compres- sive sensing, Phys. Rev. Lett.106,100401(2011)

work page 2011

[20] [20]

K. M. Sherbert, N. Naimipour, H. Safavi, H. C. Shaw, and M. Soltanalian, Quantum compressive sensing: Mathe- matical machinery, quantum algorithms, and quantum circuitry, Appl. Sci.12,7525(2022)

work page 2022

[21] [21]

D. T. Lennon, H. Moon, L. C. Camenzind, L. Yu, D. M. Zumbühl, G. A. D. Briggs, M. A. Osborne, E. A. Laird, and N. Ares, Efficiently measuring a quantum device us- ing machine learning, npj Quantum Inf.5,79(2019)

work page 2019

[22] [22]

Fösel, P

T. Fösel, P . Tighineanu, T. Weiss, and F. Marquardt, Re- inforcement learning with neural networks for quantum feedback, Phys. Rev. X8,031084(2018)

work page 2018

[23] [23]

Huang, H

C.-J. Huang, H. Ma, Q. Yin, J.-F. Tang, D. Dong, C. Chen, G.-Y. Xiang, C.-F. Li, and G.-C. Guo, Realization of a quantum autoencoder for lossless compression of quan- tum data, Phys. Rev. A102,032412(2020)

work page 2020

[24] [24]

Ma, C.-J

H. Ma, C.-J. Huang, C. Chen, D. Dong, Y. Wang, R.-B. Wu, and G.-Y. Xiang, On compression rate of quantum autoencoders: Control design, numerical and experimen- tal realization, Automatica147,110659(2023)

work page 2023

[25] [25]

Rosenblatt,Principles of Neurodynamics: Perceptrons and the Theory of Brain Mechanisms(Spartan Books, Washing- ton, DC,1961)

F. Rosenblatt,Principles of Neurodynamics: Perceptrons and the Theory of Brain Mechanisms(Spartan Books, Washing- ton, DC,1961)

work page 1961

[26] [26]

P . J. Werbos, Applications of advances in nonlinear sen- sitivity analysis, inSystem Modeling and Optimization (Springer Berlin Heidelberg, Berlin, Heidelberg,1982) p. 762

work page 1982

[27] [27]

Cybenko, Approximation by superpositions of a sig- moidal function, Math

G. Cybenko, Approximation by superpositions of a sig- moidal function, Math. Control Signals Syst.2,303 (1989)

work page 1989

[28] [28]

G. E. Hinton, S. Osindero, and Y.-W. Teh, A fast learning algorithm for deep belief nets, Neural Comput.18,1527 (2006)

work page 2006

[29] [29]

LeCun, Y

Y. LeCun, Y. Bengio, and G. Hinton, Deep learning, Na- ture521,436(2015)

work page 2015

[30] [30]

Shelhamer, J

E. Shelhamer, J. Long, and T. Darrell, Fully convolutional networks for semantic segmentation, IEEE TPAMI39,640 (2017)

work page 2017

[31] [31]

J. Ho, A. Jain, and P . Abbeel, Denoising diffusion proba- bilistic models, inProceedings of the34th International Con- ference on Neural Information Processing Systems, NIPS ’20 (Curran Associates Inc., Red Hook, NY, USA,2020)

work page 2020

[32] [32]

Attention Is All You Need

A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, L. Kaiser, and I. Polosukhin, Attention is all you need, arXiv:1706.03762

work page internal anchor Pith review Pith/arXiv arXiv

[33] [33]

Torlai, G

G. Torlai, G. Mazzola, J. Carrasquilla, M. Troyer, R. Melko, and G. Carleo, Neural-network quantum state tomography, Nat. Phys.14,447(2018)

work page 2018

[34] [34]

Ahmed, C

S. Ahmed, C. S. Muñoz, F. Nori, and A. F. Kockum, Quantum state tomography with conditional generative adversarial networks, Phys. Rev. Lett.127,140502(2021)

work page 2021

[35] [35]

T. Xin, S. Lu, N. Cao, G. Anikeeva, D. Lu, J. Li, G. Long, and B. Zeng, Local-measurement-based quantum state tomography via neural networks, npj Quantum Inf.5, 109(2019)

work page 2019

[36] [36]

Neural network state estimation for full quantum state tomography

Q. Xu and S. Xu, Neural network state estimation for full quantum state tomography, arXiv:1811.06654

work page internal anchor Pith review Pith/arXiv arXiv

[37] [37]

A. M. Palmieri, G. Müller-Rigat, A. K. Srivastava, M. Lewenstein, G. Rajchel-Mieldzio´ c, and M. Płodzie ´ n, Enhancing quantum state tomography via resource- efficient attention-based neural networks, Phys. Rev. Res. 6,033248(2024)

work page 2024

[38] [38]

Gaikwad, O

A. Gaikwad, O. Bihani, Arvind, and K. Dorai, Neural- network-assisted quantum state and process tomography using limited data sets, Phys. Rev. A109,012402(2024)

work page 2024

[39] [39]

Y. Quek, S. Fort, and H. K. Ng, Adaptive quantum state tomography with neural networks, npj Quantum Inf.7, 105(2021)

work page 2021

[40] [40]

Torlai and R

G. Torlai and R. G. Melko, Machine-learning quantum states in the NISQ era, Annu. Rev. Condens. Matter Phys. 11,325(2020)

work page 2020

[41] [41]

H. Ma, D. Dong, I. R. Petersen, C.-J. Huang, and G.-Y. Xiang, Neural networks for quantum state tomography with constrained measurements, Quantum Inf. Process. 23,317(2024). 8

work page 2024

[42] [42]

N. J. Higham, inReliable Numerical Computation(Oxford University Press, Oxford,1990) p.161

work page 1990

[43] [43]

S. L. Harris and D. M. Harris, inDigital Design and Com- puter Architecture(Morgan Kaufmann, Boston,2016) p. 108

work page 2016

[44] [44]

Normalization techniques in training dnns: Methodology, analysis and application,

L. Huang, J. Qin, Y. Zhou, F. Zhu, L. Liu, and L. Shao, Normalization techniques in training DNNs: Methodol- ogy, analysis and application, arXiv:2009.12836

work page arXiv 2009

[45] [45]

Minsky and S

M. Minsky and S. A. Papert,Perceptrons(The MIT Press, Cambridge,1969)

work page 1969

[46] [46]

https://github.com/CyberPardofelis/Error- mitigated_QST_NN

work page

[47] [47]

M. D. de Burgh, N. K. Langford, A. C. Doherty, and A. Gilchrist, Choice of measurement sets in qubit tomog- raphy, Phys. Rev. A78,052122(2008)

work page 2008