Efficient measurement of neutral-atom qubits with matched filters
Pith reviewed 2026-05-22 19:34 UTC · model grok-4.3
The pith
Matched filters for neutral-atom qubit readout cut errors by up to 43% compared to Gaussian thresholds while using far fewer parameters than convolutional neural networks.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The central claim is that an array matched filter trained on experimental fluorescence data incorporates neighbor-site information to mitigate readout crosstalk and produces up to 43% lower error rates than a conventional Gaussian threshold approach, while requiring only two orders of magnitude fewer parameters and four orders of magnitude fewer arithmetic operations than a convolutional neural network that achieves comparable accuracy.
What carries the argument
The array matched filter, a linear combination of fluorescence signals from a target atom and its neighbors whose weights are learned from calibration data to minimize total classification error.
If this is right
- The array model delivers up to 43% error reduction and the local model up to 32% reduction versus a Gaussian threshold detector.
- The array matched filter uses two orders of magnitude fewer trainable parameters and four orders of magnitude fewer multiplications than a convolutional neural network while increasing error by only 3.5%.
- Visual inspection of the learned filter weights reveals experimental imperfections such as crosstalk sources.
- A convolutional neural network can be pruned to 70 times fewer parameters and 4000 times fewer operations with comparable error rates.
- Performance remains stable across a range of readout durations with only minor error increase.
Where Pith is reading between the lines
- The low computational cost could allow real-time retraining when experimental conditions drift, extending usable lifetime of the calibration.
- The same matched-filter construction may transfer to other physical qubit platforms that suffer from spatially correlated readout noise.
- Interpretable weights could directly inform hardware redesigns that reduce crosstalk at the source rather than correcting it after the fact.
- Because the model is linear, its error statistics may combine more cleanly with standard quantum error-correction analyses than black-box neural networks.
Load-bearing premise
The training data gathered on the present apparatus remains representative of future operating conditions so that the learned filters do not overfit to transient hardware imperfections or particular atom placements.
What would settle it
Apply the trained filters to a new data set taken with altered atom spacing, different laser intensity, or a different trap configuration and verify whether the stated error reductions of 32% and 43% continue to hold.
Figures
read the original abstract
Quantum computers require high-fidelity measurement of many qubits to achieve a quantum advantage. Traditional approaches suffer from readout crosstalk for a neutral-atom quantum processor with a tightly spaced array. Although classical machine learning algorithms based on convolutional neural networks can improve fidelity, they are computationally expensive, making it difficult to scale them to large qubit counts. We present two simpler and scalable machine learning algorithms that realize matched filters for the readout problem. One is a local model that focuses on a single qubit, and the other uses information from neighboring qubits in the array to prevent crosstalk among the qubits. We demonstrate error reductions of up to 32% and 43% for the site and array models, respectively, compared to a conventional Gaussian threshold approach. Additionally, our array model uses two orders of magnitude fewer trainable parameters and four orders of magnitude fewer multiplications and nonlinear function evaluations than a recent convolutional neural network approach, with only a minor (3.5%) increase in error across different readout times. Another strength of our approach is its physical interpretability: the learned filter can be visualized to provide insights into experimental imperfections. We also show that a convolutional neural network model for improved can be pruned to have 70x and 4000x fewer parameters, respectively, while maintaining similar errors. Our work shows that simple machine learning approaches can achieve high-fidelity qubit measurements while remaining scalable to systems with larger qubit counts.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript introduces two matched-filter machine learning models for readout of neutral-atom qubits: a local site model and an array model that incorporates neighbor information to mitigate crosstalk. On experimental fluorescence traces, the models achieve error reductions of up to 32% and 43% relative to a conventional Gaussian threshold, while the array model uses two orders of magnitude fewer trainable parameters and four orders of magnitude fewer multiplications than a recent CNN approach, with only a 3.5% error increase across readout times. The learned filters are presented as physically interpretable.
Significance. If the reported gains hold, the work supplies a computationally lightweight and interpretable alternative to neural-network readout for neutral-atom arrays. The large reduction in parameter count and arithmetic operations directly addresses scalability concerns for larger qubit numbers, and the interpretability of the filters offers a route to diagnosing experimental imperfections without sacrificing performance.
major comments (1)
- [§4 (Experimental Results)] The 43% error reduction and efficiency claims for the array model are measured on held-out shots from the same experimental runs and apparatus. No cross-run or cross-calibration validation is reported to test whether the learned neighbor weights remain effective when laser intensity, position jitter, or loading statistics vary, which is load-bearing for the claim that the approach scales robustly to future operating conditions.
minor comments (2)
- [Abstract] The abstract sentence beginning 'We also show that a convolutional neural network model for improved can be pruned...' is incomplete; the intended phrase should be restored for clarity.
- [Figures 2-4] Figure captions and axis labels should explicitly state the number of shots and the precise definition of 'error' (e.g., assignment error per site) to allow direct comparison with the tabulated results.
Simulated Author's Rebuttal
We thank the referee for their positive summary and for identifying this important aspect of our validation strategy. We address the major comment point by point below.
read point-by-point responses
-
Referee: [§4 (Experimental Results)] The 43% error reduction and efficiency claims for the array model are measured on held-out shots from the same experimental runs and apparatus. No cross-run or cross-calibration validation is reported to test whether the learned neighbor weights remain effective when laser intensity, position jitter, or loading statistics vary, which is load-bearing for the claim that the approach scales robustly to future operating conditions.
Authors: We agree that cross-run validation would provide stronger evidence for robustness under varying conditions. Our reported results use held-out shots drawn from the same experimental runs to evaluate generalization to unseen data while remaining within the statistics and apparatus of the collected dataset; this tests performance against shot-to-shot fluctuations and any intra-run drifts that were present. The array model learns a small number of neighbor weights that primarily capture the fixed optical crosstalk geometry of the array, which is expected to remain stable for a given experimental setup. Because the model contains only a few dozen trainable parameters, it can be retrained or fine-tuned rapidly if calibration changes occur. We nevertheless acknowledge that the absence of explicit cross-run or cross-calibration tests limits the strength of the scalability claim. In the revised manuscript we will add a paragraph in §4 discussing this limitation and noting that the low parameter count makes periodic re-calibration on new data computationally inexpensive. We believe this clarification addresses the referee’s concern without altering the core experimental claims. revision: partial
Circularity Check
No significant circularity: performance metrics are measured outcomes on held-out experimental data
full rationale
The paper trains local and array matched-filter models on experimental fluorescence traces collected from the neutral-atom apparatus and reports error rates, parameter counts, and multiplication counts directly from evaluation on held-out shots. These quantities are not defined by the fitted weights themselves nor obtained by renaming a fit as a prediction. No self-definitional equations, load-bearing self-citations, uniqueness theorems, or ansatzes smuggled via prior work appear in the central claims. The derivation chain consists of standard supervised learning on real hardware data followed by empirical benchmarking, which remains self-contained and falsifiable outside the fitted values.
Axiom & Free-Parameter Ledger
free parameters (1)
- filter coefficients
axioms (1)
- domain assumption Fluorescence intensity traces from each site are statistically independent of distant sites once nearest-neighbor crosstalk is modeled.
Reference graph
Works this paper leans on
-
[1]
Preprocessing The first step is to crop the full 442 × 62-pixel image to obtain the 28 × 28-pixel image corresponding to the secondary path. For each measurement time, there are 6,002 images, randomly shuffled and split into 60% for training, 20% for testing, and 20% for validation. Next, we find the mean pixel intensity across all training images and subtrac...
-
[2]
Estimating Statistical Uncertainty in Infidelity As mentioned in the previous section, the images are split and randomly shuffled into training, testing, and validation sets. For Figures 2 and 3, and Tables I and II, the dataset is split using 10 independent random shuffles, the infidelities at each measurement time are computed, and the infidelities for each ra...
-
[3]
Training In the training step, the feature vectors for each im- age x ∈ Rd× 1 (see Fig. 1), where d is the number of elements in the feature vector, are placed into a matrix X ∈ Rd× M , where M is the number of images in the training dataset, and the corresponding ground truth la- bels from the primary path y ∈ [0, 1] are placed into a matrix Y ∈ R1× M . ...
-
[4]
Optimizing model metaparameters For each qubit and measurement time, we simultane- ously optimize the threshold and boundary size. For each boundary size, the weights learned in the training step are used to predict the qubit states for all feature vectors x in the validation set by computing ˆ y = ˆWx. Next, ˆy′ is computed for all thresholds ranging fro...
-
[5]
Testing Lastly, the optimal boundary size, weights, and thresh- olds for each qubit are used to obtain ˆy′ for each image in the testing image set and compared to the ground truth values y to obtain the classification fidelity
-
[6]
Cross-Fidelity Calculation To obtain a more representative cross-fidelity calcu- lation, we apply the trained models to a larger dataset 9 a) b) c) FIG. 6. (a) Infidelity comparison between CNN-Site (Baselin e-355k Model) and Pruned CNN-Site (5k Model) (b) Infidelity comparison between CNN-Array (Baseline-74M Model) and Pru ned CNN-Array (18k Model) (c) Over...
-
[7]
Baseline CNN Model The original single-qubit CNN model [ 16] has approx- imately 355,000 trainable parameters. When scaled di- rectly to multi-qubit arrays, the model complexity grows to over 74 million parameters, rendering real-time infer- ence infeasible
-
[8]
Pruned Model Our optimized single-qubit model eliminates unnec- essary convolutional and feed-forward layers through strategic pruning, reducing the parameter count from 355k to approximately 5k—a 70 × reduction—while pre- serving high readout fidelity. Key insights driving this reduction include the removal of fully connected layers, which contributed min...
-
[9]
Higher-level layers are added selectively to account for crosstalk between adjacent qubits
Hierarchical Multi-Qubit Model To scale effectively, we introduce a hierarchical CNN architecture that reuses our lightweight single-qubit CNN as a modular component. Higher-level layers are added selectively to account for crosstalk between adjacent qubits. This hierarchical design enables high-fidelity multi-qubit readout with just 18k parameters—a 4,000 ...
-
[10]
P. Shor, Algorithms for quantum computation: discrete logarithms and factoring, in Proceedings 35th Annual Symposium on Foundations of Computer Science (1994) pp. 124–134
work page 1994
-
[11]
D. J. Bernstein and T. Lange, Post-quantum cryptogra- phy, Nature 549, 188 (2017)
work page 2017
-
[12]
C. Gidney and M. Eker ˚ a, How to factor 2048 bit rsa in- tegers in 8 hours using 20 million noisy qubits, Quantum 5, 433 (2021)
work page 2048
-
[13]
A Quantum Approximate Optimization Algorithm
E. Farhi, J. Goldstone, and S. Gutmann, A quan- tum approximate optimization algorithm (2014), arXiv:1411.4028
work page internal anchor Pith review Pith/arXiv arXiv 2014
-
[14]
L. Zhou, S.-T. Wang, S. Choi, H. Pichler, and M. D. Lukin, Quantum approximate optimization algorithm: Performance, mechanism, and implementation on near- term devices, Physical Review X 10, 021067 (2020)
work page 2020
-
[15]
A. Abbas et al., Challenges and opportunities in quantum optimization, Nature Reviews Physics 6, 718 (2024)
work page 2024
-
[16]
Y. Cao, J. Romero, and A. Aspuru-Guzik, Potential of quantum computing for drug discovery, IBM Journal of Research and Development 62, 6 (2018)
work page 2018
-
[17]
N. S. Blunt, J. Camps, O. Crawford, R. Izs´ ak, S. Leon- tica, A. Mirani, A. E. Moylett, S. A. Scivier, C. Sunder- hauf, P. Schopf, et al. , Perspective on the current state- of-the-art of quantum computing for drug discovery ap- plications, J. Chem. Theory Comput. 18, 7001 (2022)
work page 2022
-
[18]
P. W. Shor, Scheme for reducing decoherence in quantum computer memory, Physical review A 52, R2493 (1995)
work page 1995
-
[19]
A. G. Fowler, M. Mariantoni, J. M. Martinis, and A. N. Cleland, Surface codes: Towards practical large-scale quantum computation, Phys. Rev. A 86, 032324 (2012)
work page 2012
-
[20]
R. Acharya et al. , Quantum error correction below the surface code threshold, Nature 638, 920 (2025)
work page 2025
- [21]
-
[22]
L. Henriet, L. Beguin, A. Signoles, T. Lahaye, A. Browaeys, G.-O. Reymond, and C. Jurczak, Quantum computing with neutral atoms, Quantum 4, 327 (2020)
work page 2020
- [23]
-
[24]
H. J. Manetsch, G. Nomura, E. Bataille, K. H. Leung, X. Lv, and M. Endres, A tweezer array with 6100 highly coherent atomic qubits (2024), arXiv:2403.12021
work page internal anchor Pith review arXiv 2024
-
[25]
L. Phuttitarn, B. M. Becker, R. Chinnarasu, T. M. Graham, and M. Saffman, Enhanced measurement of neutral-atom qubits with machine learning, Phys. Rev. Appl. 22, 024011 (2024)
work page 2024
- [26]
- [27]
-
[28]
Google Quantum AI, Exponential suppression of bit or phase errors with cyclic error correction, Nature 595, 383 (2021)
work page 2021
-
[29]
I. Goodfellow, Y. Bengio, and A. Courville, Deep Learn- ing (MIT Press, 2016)
work page 2016
-
[30]
Hertz, Sequential ridge regression, IEEE Trans
D. Hertz, Sequential ridge regression, IEEE Trans. Aerosp. Electron. Syst. 27, 571 (1991)
work page 1991
-
[31]
C. R. Vogel, Computational methods for inverse problems (SIAM, 2002)
work page 2002
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.