Large Scale Optimization of Disordered Hubbard Models through Tensor and Neural Networks
Pith reviewed 2026-05-10 03:27 UTC · model grok-4.3
The pith
Neural networks trained on 3x3 charge-stability data can tune the central dot in larger 5x5 disordered arrays.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
A simulatable local 3×3 window contains sufficient information to tune the central dot within a much larger array. This validates a sliding-window approach in which one tunes a local region and then translates that window across the lattice to calibrate a larger device. For only on-site disorder unknown, the neural network predicts parameters with R² > 0.99 in 3×3 and retains R² ≈ 0.98 in 5×5 after fine tuning. Even when all parameters are unknown, on-site disorder prediction remains robust with R² > 0.9.
What carries the argument
Vision-based neural networks trained on tensor-network generated charge-stability data from local 3×3 windows to enable sliding-window tuning of larger arrays.
If this is right
- High accuracy in predicting on-site disorder for both small and scaled-up arrays.
- Robust inference of key disorder parameters even in fully disordered cases.
- Scalable calibration without computing ground states of exponentially large systems.
- Translation of local tuning across the entire lattice via repeated window application.
Where Pith is reading between the lines
- The approach suggests local correlations dominate for initial parameter estimation in these models.
- Similar local-window strategies could apply to tuning other complex quantum devices facing Hilbert space explosion.
- Real-device tests would need to account for noise levels not present in the tensor simulations.
Load-bearing premise
The local 3x3 charge-stability data is representative enough that the neural network generalizes accurately to the central dot in larger arrays without major degradation from long-range effects or noise.
What would settle it
A significant drop in prediction accuracy below R² of 0.9 when testing the neural network on charge-stability data from actual or simulated 5x5 arrays that include long-range interactions would show the local window is insufficient.
Figures
read the original abstract
We theoretically demonstrate a practical method for tuning randomly disordered 2D quantum-dot grids underlying spin qubit platforms using vision-based neural networks trained on tensor-network generated charge-stability data. We show that a simulatable local $3\times 3$ window already contains sufficient information to tune the central dot within a much larger array, thereby validating a sliding-window approach in which one tunes a local region and then translates that window across the lattice to calibrate a larger device. This avoids the computationally intractable necessity for obtaining the ground states for large systems with exponentially large Hilbert space. For the experimentally relevant case where only the on-site disorder is unknown, the neural network predicts the relevant parameters with very high fidelity in the $3\times 3$ setting [$R^2 >0.99$], and after fine tuning on only a small number of larger-device samples, it retains high accuracy for the central dot of a $5\times 5$ plaquette [$R^2\approx 0.98$]. When all the dots parameters are treated as unknown, prediction of the on-site disorder remains robust [$R^2>0.9$ for both $3\times 3$ and $5\times 5$], although the remaining parameters are substantially more difficult to infer from the same charge-stability data. This shows that the most practically important disorder parameter for tuning can still be inferred reliably even in the fully disordered setting for the computationally difficult 5x5 arrays.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper demonstrates a neural-network approach trained on tensor-network simulations of charge-stability diagrams from 3×3 windows of the disordered Hubbard model to predict on-site disorder and other parameters for the central dot in larger arrays. It reports R² > 0.99 for 3×3 on-site disorder, R² ≈ 0.98 for 5×5 after fine-tuning on a few larger samples, and R² > 0.9 for on-site disorder even when all parameters are unknown, thereby supporting a sliding-window tuning protocol that avoids direct simulation of large systems.
Significance. If the local-window sufficiency holds beyond the tested sizes, the method would provide a computationally tractable route to calibrating large-scale disordered quantum-dot arrays for spin-qubit applications, where full Hilbert-space simulations become intractable. The high fidelity for the experimentally dominant on-site disorder parameter is a concrete practical strength, and the use of independent tensor-network data for training avoids obvious circularity.
major comments (2)
- [Abstract] Abstract and results on 5×5 performance: the central claim that a simulatable 3×3 window already contains sufficient information for the central dot in a 'much larger array' is supported only by data up to 5×5 plaquettes. In the disordered Hubbard model, virtual hopping and global particle-number constraints can generate size-dependent corrections to the central dot's chemical potential and charging energy; no direct test or scaling analysis is provided for arrays larger than 5×5, so the extrapolation required for the sliding-window protocol remains unverified.
- [Results] Section on fully disordered case: when all dot parameters are treated as unknown, on-site disorder is still recovered with R² > 0.9, but the remaining parameters are substantially harder to infer. If the paper's primary practical goal is reliable extraction of the dominant disorder term, this distinction should be quantified with an explicit error budget or ablation showing which parameters drive the tuning accuracy.
minor comments (2)
- The abstract states R² > 0.99 for 3×3 and R² ≈ 0.98 for 5×5; the methods section should report the precise definition of R² (e.g., over which parameter set and test-set size) and any cross-validation protocol used to obtain these numbers.
- Notation for the neural-network input (charge-stability diagrams) and output (predicted disorder values) should be made consistent between the abstract, methods, and figure captions to avoid ambiguity when readers attempt to reproduce the training pipeline.
Simulated Author's Rebuttal
We thank the referee for the positive assessment of our work and the constructive major comments. We address each point below with clarifications and revisions to the manuscript where appropriate.
read point-by-point responses
-
Referee: [Abstract] Abstract and results on 5×5 performance: the central claim that a simulatable 3×3 window already contains sufficient information for the central dot in a 'much larger array' is supported only by data up to 5×5 plaquettes. In the disordered Hubbard model, virtual hopping and global particle-number constraints can generate size-dependent corrections to the central dot's chemical potential and charging energy; no direct test or scaling analysis is provided for arrays larger than 5×5, so the extrapolation required for the sliding-window protocol remains unverified.
Authors: We agree that our numerical validation is limited to 5×5 arrays, which represent the practical limit for tensor-network simulations with sufficient accuracy. The modest degradation from R² > 0.99 (3×3) to R² ≈ 0.98 (5×5 central dot) after minimal fine-tuning already indicates that local 3×3 data captures the dominant physics. In the revised manuscript we have updated the abstract to qualify the claim as applying to 'larger arrays (demonstrated up to 5×5)' and added a dedicated paragraph in the discussion section. There we argue, based on the short-range nature of the Hubbard interactions and electrostatic screening, that size-dependent corrections to on-site disorder and charging energy are expected to remain small beyond 5×5. A full scaling study for much larger arrays is not feasible with current methods, which is the central motivation for the sliding-window protocol. revision: partial
-
Referee: [Results] Section on fully disordered case: when all dot parameters are treated as unknown, on-site disorder is still recovered with R² > 0.9, but the remaining parameters are substantially harder to infer. If the paper's primary practical goal is reliable extraction of the dominant disorder term, this distinction should be quantified with an explicit error budget or ablation showing which parameters drive the tuning accuracy.
Authors: We thank the referee for highlighting this distinction. In the revised manuscript we have added an ablation study in the results section that systematically removes or fixes subsets of parameters during training and evaluates the resulting impact on on-site disorder prediction. We have also included an explicit error budget that propagates the inference uncertainties through to the effective chemical potential and charging energy of the central dot. This analysis confirms that the robust recovery of on-site disorder (R² > 0.9) dominates the tuning accuracy, while errors in the other parameters contribute only marginally to the overall calibration error for qubit operation. revision: yes
- Direct numerical tests or scaling analysis for arrays substantially larger than 5×5 remain computationally intractable with tensor networks, preventing empirical verification of the sliding-window protocol beyond the sizes already reported.
Circularity Check
No circularity: independent tensor-network data generation and NN prediction on unseen larger systems
full rationale
The paper generates training data via tensor-network simulations of 3x3 Hubbard models, trains a vision-based neural network to map charge-stability diagrams to on-site disorder parameters, and reports R^2 scores on held-out 3x3 cases plus fine-tuned 5x5 cases. No equation or step reduces a claimed prediction to a fitted input by construction, no self-citation supplies a load-bearing uniqueness theorem, and no ansatz is smuggled in. The sliding-window claim is an empirical generalization tested within the simulated data regime rather than a definitional tautology.
Axiom & Free-Parameter Ledger
free parameters (1)
- Neural network weights and biases
axioms (2)
- domain assumption The disordered Hubbard model accurately captures the physics of the quantum-dot grid
- ad hoc to paper A local 3x3 window contains sufficient information to tune the central dot in a larger array
Reference graph
Works this paper leans on
-
[1]
in ITensor [33] to model3×3and5×5quantum- dot grids. Since an MPS is intrinsically one-dimensional, the 2D lattice is mapped onto a 1D ordering by snaking through the sites (Fig. S1). For each configuration of ⃗ µ(m) in a sample device, we perform DMRG to obtain the ground state and then compute the site occupations ⟨ni⟩, yielding⃗ n(⃗ µ). Due to computat...
-
[2]
J. R. Taylor and S. Das Sarma, Neural network based deep learning analysis of semiconductor quantum dot qubits for automated control, Physical Review B111, 035301 (2025)
work page 2025
- [3]
-
[4]
H. Moon, D. T. Lennon, J. Kirkpatrick, N. M. van Es- broeck, L. C. Camenzind, L. Yu, F. Vigneau, D. M. Zum- bühl, G. A. D. Briggs, M. A. Osborne, D. Sejdinovic, E. A. Laird, and N. Ares, Machine learning enables com- pletely automatic tuning of a quantum device faster than humanexperts,NatureCommunications11,4161(2020)
work page 2020
- [5]
-
[6]
J. R. Taylor and S. Das Sarma, Mitigating dis- order and optimizing topological indicators with vision-transformer-based neural networks in majorana nanowires, Physical Review B112, L041110 (2025)
work page 2025
-
[7]
J. R. Taylor, K. Laubscher, and S. D. Sarma, Predicting spin-orbit coupling in hole spin qubit arrays with vision- transformer-based neural networks on a generalized hub- bard model, arXiv preprint arXiv:2604.05052 (2026)
work page internal anchor Pith review Pith/arXiv arXiv 2026
-
[8]
J. P. Zwolak, J. M. Taylor, R. W. Andrews, J. Ben- son, G. W. Bryant, D. Buterakos, A. Chatterjee, S. Das Sarma, M. A. Eriksson, E. Greplová,et al., Data needs and challenges for quantum dot devices automa- tion, npj Quantum Information10, 105 (2024)
work page 2024
-
[9]
A. S. Rao, D. Buterakos, B. van Straaten, V. John, C. X. Yu, S. D. Oosterhout, L. Stehouwer, G. Scappucci, M. Veldhorst, F. Borsoi,et al., Modular autonomous virtualization system for two-dimensional semiconduc- tor quantum dot arrays, Physical Review X15, 021034 (2025)
work page 2025
-
[10]
D. L. Buterakos, S. S. Kalantre, J. Ziegler, J. M. Taylor, and J. P. Zwolak, Qdflow: A python package for physics simulations of quantum dot devices, SciPost Physics Codebases , 065 (2026)
work page 2026
- [11]
-
[12]
S. D. Sarma, J. Fabian, X. Hu, and I. Zutić, Spin elec- tronics and spin computation, Solid State Communica- tions119, 207 (2001)
work page 2001
-
[13]
W. G. Van der Wiel, S. De Franceschi, J. M. Elzerman, T. Fujisawa, S. Tarucha, and L. P. Kouwenhoven, Elec- tron transport through double quantum dots, Reviews of modern physics75, 1 (2002)
work page 2002
-
[14]
S. D. Sarma, R. de Sousa, X. Hu, and B. Koiller, Spin quantum computation in silicon nanostructures, Solid state communications133, 737 (2005)
work page 2005
-
[15]
A. Chatterjee, P. Stevenson, S. De Franceschi, A. Morello, N. P. de Leon, and F. Kuemmeth, Semicon- ductor qubits in practice, Nature Reviews Physics3, 157 (2021)
work page 2021
-
[16]
G. Burkard, T. D. Ladd, A. Pan, J. M. Nichol, and J. R. Petta, Semiconductor spin qubits, Reviews of Modern Physics95, 025003 (2023)
work page 2023
-
[17]
P. Barthelemy and L. M. Vandersypen, Quantum dot systems: a versatile platform for quantum simulations, Annalen der Physik525, 808 (2013)
work page 2013
- [18]
-
[19]
T. Hensgens, T. Fujita, L. Janssen, X. Li, C. Van Diepen, C. Reichl, W. Wegscheider, S. Das Sarma, and L. M. Vandersypen, Quantum simulation of a fermi–hubbard model using a semiconductor quantum dot array, Nature 548, 70 (2017)
work page 2017
-
[20]
C. Stafford and S. D. Sarma, Collective coulomb blockade in an array of quantum dots: A mott-hubbard approach, Physical review letters72, 3590 (1994)
work page 1994
-
[21]
R. Kotlyar, C. A. Stafford, and S. Das Sarma, Addi- tion spectrum, persistent current, and spin polarization in coupled quantum dot arrays: Coherence, correlation, and disorder, Phys. Rev. B58, 3989 (1998)
work page 1998
-
[22]
R. Kotlyar, C. Stafford, and S. D. Sarma, Correlated charge polarization in a chain of coupled quantum dots, Physical Review B58, R1746 (1998)
work page 1998
-
[23]
Y.-Z. Chou and S. Das Sarma, Spin ladder quantum sim- ulators from spin-orbit coupled quantum dot spin qubits, Phys. Rev. B113, 035124 (2026)
work page 2026
- [24]
-
[25]
J. P. Zwolak and J. M. Taylor, Colloquium: Advances in automation of quantum dot devices control, Reviews of modern physics95, 011006 (2023)
work page 2023
- [26]
-
[27]
J. Schuff, M. J. Carballido, M. Kotzagiannidis, J. C. Calvo, M. Caselli, J. Rawling, D. L. Craig, B. van Straaten, B. Severin, F. Fedele, S. Svab, P. Cheva- lier Kwon, R. S. Eggli, T. Patlatiuk, N. Korda, D. M. Zumbühl, and N. Ares, Fully autonomous tuning of a spin qubit, Nature Electronics9, 304 (2026)
work page 2026
-
[28]
X. Wang, S. Yang, and S. D. Sarma, Quantum theory of the charge-stability diagram of semiconductor double- quantum-dot systems, Physical Review B84, 115301 (2011)
work page 2011
-
[29]
S. R. White, Density matrix formulation for quantum renormalization groups, Physical review letters69, 2863 (1992)
work page 1992
-
[30]
U.Schollwöck,Thedensity-matrixrenormalizationgroup in the age of matrix product states, Annals of physics 326, 96 (2011)
work page 2011
-
[31]
A. Krizhevsky, I. Sutskever, and G. E. Hinton, Imagenet classification with deep convolutional neural networks, Advances in neural information processing systems25 (2012)
work page 2012
-
[32]
K. He, X. Zhang, S. Ren, and J. Sun, Deep residual learn- ingforimagerecognition,inProceedings of the IEEE con- ference on computer vision and pattern recognition(2016) pp. 770–778
work page 2016
-
[33]
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
A. Dosovitskiy, L. Beyer, A. Kolesnikov, D. Weissenborn, X. Zhai, T. Unterthiner, M. Dehghani, M. Minderer, G. Heigold, S. Gelly,et al., An image is worth 16x16 words: Transformersforimagerecognitionatscale,arXiv preprint arXiv:2010.11929 (2020)
work page internal anchor Pith review Pith/arXiv arXiv 2010
-
[34]
M. Fishman, S. R. White, and E. M. Stoudenmire, The ITensor Software Library for Tensor Network Calcula- tions, SciPost Physics Codebases , 4 (2022)
work page 2022
-
[35]
Large Scale Optimization of Quantum Dots through Tensor and Neural Networks
J. Hu, L. Shen, and G. Sun, Squeeze-and-excitation net- works, inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition(2018) pp. 7132–7141. 1 Supplemental Materials for “Large Scale Optimization of Quantum Dots through Tensor and Neural Networks” Supplementary Figure S1: Tensor network diagram of the MPS representing the system...
work page 2018
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.