A Scalable Configuration-Interaction Impurity Solver via Active Learning
Pith reviewed 2026-05-11 00:50 UTC · model grok-4.3
The pith
Active learning selects the relevant determinants so configuration-interaction impurity solvers scale to larger baths with only weak cost growth.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
AL-ATCI uses active learning to locate the physically relevant subset of Slater determinants inside the rapidly expanding Hilbert space of an enlarged impurity problem; the query size N_query both controls the approximation and serves as an internal convergence parameter. Because the correlated state occupies only a small fraction of the full determinant space, the computational effort grows only weakly with bath size. In one-dimensional Hubbard-model benchmarks the resulting spectra and energies reproduce those of exact diagonalization, while cellular calculations become feasible up to ten-site clusters. For a rotationally invariant three-orbital Sr2RuO4 impurity the method yields converged
What carries the argument
AL-ATCI: an active-learning extension of adaptive-truncation configuration interaction that iteratively queries and retains only the determinant manifold relevant to the correlated state, with accuracy controlled by the finite query size N_query.
If this is right
- AL-ATCI reproduces exact-diagonalization accuracy on one-dimensional Hubbard-model DMFT benchmarks.
- Cellular DMFT calculations become practical for clusters as large as ten sites.
- Dynamical quantities for a three-orbital Sr2RuO4 impurity converge systematically when bath orbitals are increased from nine to eighteen.
- The relevant determinant manifold stays far smaller than the combinatorial space, so cost grows only weakly with bath size.
Where Pith is reading between the lines
- The same selection strategy could be tested on other many-body methods whose Hilbert spaces are dominated by a sparse set of important configurations.
- If N_query proves reliable across a wider range of models, it could serve as a parameter-free convergence diagnostic for any determinant-based impurity solver.
- The approach might be combined with existing bath-optimization techniques to further reduce the number of orbitals needed for a given accuracy.
Load-bearing premise
The active-learning procedure with a fixed query size N_query will always locate every determinant that materially affects the dynamical quantities, even as the bath grows larger.
What would settle it
A case in which increasing N_query after the reported convergence threshold still shifts the impurity spectral function or self-energy by more than numerical noise.
Figures
read the original abstract
Finite-Hamiltonian impurity solvers provide direct real-frequency spectra and a natural route to enlarged impurity Hamiltonians, but their applicability is limited by the rapid Hilbert-space growth with the number of bath or other added one-particle orbitals. We introduce an active-learning extension of adaptive-truncation configuration interaction (AL-ATCI) that identifies the determinant manifold relevant to the correlated state. The approximation is systematically controlled by the query size N_query, which also provides an internal convergence parameter when no external benchmark is available. Over the benchmark range studied here, the computational cost grows only weakly with bath size, because enlarging the bath mainly expands the combinatorial determinant space rather than the physically relevant manifold. In dynamical mean-field-theory benchmarks for the one-dimensional Hubbard model, AL-ATCI reproduces exact-diagonalization accuracy and extends cellular calculations to clusters as large as N_c = 10. For a three-orbital rotationally invariant Sr2RuO4 impurity problem, we demonstrate systematic convergence of dynamical quantities and a highly compressed configuration space as N_b is increased from 9 to 18. These results substantially alleviate the bath-discretization bottleneck of exact-diagonalization- and configuration-interaction-based impurity solvers and make large-bath and enlarged-orbital calculations more practical.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript introduces an active-learning extension (AL-ATCI) to adaptive-truncation configuration interaction for finite-Hamiltonian impurity solvers in DMFT. It claims that the query size N_query systematically controls the approximation by identifying the physically relevant determinant manifold, yielding only weak growth in computational cost with bath size N_b (because the manifold does not expand combinatorially), exact-diagonalization accuracy for the 1D Hubbard model, extension of cellular calculations to N_c=10, and systematic convergence of dynamical quantities for a three-orbital Sr2RuO4 impurity as N_b increases from 9 to 18.
Significance. If the active-learning selection is shown to be complete for dynamical quantities, the work would meaningfully extend the reach of configuration-interaction impurity solvers by reducing the effective Hilbert-space size, thereby addressing the bath-discretization bottleneck that currently limits ED- and CI-based DMFT calculations for multi-orbital and large-cluster problems.
major comments (2)
- [Abstract and Sr2RuO4 results] Abstract and Sr2RuO4 benchmarks: the central claim that AL-ATCI reproduces ED-level accuracy for dynamical quantities while compressing the configuration space rests on the assumption that finite-N_query active learning never omits low-weight determinants that shift poles, residues, or the self-energy at finite frequencies. The presented results show convergence of dynamical quantities but supply no error bars, no quantitative truncation-error estimates for the Green's function, and no direct comparison to full ED (where feasible for smaller N_b) that would confirm the manifold is complete for spectra rather than only for the ground-state energy or density matrix.
- [Methods (active-learning section)] Methods description of the active-learning procedure: N_query is introduced as an external control parameter that also serves as an internal convergence diagnostic, yet no equation or pseudocode specifies the acquisition function or uncertainty estimator. Without this, it is impossible to assess whether the selection criterion is insensitive to virtual excitations that contribute only at specific Matsubara or real frequencies, which directly affects the reliability of the reported weak scaling and spectral accuracy.
minor comments (2)
- The abstract states that the cost 'grows only weakly with bath size' and that the configuration space is 'highly compressed,' but the main text should include a quantitative plot or table of the retained determinant fraction versus N_b to make the compression claim concrete.
- Figure captions and axis labels for the dynamical-quantity convergence plots should explicitly state the value of N_query used and whether the curves are for fixed or increasing N_query.
Simulated Author's Rebuttal
We thank the referee for the careful reading and constructive comments on our manuscript. We appreciate the recognition that AL-ATCI could meaningfully extend the reach of configuration-interaction impurity solvers. We address each major comment below and will incorporate revisions to strengthen the quantitative support for our claims.
read point-by-point responses
-
Referee: [Abstract and Sr2RuO4 results] Abstract and Sr2RuO4 benchmarks: the central claim that AL-ATCI reproduces ED-level accuracy for dynamical quantities while compressing the configuration space rests on the assumption that finite-N_query active learning never omits low-weight determinants that shift poles, residues, or the self-energy at finite frequencies. The presented results show convergence of dynamical quantities but supply no error bars, no quantitative truncation-error estimates for the Green's function, and no direct comparison to full ED (where feasible for smaller N_b) that would confirm the manifold is complete for spectra rather than only for the ground-state energy or density matrix.
Authors: We agree that additional quantitative error analysis would strengthen the presentation of dynamical accuracy. For the 1D Hubbard model we already provide direct ED comparisons up to N_c=10 that include both energies and spectral functions. For the Sr2RuO4 impurity, full ED becomes prohibitive beyond N_b=9, but we will add in revision: (i) a direct ED versus AL-ATCI comparison at N_b=9 to quantify spectral truncation error, and (ii) error bars on the self-energy and Green's function obtained from the variation across N_query values. These additions will explicitly demonstrate that the selected manifold captures the relevant poles and residues for the reported dynamical quantities. revision: yes
-
Referee: [Methods (active-learning section)] Methods description of the active-learning procedure: N_query is introduced as an external control parameter that also serves as an internal convergence diagnostic, yet no equation or pseudocode specifies the acquisition function or uncertainty estimator. Without this, it is impossible to assess whether the selection criterion is insensitive to virtual excitations that contribute only at specific Matsubara or real frequencies, which directly affects the reliability of the reported weak scaling and spectral accuracy.
Authors: We acknowledge that the current methods section does not provide an explicit equation or pseudocode for the acquisition function. In the revision we will insert a dedicated subsection that defines the uncertainty estimator (based on the variance of CI coefficients across an ensemble of trial wavefunctions) and the acquisition function used to rank determinants. We will also include pseudocode for the iterative selection loop, clarifying how N_query controls inclusion of both ground-state and frequency-relevant virtual excitations. This will make the weak-scaling claim and spectral reliability fully reproducible. revision: yes
Circularity Check
No significant circularity; N_query is external control and scaling claims rest on benchmarks
full rationale
The paper presents AL-ATCI as an active-learning extension of adaptive-truncation CI, with N_query as an explicit external convergence parameter that controls the determinant manifold size. The central observation—that computational cost grows only weakly with bath size because the physically relevant manifold remains compressed—is reported as an empirical outcome from explicit benchmarks (1D Hubbard model and Sr2RuO4 impurity) rather than a mathematical identity or self-fit. No equation in the provided text reduces the reported accuracy, compression ratio, or weak scaling to a redefinition of the input data or to a parameter fitted to the same observables being predicted. Self-citations, if present, are not invoked to justify uniqueness or to close a derivation loop. The method is therefore self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
free parameters (1)
- N_query
axioms (1)
- domain assumption The physically relevant determinant manifold for the correlated impurity state can be identified by repeated active-learning queries without exhaustive enumeration.
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
AL-ATCI addresses this by introducing a targeted selection step before Hamiltonian construction and diagonalization... Only the top N_query configurations are then retained for Hamiltonian construction and diagonalization
-
IndisputableMonolith/Foundation/BranchSelection.leanbranch_selection unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
the computational cost grows only weakly with bath size, because enlarging the bath mainly expands the combinatorial determinant space rather than the physically relevant manifold
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Single-site DMFT (a) captures local correlations but misses spatial fluctuations. AsN c grows (b)–(d), the spectral weight redistributes and the dominant fea- tures align more closely with the Bethe-ansatz spinon and holon branches, reflecting the improved treatment 4 210 212 214 216 218 Nquery 10 7 10 5 10 3 (a) 210 212 214 216 218 Nquery 10 4 10 3 10 2 ...
work page 2023
-
[2]
A. Georges, G. Kotliar, W. Krauth, and M. J. Rozenberg, Rev. Mod. Phys.68, 13 (1996)
work page 1996
-
[3]
G. Kotliar, S. Y. Savrasov, K. Haule, V. S. Oudovenko, O. Parcollet, and C. A. Marianetti, Rev. Mod. Phys.78, 865 (2006)
work page 2006
- [4]
-
[5]
E. Gull, A. J. Millis, A. I. Lichtenstein, A. N. Rubtsov, M. Troyer, and P. Werner, Rev. Mod. Phys.83, 349 (2011)
work page 2011
-
[6]
E. Eidelstein, E. Gull, and G. Cohen, Phys. Rev. Lett. 124, 206405 (2020)
work page 2020
-
[7]
A. Erpenbeck, T. Blommel, L. Zhang, W.-T. Lin, G. Co- hen, and E. Gull, The Journal of Chemical Physics161, 094104 (2024)
work page 2024
-
[8]
K. G. Wilson, Rev. Mod. Phys.47, 773 (1975)
work page 1975
- [9]
-
[10]
K. M. Stadler, Z. P. Yin, J. von Delft, G. Kotliar, and A. Weichselbaum, Phys. Rev. Lett.115, 136401 (2015)
work page 2015
-
[11]
F. B. Kugler, M. Zingl, H. U. R. Strand, S.-S. B. Lee, J. von Delft, and A. Georges, Phys. Rev. Lett.124, 016401 (2020)
work page 2020
-
[12]
A. Gleis, S.-S. B. Lee, G. Kotliar, and J. von Delft, Phys. Rev. Lett.134, 106501 (2025)
work page 2025
- [13]
- [14]
- [15]
- [16]
-
[17]
D. Zgid, E. Gull, and G. K.-L. Chan, Phys. Rev. B86, 165128 (2012)
work page 2012
- [18]
-
[19]
Y. Lu, M. H¨ oppner, O. Gunnarsson, and M. W. Haverkort, Phys. Rev. B90, 085102 (2014)
work page 2014
- [20]
- [21]
-
[22]
C. Mejuto-Zaera, N. M. Tubman, and K. B. Whaley, Phys. Rev. B100, 125165 (2019)
work page 2019
- [23]
-
[24]
Y. Wang, G. Fabbris, M. Dean, and G. Kotliar, Com- puter Physics Communications243, 151 (2019)
work page 2019
- [25]
-
[26]
K. Higashi, M. Winder, J. Kuneˇ s, and A. Hariki, Phys. Rev. X11, 041009 (2021)
work page 2021
-
[27]
J. P. Coe, J. Chem. Theory Comput.14, 5739 (2018)
work page 2018
- [28]
- [29]
- [30]
- [31]
- [32]
-
[33]
E. R. Davidson, J. Comput. Phys.17, 87 (1975)
work page 1975
-
[34]
See Supplemental Material at [URL will be inserted by publisher] for additional benchmark results
- [35]
-
[36]
F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cour- napeau, M. Brucher, M. Perrot, and E. Duchesnay, J. Mach. Learn. Res.12, 2825 (2011)
work page 2011
- [37]
-
[38]
E. H. Lieb and F. Y. Wu, Phys. Rev. Lett.20, 1445 (1968)
work page 1968
-
[39]
K. Penc, K. Hallberg, F. Mila, and H. Shiba, Phys. Rev. Lett.77, 1390 (1996)
work page 1996
- [40]
- [41]
-
[42]
G. Kotliar, S. Y. Savrasov, G. P´ alsson, and G. Biroli, Phys. Rev. Lett.87, 186401 (2001)
work page 2001
- [43]
-
[44]
J. Mravlje, M. Aichhorn, T. Miyake, K. Haule, G. Kotliar, and A. Georges, Phys. Rev. Lett.106, 096401 (2011)
work page 2011
-
[45]
A. Georges, L. d. Medici, and J. Mravlje, Annual Review of Condensed Matter Physics4, 137 (2013)
work page 2013
- [46]
-
[47]
H. J. Lee, C. H. Kim, and A. Go, Phys. Rev. B102, 195115 (2020)
work page 2020
-
[48]
M. W. Haverkort, I. S. Elfimov, L. H. Tjeng, G. A. Sawatzky, and A. Damascelli, Phys. Rev. Lett.101, 026406 (2008). 6 Labeled(trainset) RASCI reference SDs reference SDs +Additional SDs LanczosConverged? Ground state Orbital rotation Training Prediction Query SDs FIG. S1. Flowchart of AL-ATCI, highlighting the active learning enhancements: classifier trai...
work page 2008
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.