Integrating Out, Twice:The Open-System Case That Neural-Network Ensemble Theory Is Missing
Pith reviewed 2026-06-27 17:11 UTC · model grok-4.3
The pith
Neural-network ensemble averaging produces only closed-system covariances and misses the open case that conserves flux into a continuous spectrum.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The closed case of neural ensemble averaging is the Schur complement of a Gaussian block that returns a covariance and its inverse; this maps the neural tangent kernel to the Fisher sensitivity kernel, the infinite-width limit to the Gaussian-process emulator, and the lazy-to-feature transition to the validity boundary of a reduced-basis emulator. The open case requires an eliminated sector with continuous spectrum and wave-like dynamics to produce a non-Hermitian effective generator that itemizes conserved flux, as in the nuclear optical model. The three tested partitions lack this sector, so the open ledger is either absent, an artifact of the partition, or pinned near a floor by the train
What carries the argument
The Schur complement of the eliminated block, which returns covariance and inverse when the sector is closed and a non-Hermitian generator that conserves lost probability when the sector is open with continuous spectrum.
If this is right
- The neural tangent kernel is the Fisher sensitivity kernel under the closed-case dictionary.
- The infinite-width Gaussian limit is the Gaussian-process emulator.
- The lazy-to-feature transition marks the validity boundary of a reduced-basis emulator.
- The conserved flux ledger appears only where openness is genuinely present with a continuous-spectrum sector.
Where Pith is reading between the lines
- Architectures whose internal partitions naturally eliminate a continuum sector could make the open ledger usable for uncertainty that current ensembles treat as epistemic.
- Training objectives that penalize flux loss may systematically suppress the open-sector signature even when the partition geometry allows it.
- Replacing relaxational layers with wave-propagating ones in selected blocks would provide a direct test of whether the missing dynamics can be engineered inside existing networks.
Load-bearing premise
That the three tested neural objects constitute representative instances of genuine openness with an eliminated continuous-spectrum sector.
What would settle it
Observation of a neural-network partition whose eliminated sector exhibits continuous spectrum and produces wave-like rather than relaxational dynamics, yielding a non-Hermitian generator whose flux ledger matches the open-system predictions.
read the original abstract
Averaging a neural network over its random parameters and marginalizing a Gaussian sector are the same operation, the Schur complement of the eliminated block, and when that block is closed it returns a covariance and its inverse. That is all a network ensemble produces, the closed case. The open case is missing, and nuclear reaction theory has it worked out. Projecting a scattering problem onto a chosen set of channels, with the rest carrying probability irreversibly to a continuum, leaves a non-Hermitian effective generator that conserves and itemizes exactly what it loses: the nuclear optical model and its generalized optical theorem. I set the two cases side by side using only the moments of a distribution, the algebra of Gaussians, and block inversion, no field theory, and give the closed-case dictionary in full: the neural tangent kernel is the Fisher sensitivity kernel, the infinite-width Gaussian limit is the Gaussian-process emulator, and the lazy-to-feature transition is the validity boundary of a reduced-basis emulator. I then test the open export on a truncated attention map, a token-level transfer operator, and a sparse expert router, and report a mostly negative result. The conserved flux ledger ports wherever openness is genuinely present, but its distinctive content is absent, an artifact of the chosen partition, or pinned near a floor by the training objective, and the operationally useful uncertainty turns out to be epistemic, living in the closed half of the correspondence, not the open one. The negative has a structural reason this note makes precise: the open case needs an eliminated sector with a continuous spectrum and wave-like, not relaxational, dynamics, which mainstream learning's finite or dissipative objects do not supply. This is a note, not a result; its main finding is that negative one, and its value is the map that locates it.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript claims that neural-network ensemble theory realizes only the closed case of marginalization over random parameters, which via the Schur complement on a Gaussian sector yields a covariance and its inverse; the open case from nuclear reaction theory—producing a non-Hermitian effective generator that conserves and itemizes flux to a continuous-spectrum eliminated sector—is absent. It supplies a closed-case dictionary (NTK as Fisher sensitivity kernel, infinite-width limit as GP emulator, lazy-to-feature transition as reduced-basis validity boundary) using only moments and block inversion, then tests three NN objects (truncated attention map, token-level transfer operator, sparse expert router) and reports mostly negative results for open-case signatures such as the conserved flux ledger. The structural negative is attributed to the lack of an eliminated sector with continuous spectrum and wave-like (non-relaxational) dynamics in mainstream finite or dissipative NN objects.
Significance. If the structural diagnosis is correct, the work supplies a precise diagnostic map locating why open-system features have not appeared in ensemble theory and unifies several existing NN equivalences in a parameter-free manner via moments and block inversion. The closed-case dictionary is a clear contribution; the negative result on the three objects, while preliminary, identifies a concrete obstruction that future constructions would need to overcome.
major comments (2)
- [Testing section (three NN objects)] Testing section on the three NN objects: the partitions applied to the truncated attention map, token-level transfer operator, and sparse expert router must be shown explicitly to eliminate a sector whose spectrum is continuous and whose dynamics are wave-like rather than relaxational; absent such verification, the reported absence of conserved-flux signatures follows by construction from the Schur-complement algebra and does not establish that no NN ensemble can realize the required open sector.
- [Abstract and conclusion] Abstract and concluding paragraph: the claim that 'mainstream learning's finite or dissipative objects do not supply' the eliminated continuous-spectrum sector is load-bearing for the central negative; the three chosen examples are treated as representative, yet the manuscript provides no general argument that every possible partition of an NN object must eliminate only finite or dissipative sectors.
minor comments (1)
- The manuscript would benefit from an explicit side-by-side equation block comparing the closed-case Schur complement (covariance) with the open-case non-Hermitian generator and its optical theorem, to make the dictionary immediately usable.
Simulated Author's Rebuttal
We thank the referee for the careful and constructive report. The comments correctly identify points where additional clarification would strengthen the manuscript. We respond to each major comment below.
read point-by-point responses
-
Referee: Testing section (three NN objects): the partitions applied to the truncated attention map, token-level transfer operator, and sparse expert router must be shown explicitly to eliminate a sector whose spectrum is continuous and whose dynamics are wave-like rather than relaxational; absent such verification, the reported absence of conserved-flux signatures follows by construction from the Schur-complement algebra and does not establish that no NN ensemble can realize the required open sector.
Authors: We agree that the partitions should be characterized explicitly to confirm the nature of the eliminated sectors. In the revised manuscript we will add explicit descriptions of the partitions for the truncated attention map, token-level transfer operator, and sparse expert router, together with arguments based on their finite dimensionality or dissipative character showing why the eliminated sectors lack continuous spectra and wave-like dynamics. This will make clear that the reported absence of open-case signatures follows from the structure of the tested objects rather than from the algebra alone. revision: yes
-
Referee: Abstract and conclusion: the claim that 'mainstream learning's finite or dissipative objects do not supply' the eliminated continuous-spectrum sector is load-bearing for the central negative; the three chosen examples are treated as representative, yet the manuscript provides no general argument that every possible partition of an NN object must eliminate only finite or dissipative sectors.
Authors: The three examples are presented as representative of mainstream finite or dissipative NN objects. We acknowledge that the manuscript supplies no general proof that every conceivable partition of every NN object must eliminate only finite or dissipative sectors; such a proof would require a classification of all possible architectures and partitions and lies outside the scope of this note. We will revise the abstract and conclusion to qualify the claim accordingly while preserving the structural diagnosis and the negative result for the tested cases. revision: partial
- A general argument establishing that no possible partition of any neural-network object can eliminate a continuous-spectrum sector with wave-like dynamics.
Circularity Check
Derivation self-contained; no load-bearing reductions to inputs or self-citations
full rationale
Closed-case results follow directly from moments, Gaussian algebra, and block inversion (Schur complement) with no fitted parameters or self-citations invoked as justification. Open-case negative result is anchored in external nuclear-reaction-theory results whose authors do not overlap with the present paper, so the central claim does not reduce to any quantity defined inside the paper's own objects or fits. The three tested partitions are presented as representative instances rather than tautological choices that force the negative outcome by construction. No step matches any enumerated circularity pattern.
Axiom & Free-Parameter Ledger
axioms (2)
- standard math Block inversion and Schur complement yield the marginal covariance and its inverse for a partitioned Gaussian
- domain assumption An eliminated sector with continuous spectrum produces a non-Hermitian effective generator obeying a generalized optical theorem
Reference graph
Works this paper leans on
-
[1]
Jin Lei. Reduced basis emulator for elastic scattering in continuum-discretized coupled- channels calculations.Phys. Rev. C, 113:044610, 2026. doi: 10.1103/n24x-d9gm. URL https://doi.org/10.1103/n24x-d9gm
-
[2]
Bidirectional neural networks for global nucleon-nucleus optical model calculations
Jin Lei. Bidirectional neural networks for global nucleon-nucleus optical model calculations. arXiv preprint, 2025. URLhttps://arxiv.org/abs/2512.22500. under review, Phys. Rev. C
Pith/arXiv arXiv 2025
-
[3]
Unified theory of nuclear reactions.Ann
Herman Feshbach. Unified theory of nuclear reactions.Ann. Phys. (N.Y.), 5:357–390,
-
[4]
URLhttps://doi.org/10.1016/0003-4916(58) 90007-1
doi: 10.1016/0003-4916(58)90007-1. URLhttps://doi.org/10.1016/0003-4916(58) 90007-1
-
[5]
A unified theory of nuclear reactions
Herman Feshbach. A unified theory of nuclear reactions. II.Ann. Phys. (N.Y.), 19:287–313,
-
[6]
URLhttps://doi.org/10.1016/0003-4916(62) 90221-X
doi: 10.1016/0003-4916(62)90221-X. URLhttps://doi.org/10.1016/0003-4916(62) 90221-X
-
[7]
R. J. Furnstahl, A. J. Garcia, P. J. Millican, and Xilin Zhang. Efficient emulators for scattering using eigenvector continuation.Phys. Lett. B, 809:135719, 2020. doi: 10.1016/j.physletb.2020. 135719. URLhttps://doi.org/10.1016/j.physletb.2020.135719
-
[8]
C. Drischler, J. A. Melendez, R. J. Furnstahl, A. J. Garcia, and Xilin Zhang. BUQEYE guide to projection-based emulators in nuclear physics.Front. Phys., 10:1092931, 2023. doi: 10.3389/fphy.2022.1092931. URLhttps://doi.org/10.3389/fphy.2022.1092931
-
[9]
Neal.Bayesian Learning for Neural Networks, volume 118 ofLecture Notes in Statistics
Radford M. Neal.Bayesian Learning for Neural Networks, volume 118 ofLecture Notes in Statistics. Springer, New York, 1996. ISBN 978-0-387-94724-2. doi: 10.1007/ 978-1-4612-0745-0. URLhttps://doi.org/10.1007/978-1-4612-0745-0
-
[10]
Roberts, Sho Yaida, and Boris Hanin.The Principles of Deep Learning Theory
Daniel A. Roberts, Sho Yaida, and Boris Hanin.The Principles of Deep Learning Theory. Cambridge University Press, 2022. URLhttps://arxiv.org/abs/2106.10165
arXiv 2022
-
[11]
N. Austern and C. M. Vincent. Inclusive breakup reactions.Phys. Rev. C, 23:1847, 1981. doi: 10.1103/PhysRevC.23.1847. URLhttps://doi.org/10.1103/PhysRevC.23.1847
-
[12]
M. Ichimura, N. Austern, and C. M. Vincent. Equivalence of post and prior sum rules for inclusive breakup reactions.Phys. Rev. C, 32:431, 1985. doi: 10.1103/PhysRevC.32.431. URL https://doi.org/10.1103/PhysRevC.32.431
-
[13]
Hao Liu, Jin Lei, and Zhongzhou Ren. Exact treatment of continuum couplings in nuclear optical potentials via feshbach theory.arXiv preprint, 2025. URLhttps://arxiv.org/abs/ 2508.07584. 22
arXiv 2025
-
[14]
R. Navarro P´ erez and Jin Lei. Is the unusual near-threshold potential behavior in elastic scattering of weakly-bound nuclei a precision error?Phys. Lett. B, 795:200, 2019. doi: 10. 1016/j.physletb.2019.06.005. URLhttps://doi.org/10.1016/j.physletb.2019.06.005
-
[15]
Neural tangent kernel: Convergence and generalization in neural networks
Arthur Jacot, Franck Gabriel, and Cl´ ement Hongler. Neural tangent kernel: Convergence and generalization in neural networks. InAdvances in Neural Information Processing Systems, volume 31, 2018. URLhttps://arxiv.org/abs/1806.07572
arXiv 2018
-
[16]
A. Ekstr¨ om and G. Hagen. Global sensitivity analysis of bulk properties of an atomic nucleus. Phys. Rev. Lett., 123:252501, 2019. doi: 10.1103/PhysRevLett.123.252501. URLhttps: //doi.org/10.1103/PhysRevLett.123.252501
-
[17]
D. Frame, R. He, I. Ipsen, D. Lee, D. Lee, and E. Rrapaj. Eigenvector continuation with subspace learning.Phys. Rev. Lett., 121:032501, 2018. doi: 10.1103/PhysRevLett.121.032501. URLhttps://doi.org/10.1103/PhysRevLett.121.032501
-
[18]
Jin Lei. Direct boundary matching: A bound-state technique for nuclear scattering with lagrange-legendre functions.Phys. Rev. C, 113:024614, 2026. doi: 10.1103/ddcx-cslb. URL https://doi.org/10.1103/ddcx-cslb
-
[19]
COLOSS: Complex-scaled optical and coulomb scattering solver.Comput
Junzhe Liu, Jin Lei, and Zhongzhou Ren. COLOSS: Complex-scaled optical and coulomb scattering solver.Comput. Phys. Commun., 311:109568, 2025. doi: 10.1016/j.cpc.2025.109568. URLhttps://doi.org/10.1016/j.cpc.2025.109568
-
[20]
G. Lindblad. On the generators of quantum dynamical semigroups.Commun. Math. Phys., 48:119–130, 1976. doi: 10.1007/BF01608499. URLhttps://doi.org/10.1007/BF01608499
-
[21]
A baseline for detecting misclassified and out-of- distribution examples in neural networks
Dan Hendrycks and Kevin Gimpel. A baseline for detecting misclassified and out-of- distribution examples in neural networks. InInternational Conference on Learning Repre- sentations (ICLR), 2017. URLhttps://arxiv.org/abs/1610.02136
Pith/arXiv arXiv 2017
-
[22]
Selective classification for deep neural networks
Yonatan Geifman and Ran El-Yaniv. Selective classification for deep neural networks. In Advances in Neural Information Processing Systems (NeurIPS), 2017. URLhttps://arxiv. org/abs/1705.08500
Pith/arXiv arXiv 2017
-
[23]
Naftali Tishby and Noga Zaslavsky. Deep learning and the information bottleneck principle. In2015 IEEE Information Theory Workshop (ITW), 2015. doi: 10.1109/ITW.2015.7133169. URLhttps://doi.org/10.1109/ITW.2015.7133169
-
[24]
Balaji Lakshminarayanan, A. Pritzel, and C. Blundell. Simple and scalable predictive uncer- tainty estimation using deep ensembles. InAdvances in Neural Information Processing Systems (NeurIPS), 2017. URLhttps://arxiv.org/abs/1612.01474
Pith/arXiv arXiv 2017
-
[25]
Ian Davidson, Michael Livanos, Antoine Gourru, Peter Walker, Julien Velcin, and S
Sebastian Farquhar, Jannik Kossen, Lorenz Kuhn, and Yarin Gal. Detecting halluci- nations in large language models using semantic entropy.Nature, 630:625, 2024. doi: 10.1038/s41586-024-07421-0. URLhttps://doi.org/10.1038/s41586-024-07421-0
-
[26]
T. Udagawa and T. Tamura. Derivation of breakup-fusion cross sections from the optical theorem.Phys. Rev. C, 24:1348, 1981. doi: 10.1103/PhysRevC.24.1348. URLhttps://doi. org/10.1103/PhysRevC.24.1348. 23
-
[27]
S. R. Cotanch. Coupled channels optical theorem and non-elastic cross section sum rule.Nucl. Phys. A, 842:48–58, 2010. doi: 10.1016/j.nuclphysa.2010.04.011. URLhttps://doi.org/10. 1016/j.nuclphysa.2010.04.011
-
[28]
Coherent absorption dynamics: The dual role of off- diagonal couplings in weakly bound nuclei.Phys
Hao Liu, Jin Lei, and Zhongzhou Ren. Coherent absorption dynamics: The dual role of off- diagonal couplings in weakly bound nuclei.Phys. Rev. C, 113:054601, 2026. doi: 10.1103/ bgwc-x5wj. URLhttps://doi.org/10.1103/bgwc-x5wj
-
[29]
Hao Liu, Jin Lei, and Zhongzhou Ren. Channel couplings redirect absorbed flux from periph- eral loss to fusion in weakly bound nuclear reactions.Phys. Lett. B, 877:140479, 2026. doi: 10. 1016/j.physletb.2026.140479. URLhttps://doi.org/10.1016/j.physletb.2026.140479
-
[30]
Dynamical origin of spectroscopic quenching in knockout reactions.arXiv preprint,
Jin Lei. Dynamical origin of spectroscopic quenching in knockout reactions.arXiv preprint,
- [31]
-
[32]
F. Perey and B. Buck. A non-local potential model for the scattering of neutrons by nuclei. Nucl. Phys., 32:353–380, 1962. doi: 10.1016/0029-5582(62)90345-0. URLhttps://doi.org/ 10.1016/0029-5582(62)90345-0
-
[33]
C. Mahaux and R. Sartor. Single-particle motion in nuclei. In J. W. Negele and E. Vogt, editors,Advances in Nuclear Physics, volume 20, pages 1–223. Springer, Boston, MA, 1991. doi: 10.1007/978-1-4613-9910-0 1. URLhttps://doi.org/10.1007/978-1-4613-9910-0_1
-
[34]
Exact construction and uniqueness of the coupled- channel green’s function.arXiv preprint, 2026
Hao Liu, Jin Lei, and Zhongzhou Ren. Exact construction and uniqueness of the coupled- channel green’s function.arXiv preprint, 2026. URLhttps://arxiv.org/abs/2604.00471
Pith/arXiv arXiv 2026
-
[35]
Assessing continuum channel importance in continuum-discretized coupled-channels via dynamic polarization potential decomposition
Jin Lei and Hao Liu. Assessing continuum channel importance in continuum-discretized coupled-channels via dynamic polarization potential decomposition. submitted to Phys. Rev. C (Letter), manuscript CRR1074, 2026
2026
-
[36]
W. D. Heiss. The physics of exceptional points.J. Phys. A: Math. Theor., 45:444016, 2012. doi: 10.1088/1751-8113/45/44/444016. URLhttps://doi.org/10.1088/1751-8113/45/44/ 444016
-
[37]
Carl M. Bender and Stefan Boettcher. Real spectra in non-hermitian hamiltonians having PT symmetry.Phys. Rev. Lett., 80:5243–5246, 1998. doi: 10.1103/PhysRevLett.80.5243. URL https://doi.org/10.1103/PhysRevLett.80.5243
-
[38]
J. A. Melendez, C. Drischler, R. J. Furnstahl, A. J. Garcia, and X. Zhang. Model reduction methods for nuclear emulators.J. Phys. G: Nucl. Part. Phys., 49:102001, 2022. doi: 10.1088/ 1361-6471/ac83dd. URLhttps://doi.org/10.1088/1361-6471/ac83dd
-
[39]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, and Illia Polosukhin. Attention is all you need. InAdvances in Neural Infor- mation Processing Systems, volume 30, 2017. URLhttps://arxiv.org/abs/1706.03762. 24 Neural network ensemble Nuclear reaction theory Status Closed, statistical correspondence (b...
Pith/arXiv arXiv 2017
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.