Algebraic Networks and Architectural Degenerations
Pith reviewed 2026-06-26 22:10 UTC · model grok-4.3
The pith
For fully connected networks with non-increasing widths and scalar output, non-degenerate parameters yield smooth points on the associated neurovariety.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
For fully connected networks with non-increasing widths and scalar output, full parameters give smooth points of the corresponding neurovariety under explicit layerwise regularity assumptions. In particular, for these architectures, the singular locus is contained in the architectural degeneracy locus.
What carries the argument
The architectural degeneracy locus, the set of functions that admit a representation with a rank-deficient layer or an inactive hidden neuron; it is shown to contain all singular points of the neurovariety for the specified architectures.
If this is right
- Singularities of the neurovariety can be detected by checking whether a function admits a degenerate parameter representation.
- The symmetry groups and quotient parameter spaces defined in the framework classify distinct realizations of the same function.
- Geometric identifiability and reducibility become questions about the fibers of the realization map and the components of the neurovariety.
- The containment of the singular locus inside the degeneracy locus holds only for the listed width and output conditions together with the regularity assumptions.
Where Pith is reading between the lines
- The same containment might be tested numerically by sampling parameters and checking the rank of the Jacobian of the realization map at those points.
- Extending the framework beyond monomial activations would require replacing the polynomial realization map with a different algebraic map while preserving the degeneracy notion.
- If the containment holds more generally, then training algorithms that avoid degenerate parameters would automatically avoid singular points in parameter space.
Load-bearing premise
The networks must have non-increasing widths, scalar output, and satisfy the explicit layerwise regularity assumptions.
What would settle it
Exhibit a parameter point with full-rank layers and no inactive neurons whose image lies on a singular point of the neurovariety for one of the networks covered by the theorem.
read the original abstract
We study the geometry of polynomial neural networks with monomial activation functions and no bias. We introduce a general framework of algebraic networks, together with their realization maps and associated affine neurovarieties. In this setting we define morphisms, subnetworks, symmetry groups and quotient parameter spaces and we discuss geometric notions of identifiability and reducibility. Our main goal is to relate the singularities of neurovarieties to degenerations of the underlying architecture. For fully connected networks, we define the architectural degeneracy locus as the locus of functions admitting a representation by parameters with a rank-deficient layer or an inactive hidden neuron. We prove that, for fully connected networks with non-increasing widths and scalar output, full parameters give smooth points of the corresponding neurovariety under explicit layerwise regularity assumptions. In particular, for these architectures, the singular locus is contained in the architectural degeneracy locus.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces a framework of algebraic networks with monomial activations (no bias), defines realization maps, affine neurovarieties, morphisms, subnetworks, symmetry groups, and quotient spaces, and discusses identifiability and reducibility. Its central result is that, for fully connected networks with non-increasing widths and scalar output, parameters with full-rank layers and active neurons are smooth points of the neurovariety under explicit layerwise regularity assumptions; consequently the singular locus is contained in the architectural degeneracy locus (parameters with a rank-deficient layer or inactive hidden neuron).
Significance. If the containment holds, the work supplies a precise geometric link between singularities of the neurovariety and architectural degenerations, extending algebraic-geometry techniques to polynomial networks and offering tools for studying identifiability. The introduction of the general algebraic-network formalism and the explicit treatment of symmetry groups and quotients are concrete contributions that could be reused beyond the fully-connected scalar-output case.
major comments (1)
- [Abstract / main theorem] Abstract and main theorem (presumably §4 or §5): the smoothness statement is conditioned on separate layerwise regularity assumptions, yet the subsequent claim that 'the singular locus is contained in the architectural degeneracy locus' is stated unconditionally. If regularity can fail for some full-rank, active-neuron parameters, then singularities at those points would lie outside the degeneracy locus, falsifying the containment. The manuscript must either prove that regularity is automatic for full parameters or revise the containment statement to account for the regularity locus.
minor comments (1)
- [Introduction / §2] Notation for the realization map and the neurovariety could be introduced earlier and used consistently; several definitions (e.g., architectural degeneracy locus) appear only after the main claim is stated.
Simulated Author's Rebuttal
We thank the referee for the careful reading of the manuscript and for identifying this important point of clarification in the presentation of the main result. We address the comment below.
read point-by-point responses
-
Referee: [Abstract / main theorem] Abstract and main theorem (presumably §4 or §5): the smoothness statement is conditioned on separate layerwise regularity assumptions, yet the subsequent claim that 'the singular locus is contained in the architectural degeneracy locus' is stated unconditionally. If regularity can fail for some full-rank, active-neuron parameters, then singularities at those points would lie outside the degeneracy locus, falsifying the containment. The manuscript must either prove that regularity is automatic for full parameters or revise the containment statement to account for the regularity locus.
Authors: We agree that the current wording of the abstract and the main theorem statement risks being read as claiming the containment unconditionally. The smoothness result is proven only under the stated layerwise regularity assumptions in addition to the full-rank and active-neuron hypotheses. The 'in particular' clause in the abstract is intended to indicate that the containment follows from this conditional smoothness, but the logical dependence is not made fully explicit. We will revise both the abstract and the theorem statement to condition the containment claim explicitly on the regularity assumptions as well. This change will be incorporated in the revised manuscript. revision: yes
Circularity Check
No circularity: definitions and theorem are independent of inputs
full rationale
The paper defines algebraic networks, realization maps, neurovarieties, architectural degeneracy locus, and related notions from first principles, then states and proves a containment result (singular locus inside degeneracy locus) for fully connected networks under explicit layerwise regularity assumptions plus non-increasing widths and scalar output. No quoted step reduces a claimed prediction or uniqueness result to a fitted parameter, self-citation chain, or definitional tautology; the central claim is a standard containment theorem whose hypotheses are stated separately from the conclusion.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Aluffi , title =
P. Aluffi , title =. 2013 , pages =
2013
-
[2]
Ballico , title =
E. Ballico , title =. Linear and Multilinear Algebra , year =
-
[3]
M. M. Bronstein and J. Bruna and T. Cohen and P. Veli. arXiv preprint arXiv:2104.13478 , year =
-
[4]
H. S. arXiv preprint arXiv:2508.02723 , year =
-
[5]
V. Borovik and H. Friedman and S. Hosten and M. Pfeffer , title =. arXiv preprint arXiv:2512.06939 , year =
-
[6]
Breiding and S
P. Breiding and S. Timme , title =. 2018 , pages =
2018
-
[7]
K. Usevich and R. Borsoi and C. D. 2025 , note =. 2506.17093 , archivePrefix =
arXiv 2025
-
[8]
Chen and I
T. Chen and I. Goodfellow and J. Shlens , title =. 2016 , eprint =
2016
-
[9]
Draisma and E
J. Draisma and E. Horobet and G. Ottaviani and B. Sturmfels and R. R. Thomas , title =. Foundations of Computational Mathematics , year =
-
[10]
B. Finkel and J. I. Rodriguez and C. Wu and T. Yahl , title =. Algebraic Statistics , year =. doi:10.2140/astat.2025.16.113 , eprint =
-
[11]
Fulton , title =
W. Fulton , title =. 1998 , address =
1998
-
[12]
Gavranovi
B. Gavranovi. 2024 , url =
2024
-
[13]
T. L. G. Proceedings of the Indian Academy of Sciences. Mathematical Sciences , year =. doi:10.1007/BF02829538 , eprint =
-
[14]
Graziani , title =
G. Graziani , title =. 2026 , eprint =
2026
-
[15]
N. W. Henry and G. L. Marchetti and K. Kohn , title =. International Conference on Learning Representations , year =
-
[16]
B. Hollering and E. Mazzucchelli and M. Parisi and B. Sturmfels , title =. arXiv preprint arXiv:2511.21333 , year =
-
[17]
K. Kubjas and J. Li and M. Wiesmann , title =. Algebraic Statistics , year =. doi:10.2140/astat.2024.15.295 , eprint =
-
[18]
K. Kohn and T. Merkh and G. Mont. SIAM Journal on Applied Algebra and Geometry , year =. doi:10.1137/21M1441183 , url =
-
[19]
K. Kozhasov and A. Muniz and Y. Qi and L. Sodomaco , title =. Mathematics of Computation , year =. 2309.15105 , archivePrefix =
-
[20]
Kileel and M
J. Kileel and M. Trager and J. Bruna , title =. 2019 , eprint =
2019
-
[21]
J. M. Landsberg , title =. 2012 , address =
2012
-
[22]
A. Massarenti and M. Mella , title =. arXiv preprint arXiv:2511.19703 , year =
-
[23]
G. L. Marchetti and V. Shahverdi and S. Mereta and M. Trager and K. Kohn , title =. arXiv preprint arXiv:2501.18915 , year =
-
[24]
Mumford and J
D. Mumford and J. Fogarty and F. Kirwan , title =
-
[25]
Piene , title =
R. Piene , title =. Annales Scientifiques de l'
-
[26]
Pragacz , title =
P. Pragacz , title =. Annales Scientifiques de l'
-
[27]
I. R. Shafarevich , title =. 1994 , address =
1994
- [28]
-
[29]
Sonoda and N
S. Sonoda and N. Murata , title =. Neural Networks , year =
- [30]
-
[31]
A. J. Sommese and C. W. Wampler , title =. 2005 , address =
2005
-
[32]
Watanabe , title =
S. Watanabe , title =. 2009 , series =
2009
-
[33]
Weyman , title =
J. Weyman , title =. 2003 , address =
2003
-
[34]
Shahverdi and G
V. Shahverdi and G. L. Marchetti and K. Kohn , title =. 2026 , eprint =
2026
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.