Maximum likelihood thresholds of generic linear concentration models

Daniel Irving Bernstein; Louis Theran; Steven J. Gortler

arxiv: 2305.06280 · v2 · pith:SFKSIIBBnew · submitted 2023-05-10 · 🧮 math.ST · math.AG· stat.TH

Maximum likelihood thresholds of generic linear concentration models

Daniel Irving Bernstein , Steven J. Gortler , Louis Theran This is my paper

Pith reviewed 2026-05-24 08:51 UTC · model grok-4.3

classification 🧮 math.ST math.AGstat.TH

keywords linear concentration modelsmaximum likelihood thresholdgeneric modelssemi-algebraic setsdimension countalgebraic statisticsGaussian graphical models

0 comments

The pith

The maximum likelihood threshold of a generic linear concentration model equals the number predicted by a naive dimension count.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper determines the smallest number of data points needed to fit a generic linear concentration model by maximum likelihood estimation. This number turns out to match the value obtained by simply counting dimensions in the model, even though the threshold is governed by semi-algebraic conditions that could produce irregular behavior. The result applies when the linear space of the model sits in general position relative to the stratification that controls existence and uniqueness of the estimator. The authors also give a geometric description of the cases where a non-generic model deviates from the expected count. A sympathetic reader would care because this supplies an explicit, computable rule for when these models become identifiable from data.

Core claim

For generic linear concentration models the maximum likelihood threshold equals the number one might expect from a naive dimension count. This holds because the defining linear space is generic with respect to the semi-algebraic stratification that governs existence and uniqueness of the maximum-likelihood estimator. The paper also describes geometrically how a linear concentration model can fail to exhibit this generic behavior.

What carries the argument

generic linear concentration model, defined by a linear space of symmetric matrices in general position relative to the semi-algebraic stratification of the parameter space

If this is right

The threshold for generic models is given directly by the dimension formula without further semi-algebraic analysis.
Non-generic models can deviate from the dimension count, with the failures classifiable by geometric conditions on the linear space.
Computation of the threshold reduces to checking a genericity condition rather than solving the full semi-algebraic problem.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same dimension-count rule may apply to other statistical thresholds that are semi-algebraic but behave regularly on a dense open set.
One could test the result by enumerating all linear spaces up to a given dimension and verifying the threshold matches the count whenever genericity holds.
The geometric description of failures suggests a stratification of the space of all linear concentration models by their actual thresholds.

Load-bearing premise

The linear space defining the concentration model must be generic with respect to the semi-algebraic stratification that governs existence and uniqueness of the maximum-likelihood estimator.

What would settle it

An explicit low-dimensional linear concentration model whose linear space is generic yet requires a different number of samples for unique maximum-likelihood estimation than the dimension count predicts.

read the original abstract

The maximum likelihood threshold of a statistical model is the minimum number of datapoints required to fit the model via maximum likelihood estimation. In this paper we determine the maximum likelihood thresholds of generic linear concentration models. This turns out to be the number that one might expect from a naive dimension count, which is nontrivial to prove given that the maximum likelihood threshold is a semi-algebraic concept. We also describe geometrically how a linear concentration model can fail to exhibit this generic behavior.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Generic linear concentration models have ML thresholds equal to the naive dimension count, with a geometric description of the exceptions.

read the letter

The main point is that for generic linear concentration models the maximum likelihood threshold equals the dimension count from a naive linear algebra argument. The paper proves this equality holds despite the threshold being a semi-algebraic quantity, and it also describes the geometric locus where a model fails to behave generically. That resolves the sample-complexity question for this class of models in a sharp way. The result is new; earlier literature had not established that the semi-algebraic threshold coincides with the algebraic count under genericity. The argument targets the genericity condition directly by working with the relevant stratification, which avoids obvious circularity. The geometric description of failures adds a concrete handle on when the generic bound does not apply. A minor limitation is that the result is stated for generic models, so in a concrete application one still needs to confirm that the linear space sits outside the exceptional locus; the paper characterizes that locus but does not supply a general decision procedure. The work is aimed at researchers in algebraic statistics, graphical models, and covariance estimation. Anyone needing explicit, computable bounds on the number of samples required for maximum likelihood fitting will find it directly useful. The paper shows clear engagement with the literature and the underlying geometry, so it deserves a serious referee even if some details of the proof need checking.

Referee Report

0 major / 1 minor

Summary. The paper determines the maximum likelihood threshold (MLT) of generic linear concentration models, proving that it equals the value expected from a naive dimension count of the model parameters. This is shown to hold despite the semi-algebraic nature of the MLT; the authors also give a geometric description of the locus of linear spaces where the generic behavior fails.

Significance. If the result holds, it supplies a clean, parameter-free algebraic count for the sample size at which generic linear concentration models admit a unique MLE. The argument directly confronts the semi-algebraic stratification that governs existence and uniqueness, and the geometric characterization of non-generic failures is a useful byproduct for applications in Gaussian graphical models and algebraic statistics.

minor comments (1)

The abstract and introduction would benefit from an explicit statement of the dimension count formula (e.g., in terms of the rank or codimension of the linear space) before the main theorem is stated.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for their positive report and recommendation to accept the manuscript.

Circularity Check

0 steps flagged

No significant circularity

full rationale

The paper proves that the ML threshold of a generic linear concentration model equals the naive dimension count by directly analyzing the semi-algebraic stratification governing existence and uniqueness of the MLE. The central claim is established via geometric description of the non-generic locus rather than by fitting parameters, renaming known results, or load-bearing self-citations. The derivation is self-contained against external algebraic geometry benchmarks and does not reduce any prediction to its own inputs by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The proof relies on standard facts from algebraic geometry concerning generic linear spaces and the dimension of semi-algebraic sets; no free parameters or invented entities are introduced in the abstract.

axioms (1)

standard math Generic linear spaces intersect semi-algebraic strata in the expected dimension.
Invoked to equate the semi-algebraic threshold with the algebraic dimension count.

pith-pipeline@v0.9.0 · 5598 in / 1105 out tokens · 24483 ms · 2026-05-24T08:51:00.869143+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

23 extracted references · 23 canonical work pages

[1]

A. Y. Alfakih. Graph connectivity and universal rigidit y of bar frameworks. Discrete Appl. Math., 217(part 3):707–710, 2017. doi: 10.1016/j.dam.2016.10. 008

work page doi:10.1016/j.dam.2016.10 2017
[2]

A. I. Barvinok. Problems of distance geometry and convex properties of quadratic maps. Discrete & Computational Geometry , 13:189–202, 1995

work page 1995
[3]

S. Basu, R. Pollack, and M.-F. Roy. Algorithms in real algebraic geometry . Springer, 2006

work page 2006
[4]

Ben-David

E. Ben-David. Sharp lower and upper bounds for the gaussi an rank of a graph. Journal of Multivariate Analysis , 139:207–218, 2015

work page 2015
[5]

D. I. Bernstein, S. Dewar, S. J. Gortler, A. Nixon, M. Sith aram, and L. Theran. Maximum likelihood thresholds via graph rigidity. arXiv preprint arXiv:2108.02185 , 2021

work page arXiv 2021
[6]

D. I. Bernstein, S. Dewar, S. J. Gortler, A. Nixon, M. Sith aram, and L. Theran. Computing maximum likelihood thresholds using graph rigidity. arXiv preprint arXiv:2210.11081 , 2022

work page arXiv 2022
[7]

Blekherman and R

G. Blekherman and R. Sinn. Maximum likelihood threshold and generic completion rank of graphs. Discrete & Computational Geometry , 61:303–324, 2019

work page 2019
[8]

Bochnak, M

J. Bochnak, M. Coste, and M.-F. Roy. Real algebraic geometry , volume 36. Springer Science & Business Media, 2013. 2In the rigidity theory literature, an equilibrium stress ma trix for a graph G on n vertices refers to an n × n symmetric matrix Ω with vanishing row sums and Ω ij = 0 for every non-edge {i, j} of G. There is an invertible linear map from ou...

work page 2013
[9]

E. D. Bolker and B. Roth. When is a bipartite graph a rigid f rame- work? Paciﬁc J. Math. , 90(1):27–44, 1980. ISSN 0030-8730. URL http://projecteuclid.org/getRecord?id=euclid.pjm/1102779115

work page arXiv 1980
[10]

Connelly, S

R. Connelly, S. J. Gortler, and L. Theran. Aﬃne rigidity and conics at inﬁnity. International Mathematics Research Notices , 2018(13):4084–4102, 2018

work page 2018
[11]

Connelly, S

R. Connelly, S. J. Gortler, and L. Theran. Generically g lobally rigid graphs have generic universally rigid frameworks. Combinatorica, 40(1):1–37, 2020

work page 2020
[12]

A. P. Dempster. Covariance selection. Biometrics, pages 157–175, 1972

work page 1972
[13]

Dobra, C

A. Dobra, C. Hans, B. Jones, J. R. Nevins, G. Yao, and M. We st. Sparse graphical models for exploring gene expression data. Journal of Multivariate Analysis , 90(1):196–212, 2004

work page 2004
[14]

Gross and S

E. Gross and S. Sullivant. The maximum likelihood thres hold of a graph. Bernoulli, 24(1):386 – 407, 2018. doi: 10.3150/16-BEJ881. URL https://doi.org/10.3150/16-BEJ881

work page doi:10.3150/16-bej881 2018
[15]

Guillemin and A

V. Guillemin and A. Pollack. Diﬀerential topology, volume 370. American Mathematical Soc., 2010

work page 2010
[16]

J. Harris. Algebraic geometry: a ﬁrst course , volume 133. Springer Science & Business Media, 2013

work page 2013
[17]

Hastie, R

T. Hastie, R. Tibshirani, and J. Friedman. The elements of statistical learning . Springer Series in Statistics. Springer, New York, second edition, 2009. do i: 10.1007/978-0-387-84858-7. Data mining, inference, and prediction

work page doi:10.1007/978-0-387-84858-7 2009
[18]

Krumsiek, K

J. Krumsiek, K. Suhre, T. Illig, J. Adamski, and F. J. The is. Gaussian graphical modeling re- constructs pathway reactions from high-throughput metabo lomics data. BMC systems biology , 5(1):1–16, 2011

work page 2011
[19]

G. Pataki. Cone-lp’s and semideﬁnite programs: Geomet ry and a simplex-type method. In Integer Programming and Combinatorial Optimization: 5th Inte rnational IPCO Conference Vancouver, British Columbia, Canada, June 3–5, 1996 Proceedi ngs 5 , pages 162–174. Springer, 1996

work page 1996
[20]

G. Pataki. The geometry of semideﬁnite programming. In Handbook of semideﬁnite program- ming, pages 29–65. Springer, 2000

work page 2000
[21]

Sch¨ afer and K

J. Sch¨ afer and K. Strimmer. An empirical Bayes approac h to inferring large-scale gene asso- ciation networks. Bioinformatics, 21(6):754–764, 2005

work page 2005
[22]

C. Uhler. Geometry of maximum likelihood estimation in gaussian graphical models. The Annals of Statistics , 40(1):238–261, 2012

work page 2012
[23]

X. Wu, Y. Ye, and K. R. Subramanian. Interactive analysi s of gene interactions using graph- ical gaussian model. In Proceedings of the 3rd International Conference on Data Minin g in Bioinformatics, pages 63–69, 2003. 16

work page 2003

[1] [1]

A. Y. Alfakih. Graph connectivity and universal rigidit y of bar frameworks. Discrete Appl. Math., 217(part 3):707–710, 2017. doi: 10.1016/j.dam.2016.10. 008

work page doi:10.1016/j.dam.2016.10 2017

[2] [2]

A. I. Barvinok. Problems of distance geometry and convex properties of quadratic maps. Discrete & Computational Geometry , 13:189–202, 1995

work page 1995

[3] [3]

S. Basu, R. Pollack, and M.-F. Roy. Algorithms in real algebraic geometry . Springer, 2006

work page 2006

[4] [4]

Ben-David

E. Ben-David. Sharp lower and upper bounds for the gaussi an rank of a graph. Journal of Multivariate Analysis , 139:207–218, 2015

work page 2015

[5] [5]

D. I. Bernstein, S. Dewar, S. J. Gortler, A. Nixon, M. Sith aram, and L. Theran. Maximum likelihood thresholds via graph rigidity. arXiv preprint arXiv:2108.02185 , 2021

work page arXiv 2021

[6] [6]

D. I. Bernstein, S. Dewar, S. J. Gortler, A. Nixon, M. Sith aram, and L. Theran. Computing maximum likelihood thresholds using graph rigidity. arXiv preprint arXiv:2210.11081 , 2022

work page arXiv 2022

[7] [7]

Blekherman and R

G. Blekherman and R. Sinn. Maximum likelihood threshold and generic completion rank of graphs. Discrete & Computational Geometry , 61:303–324, 2019

work page 2019

[8] [8]

Bochnak, M

J. Bochnak, M. Coste, and M.-F. Roy. Real algebraic geometry , volume 36. Springer Science & Business Media, 2013. 2In the rigidity theory literature, an equilibrium stress ma trix for a graph G on n vertices refers to an n × n symmetric matrix Ω with vanishing row sums and Ω ij = 0 for every non-edge {i, j} of G. There is an invertible linear map from ou...

work page 2013

[9] [9]

E. D. Bolker and B. Roth. When is a bipartite graph a rigid f rame- work? Paciﬁc J. Math. , 90(1):27–44, 1980. ISSN 0030-8730. URL http://projecteuclid.org/getRecord?id=euclid.pjm/1102779115

work page arXiv 1980

[10] [10]

Connelly, S

R. Connelly, S. J. Gortler, and L. Theran. Aﬃne rigidity and conics at inﬁnity. International Mathematics Research Notices , 2018(13):4084–4102, 2018

work page 2018

[11] [11]

Connelly, S

R. Connelly, S. J. Gortler, and L. Theran. Generically g lobally rigid graphs have generic universally rigid frameworks. Combinatorica, 40(1):1–37, 2020

work page 2020

[12] [12]

A. P. Dempster. Covariance selection. Biometrics, pages 157–175, 1972

work page 1972

[13] [13]

Dobra, C

A. Dobra, C. Hans, B. Jones, J. R. Nevins, G. Yao, and M. We st. Sparse graphical models for exploring gene expression data. Journal of Multivariate Analysis , 90(1):196–212, 2004

work page 2004

[14] [14]

Gross and S

E. Gross and S. Sullivant. The maximum likelihood thres hold of a graph. Bernoulli, 24(1):386 – 407, 2018. doi: 10.3150/16-BEJ881. URL https://doi.org/10.3150/16-BEJ881

work page doi:10.3150/16-bej881 2018

[15] [15]

Guillemin and A

V. Guillemin and A. Pollack. Diﬀerential topology, volume 370. American Mathematical Soc., 2010

work page 2010

[16] [16]

J. Harris. Algebraic geometry: a ﬁrst course , volume 133. Springer Science & Business Media, 2013

work page 2013

[17] [17]

Hastie, R

T. Hastie, R. Tibshirani, and J. Friedman. The elements of statistical learning . Springer Series in Statistics. Springer, New York, second edition, 2009. do i: 10.1007/978-0-387-84858-7. Data mining, inference, and prediction

work page doi:10.1007/978-0-387-84858-7 2009

[18] [18]

Krumsiek, K

J. Krumsiek, K. Suhre, T. Illig, J. Adamski, and F. J. The is. Gaussian graphical modeling re- constructs pathway reactions from high-throughput metabo lomics data. BMC systems biology , 5(1):1–16, 2011

work page 2011

[19] [19]

G. Pataki. Cone-lp’s and semideﬁnite programs: Geomet ry and a simplex-type method. In Integer Programming and Combinatorial Optimization: 5th Inte rnational IPCO Conference Vancouver, British Columbia, Canada, June 3–5, 1996 Proceedi ngs 5 , pages 162–174. Springer, 1996

work page 1996

[20] [20]

G. Pataki. The geometry of semideﬁnite programming. In Handbook of semideﬁnite program- ming, pages 29–65. Springer, 2000

work page 2000

[21] [21]

Sch¨ afer and K

J. Sch¨ afer and K. Strimmer. An empirical Bayes approac h to inferring large-scale gene asso- ciation networks. Bioinformatics, 21(6):754–764, 2005

work page 2005

[22] [22]

C. Uhler. Geometry of maximum likelihood estimation in gaussian graphical models. The Annals of Statistics , 40(1):238–261, 2012

work page 2012

[23] [23]

X. Wu, Y. Ye, and K. R. Subramanian. Interactive analysi s of gene interactions using graph- ical gaussian model. In Proceedings of the 3rd International Conference on Data Minin g in Bioinformatics, pages 63–69, 2003. 16

work page 2003