Concentration of the matrix-valued minimum mean-square error in optimal Bayesian inference

Jean Barbier

arxiv: 1907.07103 · v1 · pith:FEBKQMZYnew · submitted 2019-07-15 · 💻 cs.IT · cs.LG· eess.SP· math.IT· math.PR

Concentration of the matrix-valued minimum mean-square error in optimal Bayesian inference

Jean Barbier This is my paper

Pith reviewed 2026-05-24 21:06 UTC · model grok-4.3

classification 💻 cs.IT cs.LGeess.SPmath.ITmath.PR

keywords Bayesian inferenceminimum mean-square errorconcentration of measurespin glassesmutual informationspiked modelsgeneralized linear models

0 comments

The pith

In optimal Bayesian inference of vector-valued signals, the matrix-valued minimum mean-square error concentrates to a deterministic limit as dimensions grow large.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper shows that concentration techniques from spin-glass physics can be extended to prove that the matrix-valued MMSE becomes non-random in the large-system limit for Bayesian inference problems. This holds when the generative model and all parameters are known exactly. The result matters because many proofs of single-letter mutual information formulas rely on such concentration to replace random quantities by their expectations. It applies directly to models such as spiked matrices, tensors, committee machines, and multi-layer generalized linear models.

Core claim

Extending concentration techniques from the mathematical physics of spin glasses, we show that the matrix-valued minimum mean-square error concentrates when the size of the problem increases. Such results are often crucial for proving single-letter formulas for the mutual information when they exist. Our proof is valid in the optimal Bayesian inference setting, meaning that it relies on the assumption that the model and all its hyper-parameters are known.

What carries the argument

Spin-glass concentration techniques applied to the matrix-valued MMSE in the optimal Bayesian setting.

If this is right

Single-letter formulas for mutual information become provable once matrix MMSE concentration is established.
The result covers spiked matrix and tensor models in the large-size regime.
It covers the committee machine neural network in the teacher-student scenario with few hidden neurons.
It covers multi-layer generalized linear models under optimal Bayesian inference.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same concentration may hold in mismatched Bayesian settings if the model mismatch can be controlled uniformly.
The technique could be tested on other inference problems where vector or matrix observations appear, such as certain random-matrix estimation tasks.
If concentration holds, it opens the door to replacing matrix MMSE by its expectation inside more complex information-theoretic expressions.

Load-bearing premise

The inference model and all its hyper-parameters are known exactly to the observer.

What would settle it

A concrete spiked-matrix or tensor model of growing size in which the matrix-valued MMSE remains random with positive variance in the limit.

read the original abstract

We consider Bayesian inference of signals with vector-valued entries. Extending concentration techniques from the mathematical physics of spin glasses, we show that the matrix-valued minimum mean-square error concentrates when the size of the problem increases. Such results are often crucial for proving single-letter formulas for the mutual information when they exist. Our proof is valid in the optimal Bayesian inference setting, meaning that it relies on the assumption that the model and all its hyper-parameters are known. Examples of inference and learning problems covered by our results are spiked matrix and tensor models, the committee machine neural network with few hidden neurons in the teacher-student scenario, or multi-layers generalized linear models.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Barbier extends scalar MMSE concentration to the matrix case for vector signals under optimal Bayesian inference, giving a usable lemma for mutual information calculations in several spiked and GLM models.

read the letter

The central result is that the matrix-valued MMSE concentrates in the large-system limit when the estimator knows the exact generative model and all hyperparameters. This is presented as a direct extension of earlier scalar and spin-glass concentration arguments, and the abstract lists the usual target applications: spiked matrices and tensors, committee machines, and multi-layer GLMs. The paper is explicit that the claim holds only in the matched Bayesian setting, which removes one common source of overstatement. The technical step is therefore narrow but potentially useful for anyone trying to turn replica-symmetric formulas into rigorous single-letter expressions. The spin-glass machinery is invoked rather than re-derived, so the novelty sits in carrying the matrix structure through the existing concentration arguments without introducing new free parameters or post-hoc fitting. The main limitation is the optimal-Bayesian restriction itself; once hyperparameters must be learned or the model is misspecified, the concentration statement no longer applies and the paper does not claim otherwise. Because the full derivation is not reproduced in the abstract, any referee will still need to check the error-control steps when the signal entries are vectors rather than scalars. The citation pattern looks standard for this subfield and does not appear circular. Readers working on exact asymptotics for high-dimensional inference will find the lemma worth having on hand; the rest of the community can treat it as a supporting technical note. The work is coherent on its own terms and deserves a serious referee rather than a desk rejection.

Referee Report

0 major / 2 minor

Summary. The manuscript extends concentration-of-measure techniques from the mathematical physics of spin glasses to prove that the matrix-valued minimum mean-square error (MMSE) concentrates around its expectation in the large-system limit for a class of optimal Bayesian inference problems. The result is explicitly restricted to the setting in which the generative model and all hyperparameters are known to the estimator. The authors indicate that the concentration is useful for establishing single-letter mutual-information formulas and illustrate the scope with spiked matrix/tensor models, the committee machine, and multi-layer generalized linear models.

Significance. If the claimed concentration holds, the result supplies a technically useful lemma for asymptotic analysis of high-dimensional Bayesian inference. The explicit restriction to the optimal-Bayesian (known-model) regime is stated clearly and avoids over-claiming. The work therefore strengthens the toolbox available for proving exact asymptotic characterizations in information-theoretic learning problems.

minor comments (2)

Notation for the matrix-valued MMSE (e.g., the precise definition of the error matrix and its Frobenius or operator norm) should be introduced once in a dedicated preliminary section rather than inline in the main argument.
The statement of the main theorem would benefit from an explicit list of the technical conditions (growth rates, bounded moments, etc.) inherited from the spin-glass literature that are being invoked.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for their careful reading of the manuscript, their accurate summary of our contributions, and their recommendation to accept. No major comments were raised.

Circularity Check

0 steps flagged

No significant circularity identified

full rationale

The paper presents a mathematical derivation that extends established concentration techniques from spin-glass theory to prove concentration of the matrix-valued MMSE in the optimal Bayesian setting. The abstract explicitly frames the result as relying on external methods and the known-model assumption, with no equations or steps that reduce the claimed concentration to a self-defined quantity, a fitted parameter renamed as prediction, or a load-bearing self-citation chain. The derivation is therefore self-contained against external benchmarks and receives the default non-circularity finding.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the domain assumption of optimal Bayesian inference (model and hyperparameters known) and on importing concentration techniques from spin-glass theory without new entities or fitted parameters visible in the abstract.

axioms (1)

domain assumption The model and all its hyper-parameters are known (optimal Bayesian inference setting)
Explicitly stated in the abstract as the regime in which the proof holds.

pith-pipeline@v0.9.0 · 5634 in / 1135 out tokens · 39360 ms · 2026-05-24T21:06:43.119348+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

49 extracted references · 49 canonical work pages · 5 internal anchors

[1]

Ghirlanda and F

S. Ghirlanda and F. Guerra. General properties of overla p probability distributions in disordered spin systems. towards parisi u ltrametricity. Journal of Physics A: Mathematical and General , 31(46):9149, 1998

work page 1998
[2]

Guerra and F

F. Guerra and F. L. Toninelli. The thermodynamic limit in mean ﬁeld spin glass models. Communications in Mathematical Physics , 230(1):71–79, 2002

work page 2002
[3]

Talagrand

M. Talagrand. Spin glasses: a challenge for mathematicians: cavity and mean ﬁeld models , volume 46. Springer, 2003

work page 2003
[4]

Panchenko

D. Panchenko. The Sherrington-Kirkpatrick model . Springer Science & Business Media, 2013

work page 2013
[5]

Johnstone

I. Johnstone. On the distribution of the largest eigenva lue in principal components analysis. The Annals of statistics , 29(2):295–327, 2001

work page 2001
[6]

S. B. Korada and N. Macris. Exact solution of the gauge sym metric p- spin glass model on a complete graph. Journal of Statistical Physics , 136(2):205–230, 2009

work page 2009
[7]

Deshpande, E

Y . Deshpande, E. Abbe, and A. Montanari. Asymptotic mutu al infor- mation for the balanced binary stochastic block model. Information and Inference: A Journal of the IMA , 6(2):125–170, 2016

work page 2016
[8]

Krzakala, J

F. Krzakala, J. Xu, and L. Zdeborov´ a. Mutual informatio n in rank- one matrix estimation. In 2016 IEEE Information Theory W orkshop (ITW), pages 71–75, Sept 2016

work page 2016
[9]

Barbier, M

J. Barbier, M. Dia, N. Macris, F. Krzakala, T. Lesieur, an d L. Zde- borov´ a. Mutual information for symmetric rank-one matrixestimation: A proof of the replica formula. In Advances in Neural Information Processing Systems 29 , page 424432. 2016

work page 2016
[10]

Rank-one matrix estimation: analysis of algorithmic and information theoretic limits by the spatial coupling method

J. Barbier, M. Dia, N. Macris, F. Krzakala, and L. Zdebor ov´ a. Rank-one matrix estimation: analysis of algorithmic and in formation theoretic limits by the spatial coupling method. arXiv:1812.02537, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018
[11]

Lelarge and L

M. Lelarge and L. Miolane. Fundamental limits of symmet ric low- rank matrix estimation. Probability Theory and Related Fields , 173(3- 4):859–929, 2019

work page 2019
[12]

Lesieur, L

T. Lesieur, L. Miolane, M. Lelarge, F. Krzakala, and L. Z deborov´ a. Statistical and computational phase transitions in spiked tensor estima- tion. In 2017 IEEE International Symposium on Information Theory (ISIT), pages 511–515. IEEE, 2017

work page 2017
[13]

Barbier, N

J. Barbier, N. Macris, and L. Miolane. The Layered Struc ture of Tensor Estimation and its Mutual Information. In 55th Annual Allerton Conference on Communication, Control, and Computing , 2017

work page 2017
[14]

Barbier and N

J. Barbier and N. Macris. The adaptive interpolation me thod: a simple scheme to prove replica formulas in bayesian inference. Probability Theory and Related Fields , Oct 2018

work page 2018
[15]

El Alaoui and F

A. El Alaoui and F. Krzakala. Estimation in the Spiked Wi gner Model: A Short Proof of the Replica Formula. In IEEE International Symposium on Information Theory (ISIT) , 2017

work page 2017
[16]

J.-C. Mourrat. Hamilton-jacobi equations for mean-ﬁe ld disordered systems. preprint arXiv:1811.01432, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018
[17]

Barbier, C

J. Barbier, C. Luneau, and N. Macris. Mutual informatio n for low-rank even-order symmetric tensor factorization. In 2019 IEEE Information Theory W orkshop

work page 2019
[18]

J.-C. Mourrat. Hamilton-jacobi equations for ﬁnite-r ank matrix inference. preprint arXiv:1904.05294, 2019

work page internal anchor Pith review Pith/arXiv arXiv 1904
[19]

S. B. Korada and N. Macris. Tight bounds on the capacity o f binary input random CDMA systems. IEEE Trans. on Information Theory , 56(11):5590–5613, Nov 2010

work page 2010
[20]

Barbier, M

J. Barbier, M. Dia, N. Macris, and F. Krzakala. The Mutua l Infor- mation in Random Linear Estimation. In in the 54th Annual Allerton Conference on Communication, Control, and Computing , 2016

work page 2016
[21]

Barbier, N

J. Barbier, N. Macris, M. Dia, and F. Krzakala. Mutual in formation and optimality of approximate message-passing in random li near estimation. preprint arXiv:1701.05823, 2017

work page arXiv 2017
[22]

Reeves and H

G. Reeves and H. D. Pﬁster. The replica-symmetric predi ction for compressed sensing with gaussian matrices is exact. In IEEE International Symposium on Information Theory (ISIT) , pages 665– 669, 2016

work page 2016
[23]

Barbier, F

J. Barbier, F. Krzakala, N. Macris, L. Miolane, and L. Zd eborov´ a. Optimal errors and phase transitions in high-dimensional g eneralized linear models. Proceedings of the National Academy of Sciences , 116(12):5451–5460, 2019

work page 2019
[24]

Barbier, N

J. Barbier, N. Macris, A. Maillard, and F. Krzakala. The Mutual Information in Random Linear Estimation Beyond i.i.d. Matr ices. In IEEE International Symposium on Information Theory (ISIT) , 2018

work page 2018
[25]

Aubin, A

B. Aubin, A. Maillard, J. Barbier, F. Krzakala, N. Macri s, and L. Zdeborov´ a. The committee machine: Computational to sta tistical gaps in learning a two-layers neural network. In Advances in Neural Information Processing Systems 31 , pages 3227–3238, 2018

work page 2018
[26]

Gabri´ e, A

M. Gabri´ e, A. Manoel, C. Luneau, J. Barbier, N. Macris, F. Krzakala, and L. Zdeborov´ a. Entropy and mutual information in models of deep neural networks. In Advances in Neural Information Processing Systems 31 , pages 1824–1834. 2018

work page 2018
[27]

Coja-Oghlan, F

A. Coja-Oghlan, F. Krzakala, W. Perkins, and L. Zdeboro v´ a. Information-theoretic thresholds from the cavity method. In Pro- ceedings of the 49th Annual ACM SIGACT Symposium on Theory of Computing (STOC) , pages 146–157, 2017

work page 2017
[28]

E. Abbe. Community detection and stochastic block mode ls: Recent developments. Journal of Machine Learning Research , 2018

work page 2018
[29]

Barbier, C

J. Barbier, C. L. Chan, and N. Macris. Adaptive path inte rpolation for sparse systems: Application to a simple censored block m odel. In IEEE International Symposium on Information Theory (ISIT) , pages 1879–1883, 2018

work page 2018
[30]

Barbier and N

J. Barbier and N. Macris. The adaptive interpolation me thod for proving replica formulas. applications to the curie–weiss and wigner spike models. Journal of Physics A: Mathematical and Theoretical , 52(29):294002, jun 2019

work page 2019
[31]

N. Macris. Grifﬁth–kelly–sherman correlation inequa lities: A useful tool in the theory of error correcting codes. IEEE Transactions on Information Theory , 53(2):664–683, 2007

work page 2007
[32]

Kudekar and N

S. Kudekar and N. Macris. Sharp bounds for optimal decod ing of low-density parity-check codes. IEEE Transactions on Information Theory, 55(10):4635–4650, Oct 2009

work page 2009
[33]

Montanari

A. Montanari. Estimating random variables from random sparse observations. Europ. Trans. on Telecomm. , 19(4):385–403, 2008

work page 2008
[34]

Aizenman and P

M. Aizenman and P . Contucci. On the stability of the quen ched state in mean-ﬁeld spin-glass models. Journal of statistical physics , 92(5- 6):765–783, 1998

work page 1998
[35]

Contucci and C

P . Contucci and C. Giardina. Spin-glass stochastic sta bility: a rigorous proof. In Annales Henri Poincare , volume 6. Springer, 2005

work page 2005
[36]

Panchenko

D. Panchenko. The ghirlanda–guerra identities for mix ed p-spin model. Comptes Rendus Mathematique , 348(3-4):189–192, 2010

work page 2010
[37]

Schwarze and J

H. Schwarze and J. Hertz. Generalization in a large comm ittee machine. EPL (Europhysics Letters) , 20(4):375, 1992

work page 1992
[38]

Monasson and R

R. Monasson and R. Zecchina. Weight space structure and internal representations: a direct approach to learning and general ization in multilayer neural networks. Physical review letters , 75(12):2432, 1995

work page 1995
[39]

Engel and C

A. Engel and C. P . V an den Broeck. Statistical Mechanics of Learning . Cambridge University Press, 2001

work page 2001
[40]

Panchenko

D. Panchenko. Free energy in the potts spin glass. The Annals of Probability, 46(2):829–864, 2018

work page 2018
[41]

Panchenko

D. Panchenko. Free energy in the mixed p-spin models with vector spins. The Annals of Probability , 46(2):865–896, 2018

work page 2018
[42]

Agliari, D

E. Agliari, D. Migliozzi, and D. Tantari. Non-convex mu lti-species hopﬁeld models. Journal of Stat. Phys. , 172(5):1247–1269, 2018

work page 2018
[43]

Manoel, F

A. Manoel, F. Krzakala, M. Mzard, and L. Zdeborov´ a. Mul ti-layer generalized linear estimation. In IEEE International Symposium on Information Theory (ISIT) , 2017

work page 2017
[44]

G. Reeves. Additivity of Information in Multilayer Net works via Ad- ditive Gaussian Noise Transforms. In 55th Annual Allerton Conference on Communication, Control, and Computing , 2017

work page 2017
[45]

A. K. Fletcher and S. Rangan. Inference in Deep Networks in High Dimensions. arXiv:1706.06549, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017
[46]

Asymptotics of MAP Inference in Deep Networks

P . Pandit, M. Sahraee, S. Rangan, and A. K. Fletcher. Asy mptotics of map inference in deep networks. preprint arXiv:1903.01293, 2019

work page internal anchor Pith review Pith/arXiv arXiv 1903
[47]

Aubin, B

B. Aubin, B. Loureiro, A. Maillard, F. Krzakala, and L. Z de- borov´ a. The spiked matrix model with generative priors. preprint arXiv:1905.12385, 2019

work page arXiv 1905
[48]

R. A. Horn and C. R. Johnson. Matrix analysis. Cambridge university press, 1990

work page 1990
[49]

J. Barbier. Overlap matrix concentration in optimal ba yesian inference. preprint arXiv:1904.02808, 2019

work page arXiv 1904

[1] [1]

Ghirlanda and F

S. Ghirlanda and F. Guerra. General properties of overla p probability distributions in disordered spin systems. towards parisi u ltrametricity. Journal of Physics A: Mathematical and General , 31(46):9149, 1998

work page 1998

[2] [2]

Guerra and F

F. Guerra and F. L. Toninelli. The thermodynamic limit in mean ﬁeld spin glass models. Communications in Mathematical Physics , 230(1):71–79, 2002

work page 2002

[3] [3]

Talagrand

M. Talagrand. Spin glasses: a challenge for mathematicians: cavity and mean ﬁeld models , volume 46. Springer, 2003

work page 2003

[4] [4]

Panchenko

D. Panchenko. The Sherrington-Kirkpatrick model . Springer Science & Business Media, 2013

work page 2013

[5] [5]

Johnstone

I. Johnstone. On the distribution of the largest eigenva lue in principal components analysis. The Annals of statistics , 29(2):295–327, 2001

work page 2001

[6] [6]

S. B. Korada and N. Macris. Exact solution of the gauge sym metric p- spin glass model on a complete graph. Journal of Statistical Physics , 136(2):205–230, 2009

work page 2009

[7] [7]

Deshpande, E

Y . Deshpande, E. Abbe, and A. Montanari. Asymptotic mutu al infor- mation for the balanced binary stochastic block model. Information and Inference: A Journal of the IMA , 6(2):125–170, 2016

work page 2016

[8] [8]

Krzakala, J

F. Krzakala, J. Xu, and L. Zdeborov´ a. Mutual informatio n in rank- one matrix estimation. In 2016 IEEE Information Theory W orkshop (ITW), pages 71–75, Sept 2016

work page 2016

[9] [9]

Barbier, M

J. Barbier, M. Dia, N. Macris, F. Krzakala, T. Lesieur, an d L. Zde- borov´ a. Mutual information for symmetric rank-one matrixestimation: A proof of the replica formula. In Advances in Neural Information Processing Systems 29 , page 424432. 2016

work page 2016

[10] [10]

Rank-one matrix estimation: analysis of algorithmic and information theoretic limits by the spatial coupling method

J. Barbier, M. Dia, N. Macris, F. Krzakala, and L. Zdebor ov´ a. Rank-one matrix estimation: analysis of algorithmic and in formation theoretic limits by the spatial coupling method. arXiv:1812.02537, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018

[11] [11]

Lelarge and L

M. Lelarge and L. Miolane. Fundamental limits of symmet ric low- rank matrix estimation. Probability Theory and Related Fields , 173(3- 4):859–929, 2019

work page 2019

[12] [12]

Lesieur, L

T. Lesieur, L. Miolane, M. Lelarge, F. Krzakala, and L. Z deborov´ a. Statistical and computational phase transitions in spiked tensor estima- tion. In 2017 IEEE International Symposium on Information Theory (ISIT), pages 511–515. IEEE, 2017

work page 2017

[13] [13]

Barbier, N

J. Barbier, N. Macris, and L. Miolane. The Layered Struc ture of Tensor Estimation and its Mutual Information. In 55th Annual Allerton Conference on Communication, Control, and Computing , 2017

work page 2017

[14] [14]

Barbier and N

J. Barbier and N. Macris. The adaptive interpolation me thod: a simple scheme to prove replica formulas in bayesian inference. Probability Theory and Related Fields , Oct 2018

work page 2018

[15] [15]

El Alaoui and F

A. El Alaoui and F. Krzakala. Estimation in the Spiked Wi gner Model: A Short Proof of the Replica Formula. In IEEE International Symposium on Information Theory (ISIT) , 2017

work page 2017

[16] [16]

J.-C. Mourrat. Hamilton-jacobi equations for mean-ﬁe ld disordered systems. preprint arXiv:1811.01432, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018

[17] [17]

Barbier, C

J. Barbier, C. Luneau, and N. Macris. Mutual informatio n for low-rank even-order symmetric tensor factorization. In 2019 IEEE Information Theory W orkshop

work page 2019

[18] [18]

J.-C. Mourrat. Hamilton-jacobi equations for ﬁnite-r ank matrix inference. preprint arXiv:1904.05294, 2019

work page internal anchor Pith review Pith/arXiv arXiv 1904

[19] [19]

S. B. Korada and N. Macris. Tight bounds on the capacity o f binary input random CDMA systems. IEEE Trans. on Information Theory , 56(11):5590–5613, Nov 2010

work page 2010

[20] [20]

Barbier, M

J. Barbier, M. Dia, N. Macris, and F. Krzakala. The Mutua l Infor- mation in Random Linear Estimation. In in the 54th Annual Allerton Conference on Communication, Control, and Computing , 2016

work page 2016

[21] [21]

Barbier, N

J. Barbier, N. Macris, M. Dia, and F. Krzakala. Mutual in formation and optimality of approximate message-passing in random li near estimation. preprint arXiv:1701.05823, 2017

work page arXiv 2017

[22] [22]

Reeves and H

G. Reeves and H. D. Pﬁster. The replica-symmetric predi ction for compressed sensing with gaussian matrices is exact. In IEEE International Symposium on Information Theory (ISIT) , pages 665– 669, 2016

work page 2016

[23] [23]

Barbier, F

J. Barbier, F. Krzakala, N. Macris, L. Miolane, and L. Zd eborov´ a. Optimal errors and phase transitions in high-dimensional g eneralized linear models. Proceedings of the National Academy of Sciences , 116(12):5451–5460, 2019

work page 2019

[24] [24]

Barbier, N

J. Barbier, N. Macris, A. Maillard, and F. Krzakala. The Mutual Information in Random Linear Estimation Beyond i.i.d. Matr ices. In IEEE International Symposium on Information Theory (ISIT) , 2018

work page 2018

[25] [25]

Aubin, A

B. Aubin, A. Maillard, J. Barbier, F. Krzakala, N. Macri s, and L. Zdeborov´ a. The committee machine: Computational to sta tistical gaps in learning a two-layers neural network. In Advances in Neural Information Processing Systems 31 , pages 3227–3238, 2018

work page 2018

[26] [26]

Gabri´ e, A

M. Gabri´ e, A. Manoel, C. Luneau, J. Barbier, N. Macris, F. Krzakala, and L. Zdeborov´ a. Entropy and mutual information in models of deep neural networks. In Advances in Neural Information Processing Systems 31 , pages 1824–1834. 2018

work page 2018

[27] [27]

Coja-Oghlan, F

A. Coja-Oghlan, F. Krzakala, W. Perkins, and L. Zdeboro v´ a. Information-theoretic thresholds from the cavity method. In Pro- ceedings of the 49th Annual ACM SIGACT Symposium on Theory of Computing (STOC) , pages 146–157, 2017

work page 2017

[28] [28]

E. Abbe. Community detection and stochastic block mode ls: Recent developments. Journal of Machine Learning Research , 2018

work page 2018

[29] [29]

Barbier, C

J. Barbier, C. L. Chan, and N. Macris. Adaptive path inte rpolation for sparse systems: Application to a simple censored block m odel. In IEEE International Symposium on Information Theory (ISIT) , pages 1879–1883, 2018

work page 2018

[30] [30]

Barbier and N

J. Barbier and N. Macris. The adaptive interpolation me thod for proving replica formulas. applications to the curie–weiss and wigner spike models. Journal of Physics A: Mathematical and Theoretical , 52(29):294002, jun 2019

work page 2019

[31] [31]

N. Macris. Grifﬁth–kelly–sherman correlation inequa lities: A useful tool in the theory of error correcting codes. IEEE Transactions on Information Theory , 53(2):664–683, 2007

work page 2007

[32] [32]

Kudekar and N

S. Kudekar and N. Macris. Sharp bounds for optimal decod ing of low-density parity-check codes. IEEE Transactions on Information Theory, 55(10):4635–4650, Oct 2009

work page 2009

[33] [33]

Montanari

A. Montanari. Estimating random variables from random sparse observations. Europ. Trans. on Telecomm. , 19(4):385–403, 2008

work page 2008

[34] [34]

Aizenman and P

M. Aizenman and P . Contucci. On the stability of the quen ched state in mean-ﬁeld spin-glass models. Journal of statistical physics , 92(5- 6):765–783, 1998

work page 1998

[35] [35]

Contucci and C

P . Contucci and C. Giardina. Spin-glass stochastic sta bility: a rigorous proof. In Annales Henri Poincare , volume 6. Springer, 2005

work page 2005

[36] [36]

Panchenko

D. Panchenko. The ghirlanda–guerra identities for mix ed p-spin model. Comptes Rendus Mathematique , 348(3-4):189–192, 2010

work page 2010

[37] [37]

Schwarze and J

H. Schwarze and J. Hertz. Generalization in a large comm ittee machine. EPL (Europhysics Letters) , 20(4):375, 1992

work page 1992

[38] [38]

Monasson and R

R. Monasson and R. Zecchina. Weight space structure and internal representations: a direct approach to learning and general ization in multilayer neural networks. Physical review letters , 75(12):2432, 1995

work page 1995

[39] [39]

Engel and C

A. Engel and C. P . V an den Broeck. Statistical Mechanics of Learning . Cambridge University Press, 2001

work page 2001

[40] [40]

Panchenko

D. Panchenko. Free energy in the potts spin glass. The Annals of Probability, 46(2):829–864, 2018

work page 2018

[41] [41]

Panchenko

D. Panchenko. Free energy in the mixed p-spin models with vector spins. The Annals of Probability , 46(2):865–896, 2018

work page 2018

[42] [42]

Agliari, D

E. Agliari, D. Migliozzi, and D. Tantari. Non-convex mu lti-species hopﬁeld models. Journal of Stat. Phys. , 172(5):1247–1269, 2018

work page 2018

[43] [43]

Manoel, F

A. Manoel, F. Krzakala, M. Mzard, and L. Zdeborov´ a. Mul ti-layer generalized linear estimation. In IEEE International Symposium on Information Theory (ISIT) , 2017

work page 2017

[44] [44]

G. Reeves. Additivity of Information in Multilayer Net works via Ad- ditive Gaussian Noise Transforms. In 55th Annual Allerton Conference on Communication, Control, and Computing , 2017

work page 2017

[45] [45]

A. K. Fletcher and S. Rangan. Inference in Deep Networks in High Dimensions. arXiv:1706.06549, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017

[46] [46]

Asymptotics of MAP Inference in Deep Networks

P . Pandit, M. Sahraee, S. Rangan, and A. K. Fletcher. Asy mptotics of map inference in deep networks. preprint arXiv:1903.01293, 2019

work page internal anchor Pith review Pith/arXiv arXiv 1903

[47] [47]

Aubin, B

B. Aubin, B. Loureiro, A. Maillard, F. Krzakala, and L. Z de- borov´ a. The spiked matrix model with generative priors. preprint arXiv:1905.12385, 2019

work page arXiv 1905

[48] [48]

R. A. Horn and C. R. Johnson. Matrix analysis. Cambridge university press, 1990

work page 1990

[49] [49]

J. Barbier. Overlap matrix concentration in optimal ba yesian inference. preprint arXiv:1904.02808, 2019

work page arXiv 1904