Optimal Errors and Phase Transitions in High-Dimensional Generalized Linear Models

Florent Krzakala; Jean Barbier; Lenka Zdeborov\'a; L\'eo Miolane; Nicolas Macris

arxiv: 1708.03395 · v3 · pith:JA3CPVDWnew · submitted 2017-08-10 · 💻 cs.IT · cond-mat.dis-nn· cs.AI· cs.LG· math-ph· math.IT· math.MP

Optimal Errors and Phase Transitions in High-Dimensional Generalized Linear Models

Jean Barbier , Florent Krzakala , Nicolas Macris , L\'eo Miolane , Lenka Zdeborov\'a This is my paper

classification 💻 cs.IT cond-mat.dis-nncs.AIcs.LGmath-phmath.ITmath.MP

keywords glmserrorsgeneralizedhigh-dimensionalmainmodelsoptimalpart

0 comments

read the original abstract

Generalized linear models (GLMs) arise in high-dimensional machine learning, statistics, communications and signal processing. In this paper we analyze GLMs when the data matrix is random, as relevant in problems such as compressed sensing, error-correcting codes or benchmark models in neural networks. We evaluate the mutual information (or "free entropy") from which we deduce the Bayes-optimal estimation and generalization errors. Our analysis applies to the high-dimensional limit where both the number of samples and the dimension are large and their ratio is fixed. Non-rigorous predictions for the optimal errors existed for special cases of GLMs, e.g. for the perceptron, in the field of statistical physics based on the so-called replica method. Our present paper rigorously establishes those decades old conjectures and brings forward their algorithmic interpretation in terms of performance of the generalized approximate message-passing algorithm. Furthermore, we tightly characterize, for many learning problems, regions of parameters for which this algorithm achieves the optimal performance, and locate the associated sharp phase transitions separating learnable and non-learnable regions. We believe that this random version of GLMs can serve as a challenging benchmark for multi-purpose algorithms. This paper is divided in two parts that can be read independently: The first part (main part) presents the model and main results, discusses some applications and sketches the main ideas of the proof. The second part (supplementary informations) is much more detailed and provides more examples as well as all the proofs.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

When Stronger Triggers Backfire: A High-Dimensional Theory of Backdoor Attacks
cs.LG 2026-05 unverdicted novelty 8.0

In the proportional high-dimensional regime, stronger backdoor training triggers improve clean accuracy and make attack success non-monotonic for regularized GLMs on Gaussian mixtures, with closed-form proofs for squa...
Ultrametric OGP - parametric RDT \emph{symmetric} binary perceptron connection
cs.LG 2026-04 unverdicted novelty 7.0

Upper bounds on ultrametric OGPs at levels 1 and 2 for symmetric binary perceptrons are approximately 1.6578 and 1.6219, closely matching the 3rd and 4th lifting-level parametric RDT estimates, supporting conjectures ...