Generic chaining and the l1-penalty
classification
🧮 math.ST
stat.TH
keywords
chaining, choice, generic, lambda, model, number, address, asymp
We address the choice of the tuning parameter $\lambda$ in $\ell_1$-penalized M-estimation. Our main concern is models that are highly nonlinear, such as the Gaussian mixture model. Moreover, the number of parameters $p$ is large, possibly larger than the number of observations $n$. The generic chaining technique of Talagrand [2005] is tailored for this problem. It leads to the choice $\lambda \asymp \sqrt{\log p / n}$, as in the standard Lasso procedure (which concerns the linear model and least squares loss).
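To make the rate $\lambda \asymp \sqrt{\log p / n}$ concrete, here is a minimal Python sketch that evaluates it for given $n$ and $p$. The function name `tuning_parameter` and the constant `c` are assumptions introduced for illustration; the abstract states only the order of magnitude, not a specific constant.

```python
import math


def tuning_parameter(n: int, p: int, c: float = 1.0) -> float:
    """Return c * sqrt(log(p) / n).

    n: number of observations, p: number of parameters.
    c is a hypothetical placeholder constant, not taken from the paper.
    """
    if n < 1 or p < 2:
        raise ValueError("need n >= 1 and p >= 2")
    return c * math.sqrt(math.log(p) / n)


if __name__ == "__main__":
    # The value shrinks as n grows and increases only logarithmically in p.
    for n, p in [(100, 1_000), (100, 1_000_000), (10_000, 1_000)]:
        print(f"n={n}, p={p}, lambda={tuning_parameter(n, p):.4f}")
```

Squaring $p$ multiplies this value by only $\sqrt{2}$, which is why the choice remains usable when $p$ exceeds $n$.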