High-dimensional classification by sparse logistic regression

Felix Abramovich; Vadim Grinshtein

arxiv: 1706.08344 · v3 · pith:6R6ZJCJ2new · submitted 2017-06-26 · 🧮 math.ST · stat.ML· stat.TH

High-dimensional classification by sparse logistic regression

Felix Abramovich , Vadim Grinshtein This is my paper

classification 🧮 math.ST stat.MLstat.TH

keywords complexityhigh-dimensionallogisticmodelregressionsparseadditionalbounds

0 comments

read the original abstract

We consider high-dimensional binary classification by sparse logistic regression. We propose a model/feature selection procedure based on penalized maximum likelihood with a complexity penalty on the model size and derive the non-asymptotic bounds for the resulting misclassification excess risk. The bounds can be reduced under the additional low-noise condition. The proposed complexity penalty is remarkably related to the VC-dimension of a set of sparse linear classifiers. Implementation of any complexity penalty-based criterion, however, requires a combinatorial search over all possible models. To find a model selection procedure computationally feasible for high-dimensional data, we extend the Slope estimator for logistic regression and show that under an additional weighted restricted eigenvalue condition it is rate-optimal in the minimax sense.

This paper has not been read by Pith yet.

High-dimensional classification by sparse logistic regression

discussion (0)