pith. sign in

arxiv: 1906.03058 · v4 · pith:APNXXUEGnew · submitted 2019-06-07 · 🧮 math.ST · math.OC· stat.TH

Robust subgaussian estimation of a mean vector in nearly linear time

classification 🧮 math.ST math.OCstat.TH
keywords ratealgorithmdatamathcaloutlierssigmaconstructionequation
0
0 comments X
read the original abstract

We construct an algorithm, running in time $\tilde{\mathcal O}(N d + uK d)$, which is robust to outliers and heavy-tailed data and which achieves the subgaussian rate from [Lugosi, Mendelson] \begin{equation}\label{eq:intro_subgaus_rate} \sqrt{\frac{{\rm Tr}(\Sigma)}{N}}+\sqrt{\frac{||\Sigma||_{op}K}{N}} \end{equation}with probability at least $1-\exp(-c_0K)-\exp(-c_1 u)$ where $\Sigma$ is the covariance matrix of the informative data, $K\in\{1, \ldots, K\}$ is some parameter (number of block means) and $u>0$ is another parameter of the algorithm. This rate is achieved when $K\geq c_1 |\mathcal O|$ where $|\mathcal O|$ is the number of outliers in the database and under the only assumption that the informative data have a second moment. The algorithm is fully data-dependent and does not use in its construction the proportion of outliers nor the rate above. Its construction combines recently developed tools for Median-of-Means estimators and covering-Semi-definite Programming [Chen, Diakonikolas, Ge] and [Peng, Tangwongsan, Zhang].

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. A Unified Approach to Robust Mean Estimation

    stat.ML 2019-07 unverdicted novelty 7.0

    A connection between Huber's contamination and heavy-tailed models yields unified robust mean estimators that are both computationally efficient and statistically optimal under certain conditions.

  2. Quantum Entropy Scoring for Fast Robust Mean Estimation and Improved Outlier Detection

    cs.DS 2019-06 unverdicted novelty 7.0

    QUE-scoring via quantum entropy regularization achieves optimal robust mean estimation in Õ(nd) time and outperforms prior outlier detection methods in experiments.