Sever: A Robust Meta-Algorithm for Stochastic Optimization

Alistair Stewart; Daniel M. Kane; Gautam Kamath; Ilias Diakonikolas; Jacob Steinhardt; Jerry Li

arxiv: 1803.02815 · v2 · pith:6LNX2FOMnew · submitted 2018-03-07 · 💻 cs.LG · cs.AI· cs.DS· stat.ML

Sever: A Robust Meta-Algorithm for Stochastic Optimization

Ilias Diakonikolas , Gautam Kamath , Daniel M. Kane , Jerry Li , Jacob Steinhardt , Alistair Stewart This is my paper

classification 💻 cs.LG cs.AIcs.DSstat.ML

keywords dataseterrorbaselineslearnerseverachievedbasecompared

0 comments

read the original abstract

In high dimensions, most machine learning methods are brittle to even a small fraction of structured outliers. To address this, we introduce a new meta-algorithm that can take in a base learner such as least squares or stochastic gradient descent, and harden the learner to be resistant to outliers. Our method, Sever, possesses strong theoretical guarantees yet is also highly scalable -- beyond running the base learner itself, it only requires computing the top singular vector of a certain $n \times d$ matrix. We apply Sever on a drug design dataset and a spam classification dataset, and find that in both cases it has substantially greater robustness than several baselines. On the spam dataset, with $1\%$ corruptions, we achieved $7.4\%$ test error, compared to $13.4\%-20.5\%$ for the baselines, and $3\%$ error on the uncorrupted dataset. Similarly, on the drug design dataset, with $10\%$ corruptions, we achieved $1.42$ mean-squared error test error, compared to $1.51$-$2.33$ for the baselines, and $1.23$ error on the uncorrupted dataset.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

A Unified Approach to Robust Mean Estimation
stat.ML 2019-07 unverdicted novelty 7.0

A connection between Huber's contamination and heavy-tailed models yields unified robust mean estimators that are both computationally efficient and statistically optimal under certain conditions.