pith. sign in

arxiv: 1907.00927 · v1 · pith:2YRWB6UBnew · submitted 2019-07-01 · 📊 stat.ML · cs.AI· cs.LG

A Unified Approach to Robust Mean Estimation

classification 📊 stat.ML cs.AIcs.LG
keywords estimatorsrobustconnectionhubermodelalgorithmsefficientestimation
0
0 comments X
read the original abstract

In this paper, we develop connections between two seemingly disparate, but central, models in robust statistics: Huber's epsilon-contamination model and the heavy-tailed noise model. We provide conditions under which this connection provides near-statistically-optimal estimators. Building on this connection, we provide a simple variant of recent computationally-efficient algorithms for mean estimation in Huber's model, which given our connection entails that the same efficient sample-pruning based estimators is simultaneously robust to heavy-tailed noise and Huber contamination. Furthermore, we complement our efficient algorithms with statistically-optimal albeit computationally intractable estimators, which are simultaneously optimally robust in both models. We study the empirical performance of our proposed estimators on synthetic datasets, and find that our methods convincingly outperform a variety of practical baselines.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Estimating location parameters in entangled single-sample distributions

    math.ST 2019-07 unverdicted novelty 7.0

    Proposes an adaptive hybrid estimator for common mean estimation under independent but non-identical symmetric unimodal distributions, with near-optimality guarantees even when only log n / n samples are low-noise.