pith. sign in

arxiv: 1610.04599 · v1 · pith:3POPYJRDnew · submitted 2016-10-14 · 💻 cs.LG · math.ST· stat.ML· stat.TH

Data-Driven Threshold Machine: Scan Statistics, Change-Point Detection, and Extreme Bandits

classification 💻 cs.LG math.STstat.MLstat.TH
keywords thresholdextremalextremestatisticsapproachbanditschange-pointdata
0
0 comments X
read the original abstract

We present a novel distribution-free approach, the data-driven threshold machine (DTM), for a fundamental problem at the core of many learning tasks: choose a threshold for a given pre-specified level that bounds the tail probability of the maximum of a (possibly dependent but stationary) random sequence. We do not assume data distribution, but rather relying on the asymptotic distribution of extremal values, and reduce the problem to estimate three parameters of the extreme value distributions and the extremal index. We specially take care of data dependence via estimating extremal index since in many settings, such as scan statistics, change-point detection, and extreme bandits, where dependence in the sequence of statistics can be significant. Key features of our DTM also include robustness and the computational efficiency, and it only requires one sample path to form a reliable estimate of the threshold, in contrast to the Monte Carlo sampling approach which requires drawing a large number of sample paths. We demonstrate the good performance of DTM via numerical examples in various dependent settings.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.