Online Active Linear Regression via Thresholding

Baosen Zhang; Carlos Riquelme; Ramesh Johari

arxiv: 1602.02845 · v4 · pith:ZOUEFMCZnew · submitted 2016-02-09 · stat.ML · cs.LG

keywords: algorithm, linear, regression, active, consider, high, online, benefits

We consider the problem of online active learning to collect data for regression modeling. Specifically, we consider a decision maker with a limited experimentation budget who must efficiently learn an underlying linear population model. Our main contribution is a novel threshold-based algorithm for selection of most informative observations; we characterize its performance and fundamental lower bounds. We extend the algorithm and its guarantees to sparse linear regression in high-dimensional settings. Simulations suggest the algorithm is remarkably robust: it provides significant benefits over passive random sampling in real-world datasets that exhibit high nonlinearity and high dimensionality --- significantly reducing both the mean and variance of the squared error.
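The abstract does not spell out the selection rule, so the following is only a minimal sketch of what a threshold-based online selection scheme could look like. It assumes that covariates arrive one at a time, that a point is labeled only when its Euclidean norm exceeds a fixed threshold, and that labeling stops once the budget is spent. The names `threshold_select` and `ols`, the threshold value, and the synthetic Gaussian data are all hypothetical illustrations, not the authors' algorithm or tuning.

```python
import random

def threshold_select(stream, label_fn, budget, threshold):
    """Label only points whose Euclidean norm exceeds `threshold` (assumed rule),
    stopping as soon as `budget` labels have been collected."""
    X, y = [], []
    for x in stream:
        if len(X) >= budget:
            break
        if sum(v * v for v in x) ** 0.5 > threshold:
            X.append(x)
            y.append(label_fn(x))
    return X, y

def ols(X, y):
    """Solve the normal equations (X^T X) b = X^T y by Gauss-Jordan elimination."""
    d = len(X[0])
    # Augmented matrix [X^T X | X^T y]
    A = [[sum(r[i] * r[j] for r in X) for j in range(d)]
         + [sum(r[i] * t for r, t in zip(X, y))] for i in range(d)]
    for c in range(d):
        # Partial pivoting for numerical stability
        p = max(range(c, d), key=lambda r: abs(A[r][c]))
        A[c], A[p] = A[p], A[c]
        for r in range(d):
            if r != c:
                f = A[r][c] / A[c][c]
                A[r] = [a - f * b for a, b in zip(A[r], A[c])]
    return [A[i][d] / A[i][i] for i in range(d)]

# Synthetic example (hypothetical data): linear model with unit Gaussian noise.
rng = random.Random(0)
beta = [1.0, -2.0, 0.5]
stream = [[rng.gauss(0, 1) for _ in beta] for _ in range(5000)]
label = lambda x: sum(b * v for b, v in zip(beta, x)) + rng.gauss(0, 1)

X, y = threshold_select(stream, label, budget=50, threshold=2.0)
beta_hat = ols(X, y)
```

The intuition behind this sketch is that large-norm covariates contribute more to X^T X, so a fixed labeling budget spent on them tends to yield a lower-variance least-squares estimate than the same budget spent on randomly chosen points.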
