Approximate Newton-based statistical inference using only stochastic gradients

Anastasios Kyrillidis; Constantine Caramanis; Liu Liu; Tianyang Li

arxiv: 1805.08920 · v2 · pith:RED652WRnew · submitted 2018-05-23 · 💻 cs.LG · cs.CV· math.OC· math.ST· stat.ML· stat.TH

Approximate Newton-based statistical inference using only stochastic gradients

Tianyang Li , Anastasios Kyrillidis , Liu Liu , Constantine Caramanis This is my paper

classification 💻 cs.LG cs.CVmath.OCmath.STstat.MLstat.TH

keywords statisticalinferencestochasticapproximateconvexframeworkinformationlearning

0 comments

read the original abstract

We present a novel statistical inference framework for convex empirical risk minimization, using approximate stochastic Newton steps. The proposed algorithm is based on the notion of finite differences and allows the approximation of a Hessian-vector product from first-order information. In theory, our method efficiently computes the statistical error covariance in $M$-estimation, both for unregularized convex learning problems and high-dimensional LASSO regression, without using exact second order information, or resampling the entire data set. We also present a stochastic gradient sampling scheme for statistical inference in non-i.i.d. time series analysis, where we sample contiguous blocks of indices. In practice, we demonstrate the effectiveness of our framework on large-scale machine learning problems, that go even beyond convexity: as a highlight, our work can be used to detect certain adversarial attacks on neural networks.

This paper has not been read by Pith yet.

Approximate Newton-based statistical inference using only stochastic gradients

discussion (0)