pith. sign in

arxiv: 1805.08920 · v2 · pith:RED652WRnew · submitted 2018-05-23 · 💻 cs.LG · cs.CV· math.OC· math.ST· stat.ML· stat.TH

Approximate Newton-based statistical inference using only stochastic gradients

classification 💻 cs.LG cs.CVmath.OCmath.STstat.MLstat.TH
keywords statisticalinferencestochasticapproximateconvexframeworkinformationlearning
0
0 comments X
read the original abstract

We present a novel statistical inference framework for convex empirical risk minimization, using approximate stochastic Newton steps. The proposed algorithm is based on the notion of finite differences and allows the approximation of a Hessian-vector product from first-order information. In theory, our method efficiently computes the statistical error covariance in $M$-estimation, both for unregularized convex learning problems and high-dimensional LASSO regression, without using exact second order information, or resampling the entire data set. We also present a stochastic gradient sampling scheme for statistical inference in non-i.i.d. time series analysis, where we sample contiguous blocks of indices. In practice, we demonstrate the effectiveness of our framework on large-scale machine learning problems, that go even beyond convexity: as a highlight, our work can be used to detect certain adversarial attacks on neural networks.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.