pith. sign in

arxiv: 1905.00529 · v1 · pith:SYV4PQ3Hnew · submitted 2019-05-01 · 💻 cs.LG · math.OC· stat.ML

Stabilized SVRG: Simple Variance Reduction for Nonconvex Optimization

classification 💻 cs.LG math.OCstat.ML
keywords svrgepsilonsimplestationaryfindnonconvexpointsecond-order
0
0 comments X
read the original abstract

Variance reduction techniques like SVRG provide simple and fast algorithms for optimizing a convex finite-sum objective. For nonconvex objectives, these techniques can also find a first-order stationary point (with small gradient). However, in nonconvex optimization it is often crucial to find a second-order stationary point (with small gradient and almost PSD hessian). In this paper, we show that Stabilized SVRG (a simple variant of SVRG) can find an $\epsilon$-second-order stationary point using only $\widetilde{O}(n^{2/3}/\epsilon^2+n/\epsilon^{1.5})$ stochastic gradients. To our best knowledge, this is the first second-order guarantee for a simple variant of SVRG. The running time almost matches the known guarantees for finding $\epsilon$-first-order stationary points.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Distributed Learning in Non-Convex Environments -- Part I: Agreement at a Linear Rate

    math.OC 2019-07 unverdicted novelty 5.0

    Diffusion learning achieves linear-rate agreement around the network centroid in stochastic non-convex distributed optimization.