Stochastic Recursive Gradient Algorithm for Nonconvex Optimization
classification
📊 stat.ML
cs.LGmath.OC
keywords
gradientnonconvexstochasticrecursivealgorithmconvergencefunctionslosses
read the original abstract
In this paper, we study and analyze the mini-batch version of StochAstic Recursive grAdient algoritHm (SARAH), a method employing the stochastic recursive gradient, for solving empirical loss minimization for the case of nonconvex losses. We provide a sublinear convergence rate (to stationary points) for general nonconvex functions and a linear convergence rate for gradient dominated functions, both of which have some advantages compared to other modern stochastic gradient algorithms for nonconvex losses.
This paper has not been read by Pith yet.
Forward citations
Cited by 1 Pith paper
-
Accelerating Mini-batch SARAH by Step Size Rules
MB-SARAH-RBB uses a random Barzilai-Borwein step size to accelerate mini-batch SARAH, with a linear convergence proof and improved complexity for strongly convex objectives.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.