pith. sign in

arxiv: 1710.09447 · v2 · pith:CAG3K3HJnew · submitted 2017-10-25 · 🧮 math.OC · cs.LG· stat.ML

Stochastic Non-convex Optimization with Strong High Probability Second-order Convergence

classification 🧮 math.OC cs.LGstat.ML
keywords stochasticsecond-orderconvergencenon-convexhighoptimizationprobabilityalgorithms
0
0 comments X
read the original abstract

In this paper, we study stochastic non-convex optimization with non-convex random functions. Recent studies on non-convex optimization revolve around establishing second-order convergence, i.e., converging to a nearly second-order optimal stationary points. However, existing results on stochastic non-convex optimization are limited, especially with a high probability second-order convergence. We propose a novel updating step (named NCG-S) by leveraging a stochastic gradient and a noisy negative curvature of a stochastic Hessian, where the stochastic gradient and Hessian are based on a proper mini-batch of random functions. Building on this step, we develop two algorithms and establish their high probability second-order convergence. To the best of our knowledge, the proposed stochastic algorithms are the first with a second-order convergence in {\it high probability} and a time complexity that is {\it almost linear} in the problem's dimensionality.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.