pith. sign in

arxiv: 1902.00629 · v4 · pith:A5OWNPM5new · submitted 2019-02-02 · 📊 stat.ML · cs.LG· math.OC

Non-asymptotic Analysis of Biased Stochastic Approximation Scheme

classification 📊 stat.ML cs.LGmath.OC
keywords learningmethodanalysisapproximationfunctiongradientnon-asymptoticobjective
0
0 comments X
read the original abstract

Stochastic approximation (SA) is a key method used in statistical learning. Recently, its non-asymptotic convergence analysis has been considered in many papers. However, most of the prior analyses are made under restrictive assumptions such as unbiased gradient estimates and convex objective function, which significantly limit their applications to sophisticated tasks such as online and reinforcement learning. These restrictions are all essentially relaxed in this work. In particular, we analyze a general SA scheme to minimize a non-convex, smooth objective function. We consider update procedure whose drift term depends on a state-dependent Markov chain and the mean field is not necessarily of gradient type, covering approximate second-order method and allowing asymptotic bias for the one-step updates. We illustrate these settings with the online EM algorithm and the policy-gradient method for average reward maximization in reinforcement learning.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Multiscale reconstruction of protein conformations from cryo-EM images

    eess.IV 2026-06 unverdicted novelty 5.0

    A multiscale optimization method using explicit protein backbone geometry reconstructs atomic models from cryo-EM data, showing improved RMSD and TM scores on three simulated datasets.