Stochastic quasi-Newton with adaptive step lengths for large-scale problems

Adrian Wills; Thomas Schön

arxiv: 1802.04310 · v1 · pith:6W4POCKDnew · submitted 2018-02-12 · 📊 stat.ML · cs.LG

keywords: stochastic, problems, construction, large-scale, numerically, step, adapting, adaptive

We provide a numerically robust and fast method capable of exploiting the local geometry when solving large-scale stochastic optimisation problems. Our key innovation is an auxiliary variable construction coupled with an inverse Hessian approximation computed using a receding history of iterates and gradients. It is the Markov chain nature of the classic stochastic gradient algorithm that enables this development. The construction offers a mechanism for a stochastic line search that adapts the step length. We numerically evaluate the method and compare it against the current state of the art, with encouraging performance on real-world benchmark problems where the numbers of observations and unknowns are on the order of millions.
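
To make the phrase "inverse Hessian approximation computed using a receding history of iterates and gradients" concrete, here is a minimal sketch of the standard limited-memory BFGS two-loop recursion. It is a swapped-in textbook technique, not the authors' auxiliary variable construction or their stochastic line search, neither of which is specified in the abstract. The names `two_loop_direction`, `push_pair`, and `dot`, the curvature threshold `eps`, and the toy quadratic are all assumptions made for illustration.

```python
from collections import deque


def dot(a, b):
    """Inner product of two equal-length sequences of floats."""
    return sum(x * y for x, y in zip(a, b))


def two_loop_direction(grad, history):
    """Apply a limited-memory BFGS inverse-Hessian approximation to grad.

    history holds (s, y) pairs, oldest first, where s is a difference of
    iterates and y the matching difference of gradients. This is the
    textbook two-loop recursion, not the paper's construction.
    """
    q = list(grad)
    coeffs = []
    # First loop: newest pair to oldest.
    for s, y in reversed(history):
        rho = 1.0 / dot(y, s)
        alpha = rho * dot(s, q)
        q = [qi - alpha * yi for qi, yi in zip(q, y)]
        coeffs.append((alpha, rho))
    # Initial scaling from the newest pair (identity if no history yet).
    if history:
        s, y = history[-1]
        gamma = dot(s, y) / dot(y, y)
    else:
        gamma = 1.0
    r = [gamma * qi for qi in q]
    # Second loop: oldest pair to newest.
    for (s, y), (alpha, rho) in zip(history, reversed(coeffs)):
        beta = rho * dot(y, r)
        r = [ri + si * (alpha - beta) for ri, si in zip(r, s)]
    return r


def push_pair(history, s, y, eps=1e-10):
    """Store (s, y) only if the curvature s.y is positive.

    With noisy gradients s.y can be non-positive; skipping such pairs is
    one common safeguard (an assumption here, not taken from the paper).
    """
    if dot(s, y) > eps:
        history.append((s, y))


# Receding history: a bounded deque silently drops the oldest pair.
history = deque(maxlen=2)
# Toy quadratic with Hessian diag(1, 4, 9), so y = A s exactly.
push_pair(history, [1.0, 0.5, -0.2], [1.0, 2.0, -1.8])
push_pair(history, [0.3, -0.1, 0.4], [0.3, -0.4, 3.6])
direction = two_loop_direction([0.3, -0.4, 3.6], history)
```

A quasi-Newton step would then move along the negative of `direction`, scaled by a step length; in the paper that step length is adapted by a stochastic line search, which this sketch does not attempt to reproduce. The bounded `deque` is what makes the history "receding": memory and per-step cost stay proportional to the history length times the problem dimension.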

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Deep learning applied to computational mechanics: A comprehensive review, state of the art, and the classics
cs.LG 2022-12 unverdicted novelty 2.0

A comprehensive review of deep learning techniques for computational mechanics, including LSTM for constitutive modeling, PINNs for PDE solving, optimizers, and kernel methods.