Second-Order Stochastic Optimization for Machine Learning in Linear Time

Brian Bullins; Elad Hazan; Naman Agarwal

arxiv: 1602.03943 · v5 · pith:M6A33P7Qnew · submitted 2016-02-12 · 📊 stat.ML · cs.LG

Second-Order Stochastic Optimization for Machine Learning in Linear Time

Naman Agarwal , Brian Bullins , Elad Hazan This is my paper

classification 📊 stat.ML cs.LG

keywords methodssecond-orderlearningmachineoptimizationstochastictimecost

0 comments

read the original abstract

First-order stochastic methods are the state-of-the-art in large-scale machine learning optimization owing to efficient per-iteration complexity. Second-order methods, while able to provide faster convergence, have been much less explored due to the high cost of computing the second-order information. In this paper we develop second-order stochastic methods for optimization problems in machine learning that match the per-iteration cost of gradient based methods, and in certain settings improve upon the overall running time over popular first-order methods. Furthermore, our algorithm has the desirable property of being implementable in time linear in the sparsity of the input data.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Towards Certified Unlearning for Deep Neural Networks
cs.LG 2024-08 unverdicted novelty 5.0

Proposes simple techniques and inverse Hessian approximation to enable certified unlearning for nonconvex DNN objectives, including nonconvergent training and sequential unlearning requests.