Nonlinear Least Squares for Large-Scale Machine Learning using Stochastic Jacobian Estimates

Johannes J. Brust

arxiv: 2107.05598 · v1 · pith:A3LO2NRWnew · submitted 2021-07-12 · 💻 cs.LG · cs.NA· math.NA· stat.ML

Nonlinear Least Squares for Large-Scale Machine Learning using Stochastic Jacobian Estimates

Johannes J. Brust This is my paper

classification 💻 cs.LG cs.NAmath.NAstat.ML

keywords jacobianlearningleastlossmachinenonlinearpropertysquares

0 comments

read the original abstract

For large nonlinear least squares loss functions in machine learning we exploit the property that the number of model parameters typically exceeds the data in one batch. This implies a low-rank structure in the Hessian of the loss, which enables effective means to compute search directions. Using this property, we develop two algorithms that estimate Jacobian matrices and perform well when compared to state-of-the-art methods.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

A Nonmonotone Gradient-Based Algorithm for Symmetric Nonnegative Matrix Factorization and Graph Clustering
cs.LG 2026-06 unverdicted novelty 6.0

SNMPBB adapts nonmonotone projected Barzilai-Borwein methods to symmetric NMF, proving convergence and demonstrating 6x speedups over SymANLS on synthetic data plus competitive or better results on real clustering ben...