Nonlinear Least Squares for Large-Scale Machine Learning using Stochastic Jacobian Estimates
classification
💻 cs.LG
cs.NAmath.NAstat.ML
keywords
jacobianlearningleastlossmachinenonlinearpropertysquares
read the original abstract
For large nonlinear least squares loss functions in machine learning we exploit the property that the number of model parameters typically exceeds the data in one batch. This implies a low-rank structure in the Hessian of the loss, which enables effective means to compute search directions. Using this property, we develop two algorithms that estimate Jacobian matrices and perform well when compared to state-of-the-art methods.
This paper has not been read by Pith yet.
Forward citations
Cited by 1 Pith paper
-
A Nonmonotone Gradient-Based Algorithm for Symmetric Nonnegative Matrix Factorization and Graph Clustering
SNMPBB adapts nonmonotone projected Barzilai-Borwein methods to symmetric NMF, proving convergence and demonstrating 6x speedups over SymANLS on synthetic data plus competitive or better results on real clustering ben...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.