Laplacian Smoothing Gradient Descent

Alex Lin; Bao Wang; Farzin Barekat; Minh Pham; Penghang Yin; Stanley Osher; Xiyang Luo

arxiv: 1806.06317 · v5 · pith:MLHUP6LBnew · submitted 2018-06-17 · 💻 cs.LG · math.NA · stat.ML

Stanley Osher, Bao Wang, Penghang Yin, Xiyang Luo, Farzin Barekat, Minh Pham, Alex Lin

keywords gradient · descent · component · convex · discrete · function · laplacian · optimization

We propose a class of very simple modifications of gradient descent and stochastic gradient descent. We show that when applied to a large variety of machine learning problems, ranging from logistic regression to deep neural nets, the proposed surrogates can dramatically reduce the variance, permit a larger step size, and improve the generalization accuracy. The methods only involve multiplying the usual (stochastic) gradient by the inverse of a positive definite matrix with a low condition number, which comes from a one-dimensional discrete Laplacian or its high-order generalizations; the product can be computed efficiently by FFT. This multiplication also preserves the mean of the gradient, increases its smallest component, and decreases its largest component. The theory of Hamilton-Jacobi partial differential equations demonstrates that the implicit version of the new algorithm is almost the same as doing gradient descent on a new function which (i) has the same global minima as the original function and (ii) is "more convex". Moreover, we show that optimization algorithms with these surrogates converge uniformly in the discrete Sobolev $H_\sigma^p$ sense and reduce the optimality gap for convex optimization problems. The code is available at: https://github.com/BaoWangMath/LaplacianSmoothing-GradientDescent
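The smoothing step described in the abstract can be illustrated with a minimal NumPy sketch. It is not taken from the authors' repository. It assumes the matrix has the form $A_\sigma = I - \sigma L$, where $L$ is the one-dimensional discrete Laplacian with periodic boundary conditions, so that $A_\sigma$ is circulant and can be inverted with an FFT. The function name `laplacian_smooth`, the parameter `sigma`, and the sample gradient are hypothetical choices made for illustration.

```python
import numpy as np


def laplacian_smooth(grad, sigma=1.0):
    """Return A_sigma^{-1} @ grad, assuming A_sigma = I - sigma * L.

    L is taken to be the 1-D discrete Laplacian with periodic boundary
    conditions (an assumption of this sketch), which makes A_sigma circulant.
    """
    g = np.asarray(grad, dtype=float).ravel()
    n = g.size
    if n < 3:
        raise ValueError("this sketch expects at least 3 components")

    # First column of A_sigma: 1 + 2*sigma on the diagonal and -sigma on the
    # two neighbouring entries, wrapping around at the ends.
    v = np.zeros(n)
    v[0] = 1.0 + 2.0 * sigma
    v[1] = -sigma
    v[-1] = -sigma

    # A circulant system is diagonalised by the DFT: the eigenvalues are
    # fft(v) = 1 + 2*sigma*(1 - cos(2*pi*k/n)) >= 1, so the division is safe.
    return np.real(np.fft.ifft(np.fft.fft(g) / np.fft.fft(v)))


# Illustrative gradient vector (made-up numbers).
g = np.array([3.0, -1.0, 4.0, 1.0, -5.0, 9.0, 2.0, -6.0])
s = laplacian_smooth(g, sigma=1.0)

print("mean before / after:", g.mean(), s.mean())
print("max  before / after:", g.max(), s.max())
print("min  before / after:", g.min(), s.min())
```

Under these assumptions, one smoothed descent step would read `w = w - lr * laplacian_smooth(grad, sigma)`, with `w`, `lr`, and `grad` standing for the parameters, step size, and (stochastic) gradient. Setting `sigma = 0` makes the matrix the identity and recovers the ordinary gradient.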

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Graph Interpolating Activation Improves Both Natural and Robust Accuracies in Data-Efficient Deep Learning
cs.LG 2019-07 unverdicted novelty 5.0

Graph Laplacian interpolating activation replaces softmax in DNNs and improves natural accuracy, robust accuracy, and data efficiency.