pith. sign in

arxiv: 1805.01532 · v2 · pith:E5YMNHTYnew · submitted 2018-05-03 · 💻 cs.LG · stat.ML

Lifted Neural Networks

classification 💻 cs.LG stat.ML
keywords activationnetworksneuralproblemfunctionslayermodelstion
0
0 comments X
read the original abstract

We describe a novel family of models of multi- layer feedforward neural networks in which the activation functions are encoded via penalties in the training problem. Our approach is based on representing a non-decreasing activation function as the argmin of an appropriate convex optimiza- tion problem. The new framework allows for algo- rithms such as block-coordinate descent methods to be applied, in which each step is composed of a simple (no hidden layer) supervised learning problem that is parallelizable across data points and/or layers. Experiments indicate that the pro- posed models provide excellent initial guesses for weights for standard neural networks. In addi- tion, the model provides avenues for interesting extensions, such as robustness against noisy in- puts and optimizing over parameters in activation functions.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Augmented Lagrangian Predictive Coding

    cs.LG 2026-05 unverdicted novelty 7.0

    PC-ALM uses dual ascent on an augmented Lagrangian to achieve exact backpropagation gradients via layer-local updates in linear networks and matching performance in nonlinear networks up to depth 128.