pith. sign in

arxiv: 1906.04554 · v1 · pith:JZ43ZQMXnew · submitted 2019-06-11 · 📊 stat.ML · cs.LG· cs.NE

Principled Training of Neural Networks with Direct Feedback Alignment

classification 📊 stat.ML cs.LGcs.NE
keywords alignmentmethodsfeedbacknetworksbestdirectlayersneural
0
0 comments X
read the original abstract

The backpropagation algorithm has long been the canonical training method for neural networks. Modern paradigms are implicitly optimized for it, and numerous guidelines exist to ensure its proper use. Recently, synthetic gradients methods -where the error gradient is only roughly approximated - have garnered interest. These methods not only better portray how biological brains are learning, but also open new computational possibilities, such as updating layers asynchronously. Even so, they have failed to scale past simple tasks like MNIST or CIFAR-10. This is in part due to a lack of standards, leading to ill-suited models and practices forbidding such methods from performing to the best of their abilities. In this work, we focus on direct feedback alignment and present a set of best practices justified by observations of the alignment angles. We characterize a bottleneck effect that prevents alignment in narrow layers, and hypothesize it may explain why feedback alignment methods have yet to scale to large convolutional networks.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Score Broadcast and Decorrelation: A General Framework for Broadcast-Based Credit Assignment

    cs.LG 2026-05 unverdicted novelty 7.0

    SBD unifies broadcast credit assignment for losses including cross-entropy via output-score orthogonality to hidden activations and reports performance gains on CIFAR-10 and Tiny ImageNet.

  2. Overcoming Rank Collapse in Feedback Alignment

    cs.LG 2026-06 unverdicted novelty 6.0

    Feedback alignment in deep networks is limited by low-rank error signals; orthogonal weight updates and activity normalization raise effective rank and boost performance.

  3. Mono-Forward: Revisiting Forward-Forward through Objective-Locality Decomposition

    cs.LG 2025-01 unverdicted novelty 6.0

    Mono-Forward replaces Forward-Forward's contrastive goodness with local multi-class cross-entropy, outperforming vanilla FF and sometimes backpropagation while using 31% of its memory on MLP-Mixers for PathMNIST.