hub

The forward-forward algorithm: Some preliminary investigations

· 2022 · arXiv 2212.13345

17 Pith papers cite this work. Polarity classification is still indexing.

17 Pith papers citing it

read on arXiv browse 17 citing papers

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 3

citation-polarity summary

background 3

representative citing papers

Training Non-Differentiable Networks via Optimal Transport

cs.LG · 2026-05-03 · unverdicted · novelty 8.0

PolyStep optimizes non-differentiable networks via forward-only polytope evaluations and optimal-transport barycentric updates, reaching 93.4% accuracy on hard-LIF spiking networks while outperforming gradient-free baselines.

NeuroTrain: Surveying Local Learning Rules for Spiking Neural Networks with an Open Benchmarking Framework

cs.NE · 2026-05-14 · unverdicted · novelty 7.0

A taxonomy of SNN training algorithms is presented with the release of NeuroTrain, an open benchmarking framework for reproducible comparisons across datasets and architectures.

Selectivity and Shape in the Design of Forward-Forward Goodness Functions

cs.LG · 2026-03-28 · unverdicted · novelty 7.0

Shape- and peak-sensitive goodness functions for Forward-Forward deliver up to 72pp gains over sum-of-squares, reaching 98.2% on MNIST and 89% on Fashion-MNIST.

Self-supervised local learning rules learn the hidden hierarchical structure of high-dimensional data

cs.LG · 2026-05-18 · unverdicted · novelty 6.0

Layerwise self-supervised local rules learn the hierarchical structure of the Random Hierarchy Model as data-efficiently as supervised backpropagation, while direct feedback approximations fail due to missing masking nonlinearities.

Adaptive Multi-Scale Goodness Aggregation for Forward-Forward Learning

cs.LG · 2026-05-11 · unverdicted · novelty 6.0

AMSGA extends Forward-Forward learning via multi-scale goodness aggregation, curriculum-guided hard negative mining, and adaptive thresholds, reporting up to 1.5% accuracy gains on MNIST and Fashion-MNIST.

Cumulative-Goodness Free-Riding in Forward-Forward Networks: Real, Repairable, but Not Accuracy-Dominant

cs.LG · 2026-05-07 · unverdicted · novelty 6.0

Cumulative-goodness Forward-Forward networks exhibit layer free-riding where discrimination gradients decay exponentially with prior positive margins; per-block, hardness-gated, and depth-scaled remedies yield 4-45x better layer separation but <1% accuracy change on CIFAR and Tiny ImageNet.

A Synthesizable RTL Implementation of Predictive Coding Networks

cs.NE · 2026-03-18 · unverdicted · novelty 6.0

A complete RTL substrate executes discrete-time predictive coding dynamics directly in hardware with fixed local rules and adjacent-layer communication only.

Low-rank surrogate modeling and stochastic zero-order optimization for training of neural networks with black-box layers

cs.LG · 2025-09-18 · unverdicted · novelty 6.0

A framework combining stochastic zeroth-order optimization and dynamic low-rank surrogate modeling with an implicit projector-splitting integrator enables end-to-end training of hybrid neural networks containing black-box physical layers and reaches near-digital accuracy on vision, audio, and text任务

MemFlow: A Lightweight Forward Memorizing Framework for Quick Domain Adaptive Feature Mapping

cs.NE · 2024-02-04 · unverdicted · novelty 6.0

MemFlow uses forward-only memorization via randomly connected neurons and spiking signals on a frozen backbone for unsupervised domain adaptation, claiming up to 10% gains at under 1% of traditional compute costs.

Some[Body] Must Receive That Pain for Agent Accountability

cs.CY · 2026-05-16 · unverdicted · novelty 5.0

AI agents lack the persistent identity and feedback mechanisms needed for consequence reception, requiring new architectures or continued human accountability.

Position: Zeroth-Order Optimization in Deep Learning Is Underexplored, Not Underpowered

cs.LG · 2026-05-15 · unverdicted · novelty 5.0

Zeroth-order optimization is underexplored rather than underpowered in deep learning, with limitations stemming from full-space designs that can be addressed via subspace, spectral, and systems-aware approaches.

FAAST: Forward-Only Associative Learning via Closed-Form Fast Weights for Test-Time Supervised Adaptation

cs.LG · 2026-05-06 · unverdicted · novelty 5.0 · 2 refs

FAAST performs test-time supervised adaptation by analytically deriving fast weights from examples in one forward pass, matching backprop performance with over 90% less adaptation time and up to 95% memory savings versus memory-based methods.

Physical Foundation Models: Fixed hardware implementations of large-scale neural networks

cs.LG · 2026-04-30 · unverdicted · novelty 5.0

Physical Foundation Models are fixed physical hardware realizations of foundation-scale neural networks that compute via inherent material dynamics, potentially delivering orders-of-magnitude gains in energy efficiency, speed, and density over digital systems.

Introducing Echo Networks for Computational Neuroevolution

cs.LG · 2026-04-09 · unverdicted · novelty 5.0

Echo Networks are recurrent networks defined by a single connection matrix with no layers, enabling matrix-based mutation and recombination in neuroevolution, and demonstrated on ECG signal classification.

LightTune: Lightweight Forward-Only Online Fine-Tuning with Applications to Link Adaptation

cs.NI · 2026-04-14 · unverdicted · novelty 4.0 · 2 refs

LightTune is a backpropagation-free online fine-tuning framework that reduces BLER prediction error by up to 48.8% and improves throughput by 15.5% in 6G link adaptation.

Grid-SD2E: A General Grid-Feedback in a System for Cognitive Learning

cs.AI · 2023-04-04 · unverdicted · novelty 3.0

Proposes Grid-SD2E, a theoretical grid-feedback cognitive learning system combining grid-cell inspiration with Bayesian reasoning for self-reinforcing interaction.

ArrowFlow: Hierarchical Machine Learning in the Space of Permutations

cs.LG · 2026-04-05

citing papers explorer

Showing 17 of 17 citing papers.

Training Non-Differentiable Networks via Optimal Transport cs.LG · 2026-05-03 · unverdicted · none · ref 26
PolyStep optimizes non-differentiable networks via forward-only polytope evaluations and optimal-transport barycentric updates, reaching 93.4% accuracy on hard-LIF spiking networks while outperforming gradient-free baselines.
NeuroTrain: Surveying Local Learning Rules for Spiking Neural Networks with an Open Benchmarking Framework cs.NE · 2026-05-14 · unverdicted · none · ref 109
A taxonomy of SNN training algorithms is presented with the release of NeuroTrain, an open benchmarking framework for reproducible comparisons across datasets and architectures.
Selectivity and Shape in the Design of Forward-Forward Goodness Functions cs.LG · 2026-03-28 · unverdicted · none · ref 6
Shape- and peak-sensitive goodness functions for Forward-Forward deliver up to 72pp gains over sum-of-squares, reaching 98.2% on MNIST and 89% on Fashion-MNIST.
Self-supervised local learning rules learn the hidden hierarchical structure of high-dimensional data cs.LG · 2026-05-18 · unverdicted · none · ref 7
Layerwise self-supervised local rules learn the hierarchical structure of the Random Hierarchy Model as data-efficiently as supervised backpropagation, while direct feedback approximations fail due to missing masking nonlinearities.
Adaptive Multi-Scale Goodness Aggregation for Forward-Forward Learning cs.LG · 2026-05-11 · unverdicted · none · ref 8
AMSGA extends Forward-Forward learning via multi-scale goodness aggregation, curriculum-guided hard negative mining, and adaptive thresholds, reporting up to 1.5% accuracy gains on MNIST and Fashion-MNIST.
Cumulative-Goodness Free-Riding in Forward-Forward Networks: Real, Repairable, but Not Accuracy-Dominant cs.LG · 2026-05-07 · unverdicted · none · ref 12
Cumulative-goodness Forward-Forward networks exhibit layer free-riding where discrimination gradients decay exponentially with prior positive margins; per-block, hardness-gated, and depth-scaled remedies yield 4-45x better layer separation but <1% accuracy change on CIFAR and Tiny ImageNet.
A Synthesizable RTL Implementation of Predictive Coding Networks cs.NE · 2026-03-18 · unverdicted · none · ref 5
A complete RTL substrate executes discrete-time predictive coding dynamics directly in hardware with fixed local rules and adjacent-layer communication only.
Low-rank surrogate modeling and stochastic zero-order optimization for training of neural networks with black-box layers cs.LG · 2025-09-18 · unverdicted · none · ref 35
A framework combining stochastic zeroth-order optimization and dynamic low-rank surrogate modeling with an implicit projector-splitting integrator enables end-to-end training of hybrid neural networks containing black-box physical layers and reaches near-digital accuracy on vision, audio, and text任务
MemFlow: A Lightweight Forward Memorizing Framework for Quick Domain Adaptive Feature Mapping cs.NE · 2024-02-04 · unverdicted · none · ref 19
MemFlow uses forward-only memorization via randomly connected neurons and spiking signals on a frozen backbone for unsupervised domain adaptation, claiming up to 10% gains at under 1% of traditional compute costs.
Some[Body] Must Receive That Pain for Agent Accountability cs.CY · 2026-05-16 · unverdicted · none · ref 76
AI agents lack the persistent identity and feedback mechanisms needed for consequence reception, requiring new architectures or continued human accountability.
Position: Zeroth-Order Optimization in Deep Learning Is Underexplored, Not Underpowered cs.LG · 2026-05-15 · unverdicted · none · ref 112
Zeroth-order optimization is underexplored rather than underpowered in deep learning, with limitations stemming from full-space designs that can be addressed via subspace, spectral, and systems-aware approaches.
FAAST: Forward-Only Associative Learning via Closed-Form Fast Weights for Test-Time Supervised Adaptation cs.LG · 2026-05-06 · unverdicted · none · ref 8 · 2 links
FAAST performs test-time supervised adaptation by analytically deriving fast weights from examples in one forward pass, matching backprop performance with over 90% less adaptation time and up to 95% memory savings versus memory-based methods.
Physical Foundation Models: Fixed hardware implementations of large-scale neural networks cs.LG · 2026-04-30 · unverdicted · none · ref 107
Physical Foundation Models are fixed physical hardware realizations of foundation-scale neural networks that compute via inherent material dynamics, potentially delivering orders-of-magnitude gains in energy efficiency, speed, and density over digital systems.
Introducing Echo Networks for Computational Neuroevolution cs.LG · 2026-04-09 · unverdicted · none · ref 20
Echo Networks are recurrent networks defined by a single connection matrix with no layers, enabling matrix-based mutation and recombination in neuroevolution, and demonstrated on ECG signal classification.
LightTune: Lightweight Forward-Only Online Fine-Tuning with Applications to Link Adaptation cs.NI · 2026-04-14 · unverdicted · none · ref 9 · 2 links
LightTune is a backpropagation-free online fine-tuning framework that reduces BLER prediction error by up to 48.8% and improves throughput by 15.5% in 6G link adaptation.
Grid-SD2E: A General Grid-Feedback in a System for Cognitive Learning cs.AI · 2023-04-04 · unverdicted · none · ref 61
Proposes Grid-SD2E, a theoretical grid-feedback cognitive learning system combining grid-cell inspiration with Bayesian reasoning for self-reinforcing interaction.
ArrowFlow: Hierarchical Machine Learning in the Space of Permutations cs.LG · 2026-04-05 · unreviewed · ref 3

The forward-forward algorithm: Some preliminary investigations

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer