Learning to Adapt by Minimizing Discrepancy

· 2017 · cs.LG · arXiv 1711.11542

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

We explore whether useful temporal neural generative models can be learned from sequential data without back-propagation through time. We investigate the viability of a more neurocognitively-grounded approach in the context of unsupervised generative modeling of sequences. Specifically, we build on the concept of predictive coding, which has gained influence in cognitive science, in a neural framework. To do so we develop a novel architecture, the Temporal Neural Coding Network, and its learning algorithm, Discrepancy Reduction. The underlying directed generative model is fully recurrent, meaning that it employs structural feedback connections and temporal feedback connections, yielding information propagation cycles that create local learning signals. This facilitates a unified bottom-up and top-down approach for information transfer inside the architecture. Our proposed algorithm shows promise on the bouncing balls generative modeling problem. Further experiments could be conducted to explore the strengths and weaknesses of our approach.

representative citing papers

A Unified Framework of Online Learning Algorithms for Training Recurrent Neural Networks

cs.LG · 2019-07-05 · accept · novelty 6.0

A framework unifies recent online RNN training algorithms along four axes and demonstrates performance clustering on synthetic tasks, indicating that gradient alignment is insufficient to explain success especially for stochastic methods.

citing papers explorer

Showing 1 of 1 citing paper.

A Unified Framework of Online Learning Algorithms for Training Recurrent Neural Networks cs.LG · 2019-07-05 · accept · none · ref 10 · internal anchor
A framework unifies recent online RNN training algorithms along four axes and demonstrates performance clustering on synthetic tasks, indicating that gradient alignment is insufficient to explain success especially for stochastic methods.

Learning to Adapt by Minimizing Discrepancy

fields

years

verdicts

representative citing papers

citing papers explorer