Feed-Forward Networks with Attention Can Solve Some Long-Term Memory Problems
classification
cs.LG
cs.NE
keywords
attention, feed-forward, long-term, memory, model, networks, problems, solve
abstract
We propose a simplified model of attention which is applicable to feed-forward neural networks and demonstrate that the resulting model can solve the synthetic "addition" and "multiplication" long-term memory problems for sequence lengths which are both longer and more widely varying than the best published results for these tasks.
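The abstract does not spell out the model, so the following is only a minimal sketch of one way attention can be applied without recurrence: each state vector in a sequence is scored independently, the scores are normalized with a softmax over time, and the states are averaged with those weights into a single fixed-length vector. The function name `feed_forward_attention`, the tanh scoring function, and the parameters `w` and `b` are assumptions made for illustration and are not taken from the text above.

```python
import math


def feed_forward_attention(states, w, b):
    """Collapse a variable-length sequence of state vectors into one context vector.

    states: list of T vectors, each a list of D floats
    w, b:   hypothetical scoring parameters (D weights and a scalar bias)
    """
    # Score each time step independently: e_t = tanh(w . h_t + b)
    scores = [math.tanh(sum(wi * hi for wi, hi in zip(w, h)) + b) for h in states]

    # Softmax over time, shifted by the maximum score for numerical stability
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]

    # Context vector: attention-weighted average of the state vectors
    dim = len(states[0])
    context = [sum(a * h[d] for a, h in zip(weights, states)) for d in range(dim)]
    return context, weights


if __name__ == "__main__":
    # Sequences of different lengths map to context vectors of the same size.
    short_seq = [[1.0, 0.0], [0.0, 1.0]]
    long_seq = [[0.5, 0.5]] * 10
    for seq in (short_seq, long_seq):
        context, weights = feed_forward_attention(seq, w=[0.3, -0.2], b=0.0)
        print(len(seq), len(context), round(sum(weights), 6))
```

Because the output size depends only on the state dimension and not on the sequence length, a fixed-size feed-forward layer can follow this step, which is what makes the approach a candidate for tasks with widely varying sequence lengths.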
Forward citations
Cited by 1 Pith paper
- An Attention Mechanism for Musical Instrument Recognition
An attention model improves multi-label instrument recognition accuracy on the weakly labeled OpenMIC dataset compared to baseline, RNN, and fully connected networks across 20 instruments.