Short-term Memory of Deep RNN
read the original abstract
The extension of deep learning towards temporal data processing is gaining an increasing research interest. In this paper we investigate the properties of state dynamics developed in successive levels of deep recurrent neural networks (RNNs) in terms of short-term memory abilities. Our results reveal interesting insights that shed light on the nature of layering as a factor of RNN design. Noticeably, higher layers in a hierarchically organized RNN architecture results to be inherently biased towards longer memory spans even prior to training of the recurrent connections. Moreover, in the context of Reservoir Computing framework, our analysis also points out the benefit of a layered recurrent organization as an efficient approach to improve the memory skills of reservoir models.
This paper has not been read by Pith yet.
Forward citations
Cited by 2 Pith papers
-
Information Processing Capacity of Stationary Physical Systems: Theory, Data-efficient Estimation Methods, and Photonic Demonstration
Extends IPC to stationary physical systems with new bounds, bias analysis, and estimation methods, validated on photonic hardware showing link to ML performance.
-
Information Processing Capacity of Stationary Physical Systems: Theory, Data-efficient Estimation Methods, and Photonic Demonstration
Extends IPC to stationary physical systems with capacity bounds, asymptotic bias analysis, Richardson/Sobol estimators, and photonic validation linking total IPC to ML performance and effective dimensionality.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.