Short-term Memory of Deep RNN

Claudio Gallicchio

arxiv: 1802.00748 · v1 · pith:FSFXCJ7Jnew · submitted 2018-02-02 · 💻 cs.LG · cs.AI· math.DS· stat.ML

Short-term Memory of Deep RNN

Claudio Gallicchio This is my paper

classification 💻 cs.LG cs.AImath.DSstat.ML

keywords memorydeeprecurrentreservoirresultsshort-termtowardsabilities

0 comments

read the original abstract

The extension of deep learning towards temporal data processing is gaining an increasing research interest. In this paper we investigate the properties of state dynamics developed in successive levels of deep recurrent neural networks (RNNs) in terms of short-term memory abilities. Our results reveal interesting insights that shed light on the nature of layering as a factor of RNN design. Noticeably, higher layers in a hierarchically organized RNN architecture results to be inherently biased towards longer memory spans even prior to training of the recurrent connections. Moreover, in the context of Reservoir Computing framework, our analysis also points out the benefit of a layered recurrent organization as an efficient approach to improve the memory skills of reservoir models.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Information Processing Capacity of Stationary Physical Systems: Theory, Data-efficient Estimation Methods, and Photonic Demonstration
stat.ML 2026-05 unverdicted novelty 6.0

Extends IPC to stationary physical systems with new bounds, bias analysis, and estimation methods, validated on photonic hardware showing link to ML performance.
Information Processing Capacity of Stationary Physical Systems: Theory, Data-efficient Estimation Methods, and Photonic Demonstration
stat.ML 2026-05 unverdicted novelty 6.0

Extends IPC to stationary physical systems with capacity bounds, asymptotic bias analysis, Richardson/Sobol estimators, and photonic validation linking total IPC to ML performance and effective dimensionality.