pith. sign in

A Simple Way to Initialize Recurrent Networks of Rectified Linear Units

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it
abstract

Learning long term dependencies in recurrent networks is difficult due to vanishing and exploding gradients. To overcome this difficulty, researchers have developed sophisticated optimization techniques and network architectures. In this paper, we propose a simpler solution that use recurrent neural networks composed of rectified linear units. Key to our solution is the use of the identity matrix or its scaled version to initialize the recurrent weight matrix. We find that our solution is comparable to LSTM on our four benchmarks: two toy problems involving long-range temporal structures, a large language modeling problem and a benchmark speech recognition problem.

years

2026 4 2019 2

verdicts

UNVERDICTED 6

clear filters

representative citing papers

R-Transformer: Recurrent Neural Network Enhanced Transformer

cs.LG · 2019-07-12 · unverdicted · novelty 6.0

R-Transformer integrates RNNs with multi-head attention to model local and global sequence dependencies without position embeddings and reports large-margin gains over prior methods on diverse tasks.

Geometric Analysis of Variational Quantum Eigensolver

quant-ph · 2026-05-27 · unverdicted · novelty 5.0

Unifies fixed-ansatz and adaptive VQE via ansatz-free product-unitary formulation on the unitary group and derives convergence rates, initialization guarantees, and noise-robust measurement strategies for Riemannian gradient descent.

SAFE Quantum Machine Learning with Variational Quantum Classifiers

cs.LG · 2026-05-15 · unverdicted · novelty 3.0

A variational quantum classifier with normalized amplitude embeddings and bounded observables achieves competitive accuracy with improved robustness and stability over classical baselines in safety-critical settings.

citing papers explorer

Showing 4 of 4 citing papers after filters.