pith. sign in

Dynamic Evaluation of Neural Sequence Models

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it
abstract

We present methodology for using dynamic evaluation to improve neural sequence models. Models are adapted to recent history via a gradient descent based mechanism, causing them to assign higher probabilities to re-occurring sequential patterns. Dynamic evaluation outperforms existing adaptation approaches in our comparisons. Dynamic evaluation improves the state-of-the-art word-level perplexities on the Penn Treebank and WikiText-2 datasets to 51.1 and 44.3 respectively, and the state-of-the-art character-level cross-entropies on the text8 and Hutter Prize datasets to 1.19 bits/char and 1.08 bits/char respectively.

fields

cs.LG 2

years

2026 1 2019 1

verdicts

UNVERDICTED 2

clear filters

representative citing papers

citing papers explorer

Showing 1 of 1 citing paper after filters.

  • Multiplicative Models for Recurrent Language Modeling cs.LG · 2019-06-30 · unverdicted · none · ref 14 · internal anchor

    New multiplicative RNN models are tested on char-level LM tasks to demonstrate the relevance of shared parametrization for the intermediate state.