pith. sign in

On the State of the Art of Evaluation in Neural Language Models

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it
abstract

Ongoing innovations in recurrent neural network architectures have provided a steady influx of apparently state-of-the-art results on language modelling benchmarks. However, these have been evaluated using differing code bases and limited computational resources, which represent uncontrolled sources of experimental variation. We reevaluate several popular architectures and regularisation methods with large-scale automatic black-box hyperparameter tuning and arrive at the somewhat surprising conclusion that standard LSTM architectures, when properly regularised, outperform more recent models. We establish a new state of the art on the Penn Treebank and Wikitext-2 corpora, as well as strong baselines on the Hutter Prize dataset.

citation-role summary

background 1

citation-polarity summary

roles

background 1

polarities

unclear 1

representative citing papers

Reproducibility in Machine Learning for Health

cs.LG · 2019-07-02 · unverdicted · novelty 5.0

Systematic evaluation of over 100 ML4H papers finds poorer reproducibility than other ML fields, driven by limited data and code access, and offers recommendations to data providers, publishers, and researchers.

citing papers explorer

Showing 5 of 5 citing papers.