Neural Architectures for Named Entity Recognition

Guillaume Lample, Miguel Ballesteros, Sandeep Subramanian, Kazuya Kawakami, Chris Dyer

classification cs.CL

keywords corpora, entity, knowledge, learned, models, named, neural, recognition

State-of-the-art named entity recognition systems rely heavily on hand-crafted features and domain-specific knowledge in order to learn effectively from the small, supervised training corpora that are available. In this paper, we introduce two new neural architectures: one based on bidirectional LSTMs and conditional random fields, and another that constructs and labels segments using a transition-based approach inspired by shift-reduce parsers. Our models rely on two sources of information about words: character-based word representations learned from the supervised corpus and unsupervised word representations learned from unannotated corpora. Our models obtain state-of-the-art performance in NER in four languages without resorting to any language-specific knowledge or resources such as gazetteers.
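
To make the transition-based idea concrete, here is a minimal sketch of how a sequence of shift-reduce style actions can build and label segments over a sentence. The abstract does not spell out the action inventory, so the action names (`SHIFT`, `OUT`, `REDUCE`), the function `apply_transitions`, the rule that `OUT` is only allowed when no segment is open, and the example sentence are all assumptions made for illustration. In a learned model a classifier would choose each action from the current state; in this sketch the actions are simply given.

```python
from typing import List, Tuple

# Hypothetical action encoding (an assumption, not taken from the abstract):
#   ("SHIFT",)          move the next buffer word onto the stack
#   ("OUT",)            emit the next buffer word as a non-entity token
#   ("REDUCE", label)   pop the whole stack and emit it as one labeled segment
Action = Tuple[str, ...]
Segment = Tuple[Tuple[str, ...], str]


def apply_transitions(words: List[str], actions: List[Action]) -> List[Segment]:
    """Replay a fixed action sequence and return the labeled segments."""
    buffer = list(words)        # words not yet processed
    stack: List[str] = []       # words of the segment currently being built
    output: List[Segment] = []  # finished segments, in sentence order

    for action in actions:
        name = action[0]
        if name == "SHIFT":
            if not buffer:
                raise ValueError("SHIFT with an empty buffer")
            stack.append(buffer.pop(0))
        elif name == "OUT":
            # Assumed constraint: no OUT while a segment is still open.
            if stack:
                raise ValueError("OUT while a segment is open")
            if not buffer:
                raise ValueError("OUT with an empty buffer")
            output.append(((buffer.pop(0),), "O"))
        elif name == "REDUCE":
            if not stack:
                raise ValueError("REDUCE with an empty stack")
            output.append((tuple(stack), action[1]))
            stack.clear()
        else:
            raise ValueError(f"unknown action: {name}")

    if buffer or stack:
        raise ValueError("action sequence did not consume the whole sentence")
    return output


# Illustrative sentence and action sequence (made up for this sketch).
words = ["Mark", "Watney", "visited", "Mars"]
actions = [
    ("SHIFT",), ("SHIFT",), ("REDUCE", "PER"),
    ("OUT",),
    ("SHIFT",), ("REDUCE", "LOC"),
]
segments = apply_transitions(words, actions)
```

Because a multi-word entity is labeled once, as a whole segment, this formulation differs from tagging each word independently, which is the role the conditional random field plays in the other architecture.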

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

When Active Learning Falls Short: An Empirical Study on Chemical Reaction Extraction
cs.LG 2026-04 unverdicted novelty 5.0

Active learning for chemical reaction extraction frequently produces non-monotonic learning curves and fails to deliver stable gains over random sampling because of strong pretraining, structured CRF decoding, and lab...