Few-shot Learning for Named Entity Recognition in Medical Text

Alejo Nevado-Holgado; Andrey Kormilitzin; Maximilian Hofer; Paul Goldberg

arxiv: 1811.05468 · v1 · pith:2UYLM6F6new · submitted 2018-11-13 · 💻 cs.CL · cs.LG · stat.ML


classification 💻 cs.CL · cs.LG · stat.ML

keywords annotated · examples · models · state-of-the-art · achievable · entity · gains · medical


Deep neural network models have recently achieved state-of-the-art performance gains in a variety of natural language processing (NLP) tasks (Young, Hazarika, Poria, & Cambria, 2017). However, these gains rely on the availability of large amounts of annotated examples, without which state-of-the-art performance is rarely achievable. This is especially inconvenient for the many NLP fields where annotated examples are scarce, such as medical text. To improve NLP models in this situation, we evaluate five improvements on named entity recognition (NER) tasks when only ten annotated examples are available: (1) layer-wise initialization with pre-trained weights, (2) hyperparameter tuning, (3) combining pre-training data, (4) custom word embeddings, and (5) optimizing out-of-vocabulary (OOV) words. Experimental results show that the F1 score of 69.3% achievable by state-of-the-art models can be improved to 78.87%.
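For readers unfamiliar with the metric quoted above, the following is a minimal sketch of how an F1 score for NER can be computed. It assumes exact-match scoring over (start, end, label) spans, which is a common convention but is not confirmed by the abstract as the authors' evaluation protocol. The function name `entity_f1` and the toy spans are hypothetical and used only for illustration.

```python
def entity_f1(gold, predicted):
    """Return (precision, recall, F1) under exact span-and-label matching.

    Assumption: an entity counts as correct only if its start, end and
    label all match a gold entity. The paper may score differently.
    """
    gold, predicted = set(gold), set(predicted)
    true_positives = len(gold & predicted)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0.0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical toy data: three gold entities, one predicted with a wrong label.
gold_spans = [(0, 2, "DRUG"), (5, 6, "DOSE"), (9, 11, "DISEASE")]
predicted_spans = [(0, 2, "DRUG"), (5, 6, "DRUG"), (9, 11, "DISEASE")]

precision, recall, f1 = entity_f1(gold_spans, predicted_spans)
print(f"precision={precision:.3f} recall={recall:.3f} f1={f1:.3f}")
```

Under this convention a prediction with the right boundaries but the wrong label counts as both a false positive and a false negative, so it lowers precision and recall together.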



Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Task Decomposition for Efficient Annotation
cs.CL · 2026-06 · unverdicted · novelty 4.0

Decomposing annotation tasks using centers from centering theory reduces aggregate inferential load via a degrees-of-freedom model and enables better sub-task allocation.