arxiv: 1504.06650 · v1 · pith:BHRB6NO4new · submitted 2015-04-24 · 💻 cs.CL · stat.ML

Learning Dictionaries for Named Entity Recognition using Minimal Supervision

keywords: embeddings, candidate, dictionaries, entity, examples, named, phrases, recognition

This paper describes an approach for automatically constructing dictionaries for Named Entity Recognition (NER) from large amounts of unlabeled data and a few seed examples. We use Canonical Correlation Analysis (CCA) to obtain lower-dimensional embeddings (representations) of candidate phrases and classify these phrases using a small number of labeled examples. Our method improves F-1 score over co-training by 16.5% on disease NER and 11.3% on virus NER. We also show that adding candidate phrase embeddings as features in a sequence tagger gives better performance than using word embeddings.
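The classification step can be illustrated with a minimal sketch. It assumes that low-dimensional embeddings for candidate phrases have already been computed (for instance by CCA) and that a handful of positive and negative seed phrases are available. The abstract does not say which classifier is used, so a nearest-centroid rule with cosine similarity is swapped in here purely for illustration. The function names, the toy two-dimensional vectors, and the seed lists are all hypothetical and are not taken from the paper.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def centroid(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def build_dictionary(embeddings, positive_seeds, negative_seeds):
    """Return candidate phrases that lie closer to the positive seeds.

    Assumption: nearest-centroid classification stands in for whatever
    classifier the paper actually trains on the seed examples.
    """
    pos_center = centroid([embeddings[p] for p in positive_seeds])
    neg_center = centroid([embeddings[p] for p in negative_seeds])
    accepted = [
        phrase
        for phrase, vec in embeddings.items()
        if cosine(vec, pos_center) > cosine(vec, neg_center)
    ]
    return sorted(accepted)


# Toy 2-d "embeddings" (hypothetical values, not real CCA output).
embeddings = {
    "malaria": [0.9, 0.1],
    "diabetes": [0.8, 0.2],
    "tuberculosis": [0.85, 0.05],
    "asthma": [0.7, 0.3],
    "the patient": [0.1, 0.9],
    "last week": [0.2, 0.8],
    "clinical trial": [0.15, 0.95],
}

disease_dictionary = build_dictionary(
    embeddings,
    positive_seeds=["malaria", "diabetes"],
    negative_seeds=["the patient", "last week"],
)
print(disease_dictionary)
```

In this toy setup the unlabeled phrases "tuberculosis" and "asthma" join the two positive seeds in the dictionary, while "clinical trial" is rejected. With real data the embeddings would come from the CCA step and the seed sets would be the few labeled examples the abstract mentions.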
