pith. machine review for the scientific record. sign in

arxiv: 1510.02387 · v2 · submitted 2015-10-08 · 💻 cs.CL · cs.LG

Recognition: unknown

Mapping Unseen Words to Task-Trained Embedding Spaces

Authors on Pith no claims yet
classification 💻 cs.CL cs.LG
keywords embeddingswordsembeddinginitialtask-specificdatalearnsupervised
0
0 comments X
read the original abstract

We consider the supervised training setting in which we learn task-specific word embeddings. We assume that we start with initial embeddings learned from unlabelled data and update them to learn task-specific embeddings for words in the supervised training data. However, for new words in the test set, we must use either their initial embeddings or a single unknown embedding, which often leads to errors. We address this by learning a neural network to map from initial embeddings to the task-specific embedding space, via a multi-loss objective function. The technique is general, but here we demonstrate its use for improved dependency parsing (especially for sentences with out-of-vocabulary words), as well as for downstream improvements on sentiment analysis.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.