
arxiv: 1707.06556 · v1 · pith:EGTDUCCXnew · submitted 2017-07-20 · 💻 cs.CL · cs.LG

High-risk learning: acquiring new word vectors from tiny data

keywords: word, data, model, learn, models, only, task, tiny

Distributional semantics models are known to struggle with small data. It is generally accepted that in order to learn 'a good vector' for a word, a model must have sufficient examples of its usage. This contradicts the fact that humans can guess the meaning of a word from a few occurrences only. In this paper, we show that a neural language model such as Word2Vec only necessitates minor modifications to its standard architecture to learn new terms from tiny data, using background knowledge from a previously learnt semantic space. We test our model on word definitions and on a nonce task involving 2-6 sentences' worth of context, showing a large increase in performance over state-of-the-art models on the definitional task.
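
To make the idea of "background knowledge from a previously learnt semantic space" concrete, here is a minimal sketch of a simple additive baseline: the vector for an unseen word is initialised as the sum of the background vectors of its context words. This is an illustration only, not the modified Word2Vec architecture the abstract describes. The toy vectors and the names `background`, `init_new_vector`, and `nearest` are assumptions made for this example.

```python
import numpy as np

# Toy background space with hypothetical 3-dimensional vectors.
# In practice these would come from a model trained on a large corpus.
background = {
    "cat": np.array([1.0, 0.0, 0.0]),
    "dog": np.array([0.9, 0.1, 0.0]),
    "car": np.array([0.0, 1.0, 0.0]),
    "road": np.array([0.0, 0.9, 0.1]),
}


def init_new_vector(context_words, space):
    """Sum the background vectors of the context words found in `space`."""
    known = [space[w] for w in context_words if w in space]
    if not known:
        raise ValueError("no context word found in the background space")
    return np.sum(known, axis=0)


def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))


def nearest(vec, space, k=2):
    """Return the k background words most similar to `vec`."""
    ranked = sorted(space, key=lambda w: cosine(vec, space[w]), reverse=True)
    return ranked[:k]


# A single short context for an unseen word; "the" is not in the toy space
# and is skipped.
new_vec = init_new_vector(["cat", "dog", "the"], background)
neighbours = nearest(new_vec, background)
```

With these toy values the new vector lands next to the animal words rather than the vehicle words, which is the behaviour one would hope for from a handful of occurrences.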
