Problems With Evaluation of Word Embeddings Using Word Similarity Tasks

Chris Dyer; Manaal Faruqui; Pushpendre Rastogi; Yulia Tsvetkov

arxiv: 1605.02276 · v3 · pith:KNB7DPR3new · submitted 2016-05-08 · 💻 cs.CL

Manaal Faruqui, Yulia Tsvetkov, Pushpendre Rastogi, Chris Dyer

classification 💻 cs.CL

keywords: word, evaluation, similarity, vectors, tasks, methods, problems, associated

Lacking standardized extrinsic evaluation methods for vector representations of words, the NLP community has relied heavily on word similarity tasks as a proxy for intrinsic evaluation of word vectors. Word similarity evaluation, which correlates the distance between vectors with human judgments of semantic similarity, is attractive because it is computationally inexpensive and fast. In this paper, we present several problems associated with the evaluation of word vectors on word similarity datasets and summarize existing solutions. Our study suggests that the use of word similarity tasks for evaluation of word vectors is not sustainable and calls for further research on evaluation methods.
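For readers unfamiliar with the procedure the abstract refers to, a word similarity evaluation can be sketched roughly as follows. This is a minimal illustration, not the authors' code: it assumes cosine similarity as the vector similarity measure and Spearman rank correlation as the agreement statistic, neither of which is specified in the abstract, and the names `evaluate`, `toy_vectors`, and `toy_pairs`, as well as the toy vectors and ratings, are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the rank vectors."""
    return pearson(average_ranks(x), average_ranks(y))


def evaluate(vectors, rated_pairs):
    """Correlate model similarities with human ratings.

    Pairs containing a word without a vector are skipped (an assumption;
    other handling of out-of-vocabulary words is possible).
    """
    model_scores, human_scores = [], []
    for w1, w2, rating in rated_pairs:
        if w1 in vectors and w2 in vectors:
            model_scores.append(cosine(vectors[w1], vectors[w2]))
            human_scores.append(rating)
    return spearman(model_scores, human_scores)


# Hypothetical 2-dimensional vectors and made-up human ratings.
toy_vectors = {
    "cat": [1.0, 0.0],
    "dog": [0.9, 0.1],
    "car": [0.0, 1.0],
    "truck": [0.2, 0.8],
}
toy_pairs = [
    ("cat", "dog", 9.0),
    ("car", "truck", 8.5),
    ("cat", "car", 1.0),
    ("dog", "truck", 2.0),
]

rho = evaluate(toy_vectors, toy_pairs)
print(f"Spearman correlation: {rho:.3f}")
```

The whole evaluation reduces to a single correlation over a list of rated word pairs, which is why it is cheap and fast to run.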

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Low-supervision urgency detection and transfer in short crisis messages
cs.CL 2019-07 unverdicted novelty 4.0

Presents a low-supervision urgency detection system using ensembles and transfer learning that outperforms baselines on multiple disaster datasets.