Take and Took, Gaggle and Goose, Book and Read: Evaluating the Utility of Vector Differences for Lexical Relation Learning

Ekaterina Vylomova; Laura Rimell; Trevor Cohn; Timothy Baldwin

arxiv: 1509.01692 · v4 · pith:BPR3DZQTnew · submitted 2015-09-05 · 💻 cs.CL

classification 💻 cs.CL

keywords learning, lexical, relations, vector, word, embeddings, relation, differences

abstract

Recent work on word embeddings has shown that simple vector subtraction over pre-trained embeddings is surprisingly effective at capturing different lexical relations, despite lacking explicit supervision. Prior work has evaluated this intriguing result using a word analogy prediction formulation and hand-selected relations, but the generality of the finding over a broader range of lexical relation types and different learning settings has not been evaluated. In this paper, we carry out such an evaluation in two learning settings: (1) spectral clustering to induce word relations, and (2) supervised learning to classify vector differences into relation types. We find that word embeddings capture a surprising amount of information, and that, under suitable supervised training, vector subtraction generalises well to a broad range of relations, including over unseen lexical items.
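
The supervised setting described in the abstract, classifying vector differences into relation types, can be illustrated with a minimal sketch. This is not the authors' implementation: the three-dimensional embeddings are invented toy values, the relation labels `past_tense` and `collective_noun` are hypothetical names, and a nearest-centroid rule over cosine similarity stands in for whatever classifier the paper actually trains. The test pairs use words absent from the training pairs, loosely mirroring the "unseen lexical items" condition.

```python
import numpy as np

# Toy 3-dimensional "embeddings", invented for illustration only.
# Real experiments would load pre-trained vectors instead.
EMB = {
    "take":   np.array([1.0, 0.0, 0.2]),
    "took":   np.array([1.0, 1.0, 0.2]),
    "give":   np.array([0.5, 0.1, 0.9]),
    "gave":   np.array([0.5, 1.1, 0.9]),
    "run":    np.array([0.2, 0.0, 0.4]),
    "ran":    np.array([0.2, 0.9, 0.4]),
    "goose":  np.array([0.0, 0.3, 1.0]),
    "gaggle": np.array([0.8, 0.3, 0.1]),
    "wolf":   np.array([0.1, 0.6, 0.9]),
    "pack":   np.array([0.9, 0.6, 0.0]),
    "fish":   np.array([0.3, 0.2, 0.8]),
    "school": np.array([1.0, 0.2, 0.0]),
}

def offset(w1, w2):
    """Vector difference that represents the relation from w1 to w2."""
    return EMB[w2] - EMB[w1]

def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

# Labelled training pairs (hypothetical relation names).
TRAIN = {
    "past_tense": [("take", "took"), ("give", "gave")],
    "collective_noun": [("goose", "gaggle"), ("wolf", "pack")],
}

# One centroid per relation: the mean of its training offsets.
CENTROIDS = {
    rel: np.mean([offset(a, b) for a, b in pairs], axis=0)
    for rel, pairs in TRAIN.items()
}

def classify(w1, w2):
    """Assign the relation whose centroid is most similar to the pair's offset."""
    d = offset(w1, w2)
    return max(CENTROIDS, key=lambda rel: cosine(d, CENTROIDS[rel]))

# Word pairs that never appear in TRAIN.
print(classify("run", "ran"))
print(classify("fish", "school"))
```

With real embeddings the offsets for a relation are far noisier than in this toy data, which is one reason a trained classifier may be preferable to a fixed centroid rule.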

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Open-SAT: LLM-Guided Query Embedding Refinement for Open-Vocabulary Object Retrieval in Satellite Imagery
cs.CV 2026-05 unverdicted novelty 5.0

Open-SAT refines query embeddings with LLMs to improve open-vocabulary object retrieval in satellite imagery, raising F1 scores by up to 16.04% on three benchmarks while keeping retrieval counts similar.