pith. sign in

arxiv: cmp-lg/9503025 · v1 · pith:JVFMIJQCnew · submitted 1995-04-01 · cmp-lg · cs.CL

Co-occurrence Vectors from Corpora vs. Distance Vectors from Dictionaries

classification cmp-lg cs.CL
keywords vectorsco-occurrencedistancewordscorporaderiveddictionarycollins
0
0 comments X
read the original abstract

A comparison was made of vectors derived by using ordinary co-occurrence statistics from large text corpora and of vectors derived by measuring the inter-word distances in dictionary definitions. The precision of word sense disambiguation by using co-occurrence vectors from the 1987 Wall Street Journal (20M total words) was higher than that by using distance vectors from the Collins English Dictionary (60K head words + 1.6M definition words). However, other experimental results suggest that distance vectors contain some different semantic information from co-occurrence vectors.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.